Telegram Web Link
📚 image InternVL Family: Closing the Gap to Commercial Multimodal Models with Open-Source Suites —— A Pioneering Open-Source Alternative to GPT-4V

🖥 Github: https://github.com/opengvlab/internvl

📕 Paper: https://arxiv.org/abs/2404.16821v1

🔥Dataset: https://paperswithcode.com/dataset/visual-genome

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
📃 A review on graph neural networks for predicting synergistic drug combinations

📕 Journal:  Artificial Intelligence Review (I.F=12)
🗓 Publish year: 2024

🧑‍💻Authors: Milad Besharatifard, Fatemeh Vafaee
👌 University: University of New South Wales (UNSW), Sydney, Australia

📎 Study the paper

Telegram: @Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

🖥 Github: https://github.com/zeroqiaoba/mertools

📕 Paper: https://arxiv.org/abs/2404.17113v1

🔥Dataset: https://paperswithcode.com/dataset/voxceleb2

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
Forwarded from Papers
نفرات ۱ تا ۴ مقاله ی زیر خالی می باشد از دوستان اگر کسی خواست در خدمتیم

Title

Solar Energy Production Forecasting: A Comparative Study of LSTM, Bi-LSTM, and XGBoost Models with Activation Function Analysis



Abstract
This research focuses on the integration of Machine Learning (ML) methodologies and climatic parameters to predict solar panel energy generation, with a specific emphasis on addressing consumption-production imbalances. Leveraging a dataset sourced from the Kaggle platform, the study is conducted in the context of Estonia, aiming to optimize solar energy utilization in this geographic region. The dataset, obtained from Kaggle, encompasses comprehensive information on climatic variables, including sunlight intensity, temperature, and humidity, alongside corresponding solar panel energy output. Through the utilization of machine learning algorithms, such as XGBoost regression and neural networks, our predictive model endeavors to discern intricate patterns and correlations within these datasets. By tailoring the model to Estonia's climatic nuances, we seek to enhance the accuracy of energy production forecasts and, consequently, better manage the challenges associated with consumption-production imbalances. Furthermore, the research investigates the adaptability of the proposed model to diverse climatic conditions, ensuring its applicability for similar endeavors in other geographical locations. By utilizing Kaggle's rich dataset and employing advanced machine learning techniques, this study aims to contribute valuable insights that can inform sustainable energy policies and practices, ultimately promoting a more efficient and reliable renewable energy infrastructure.

Related Fields
Business, Marketing, Industrial Engineering, Computer Engineering.

Candidate Journals
1. Sustainability (5.8 CiteScore, 3.9 Impact Factor)
2. Archives of Computational Methods in Engineering (14.1 CiteScore, 9.7 Impact Factor)
3. Journal of Building Engineering (8.3 CiteScore, 6.4 Impact Factor)

@Raminmousa
@paper4money
@Machine_learn
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing

🖥 Github: https://github.com/wangkai930418/DPL

📕 Paper: https://arxiv.org/abs/2405.01496v1

🔥Dataset: https://neurips.cc/virtual/2023/poster/72801

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
Forwarded from Papers
Title:
Automatic Image Annotation (AIA) of AlmondNet-20 Method for Almond Detection by Improved CNN-based Model
Short title
Machine Learning, Convolutional Neural Networks (CNNs), Image Annotation, Food Industry, Almond, Nuts Detection

Abstract:
In response to the global demand for high-quality agricultural products, especially in the competitive nut market, we present an innovative approach to enhance the grading of almonds and their shells. Leveraging Deep Convolutional Neural Networks (AlmondNet-20), we achieved over 99% accuracy through 20 layers of CNN, employing data augmentation for robust almond-shell differentiation. Our model, trained over 1000 epochs, demonstrated a remarkable accuracy of 99%, with a low loss function of 0.0567. Test evaluations revealed perfect precision, recall, and F1-score for almond detection. This advanced classification system not only boosts grading accuracy but also ensures reliability in distinguishing almonds from shells globally, benefiting both experts and non-experts. The application of deep learning algorithms opens avenues for product patents, contributing to the economic value of our country.
Field
Food Industry, Agricultural Engineering, Industrial Engineering, Computer Engineering.
1. Agronomy (3.7 CiteScore, 5.2 Impact Factor)
2. Biosystems Engineering (10.1 CiteScore, 5.1 Impact Factor)
3. Precision Agriculture (9.9 CiteScore, 6.2 Impact Factor)

@Raminmousa
@Machine_learn
@Paper4money
Please open Telegram to view this post
VIEW IN TELEGRAM
🔥 Say Goodbye to LoRA, Hello to DoRA 🤩🤩

DoRA consistently outperforms LoRA with various tasks (LLM, LVLM, etc.) and backbones (LLaMA, LLaVA, etc.)

[Paper] https://arxiv.org/abs/2402.09353
[Code] https://github.com/NVlabs/DoRA

😄@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
💡 Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers


Github: https://github.com/alpha-vllm/lumina-t2x
Paper: https://arxiv.org/abs/2405.05945
Demo: https://lumina.sylin.host/

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

🖥 Github: https://github.com/935963004/labram

📕Paper: https://arxiv.org/abs/2405.18765v1

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
Images that Sound: Composing Images and Sounds on a Single Canvas

abs: https://arxiv.org/abs/2405.12221
project page: https://ificl.github.io/images-that-sound/
code: https://github.com/IFICL/images-that-sound

This paper introduces an inference-time procedure that generates images that are also spectrograms corresponding to the prompt. It uses a latent image and audio diffusion model with same latent space (Stable Diffusion v1.5 and Auffusion) and denoise the same latent with both.

@Machine_learn
Forwarded from Papers
با عرض سلام این مقاله رو می خواییم برای Nature بفرستیم جایگاه های ۱ تا ۴ اش خالیه از دوستان کسی نیاز داشت در خدمتیم
Title:
Detection of brain tumors from images using the UNet architecture, with a comparative analysis of transfer learning methods and CNNs.
——————————————————————--
Abstract:
Health is crucial for human life, especially brain health, which is vital for all executive functions. Diagnosing brain health issues is often done using magnetic resonance imaging (MRI) devices, which provide critical data for health decision-makers. Images from these devices serve as a significant source of big data for artificial intelligence applications. This big data facilitates high performance in image processing classification problems, a subfield of artificial intelligence. In this study, we aim to classify brain tumors such as glioma, meningioma, and pituitary tumors from brain MRI images using the UNet architecture. To compare the results and gain a better understanding, we also employed Convolutional Neural Networks (CNN) and CNN-based models like Inception-V3, EfficientNetB4, VGG19, along with transfer learning methods for classification tasks. The models were evaluated using F-score, recall, precision, and accuracy metrics. The best accuracy result was achieved with CNN-VGG16, reaching 97%. The same transfer learning model also showed an F-score of 96%, an Area Under the Curve (AUC) value of 98%, a recall value of 98%, and a precision value of 97%. The UNet architecture and CNN-based transfer learning models play a significant role in the early diagnosis and rapid treatment of brain tumors, which is vital for improving patient outcomes.
——————————————————————
Keywords:
Brain tumor detection, UNet, CNN, Transfer Learning.
——————————————————————
Journal:
Scientific Reports

@Raminmousa
@Machine_learn
@paper4money
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

🖥 Github: https://github.com/jacobyhsi/InterpreTabNet

📕Paper: https://arxiv.org/abs/2406.00426v1

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
سلام دوستان حداقل ماين مي كنينن NFT ماين كنين كه يه چيزي گيرتون بياد. به نظرم اساس كوين هارو بخونين بعد ماين كنين. پروژه پايين از تمامي مواردي كه فرستادين برام بهتر بوده.

https://www.tg-me.com/SpinnerCoin_bot/app?startapp=r_280673
🚀 AgentGym: Evolving Large Language Model-based Agents across Diverse Environments

🖥 Github: https://github.com/woooodyy/agentgym

📕 Paper: https://arxiv.org/abs/2406.04151v1

🔥Project: https://agentgym.github.io/

⚡️Model (AgentEvol-7B): https://huggingface.co/AgentGym/AgentEvol-7B

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
2025/07/04 23:23:09
Back to Top
HTML Embed Code: