📚 image InternVL Family: Closing the Gap to Commercial Multimodal Models with Open-Source Suites —— A Pioneering Open-Source Alternative to GPT-4V
🖥 Github: https://github.com/opengvlab/internvl
📕 Paper: https://arxiv.org/abs/2404.16821v1
🔥Dataset: https://paperswithcode.com/dataset/visual-genome
@Machine_learn
🔥Dataset: https://paperswithcode.com/dataset/visual-genome
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
Please open Telegram to view this post
VIEW IN TELEGRAM
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
🖥 Github: https://github.com/zeroqiaoba/mertools
📕 Paper: https://arxiv.org/abs/2404.17113v1
🔥Dataset: https://paperswithcode.com/dataset/voxceleb2
@Machine_learn
🔥Dataset: https://paperswithcode.com/dataset/voxceleb2
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
Forwarded from Papers
نفرات ۱ تا ۴ مقاله ی زیر خالی می باشد از دوستان اگر کسی خواست در خدمتیم
Title
Solar Energy Production Forecasting: A Comparative Study of LSTM, Bi-LSTM, and XGBoost Models with Activation Function Analysis
Abstract
This research focuses on the integration of Machine Learning (ML) methodologies and climatic parameters to predict solar panel energy generation, with a specific emphasis on addressing consumption-production imbalances. Leveraging a dataset sourced from the Kaggle platform, the study is conducted in the context of Estonia, aiming to optimize solar energy utilization in this geographic region. The dataset, obtained from Kaggle, encompasses comprehensive information on climatic variables, including sunlight intensity, temperature, and humidity, alongside corresponding solar panel energy output. Through the utilization of machine learning algorithms, such as XGBoost regression and neural networks, our predictive model endeavors to discern intricate patterns and correlations within these datasets. By tailoring the model to Estonia's climatic nuances, we seek to enhance the accuracy of energy production forecasts and, consequently, better manage the challenges associated with consumption-production imbalances. Furthermore, the research investigates the adaptability of the proposed model to diverse climatic conditions, ensuring its applicability for similar endeavors in other geographical locations. By utilizing Kaggle's rich dataset and employing advanced machine learning techniques, this study aims to contribute valuable insights that can inform sustainable energy policies and practices, ultimately promoting a more efficient and reliable renewable energy infrastructure.
Related Fields
Business, Marketing, Industrial Engineering, Computer Engineering.
Candidate Journals
1. Sustainability (5.8 CiteScore, 3.9 Impact Factor)
2. Archives of Computational Methods in Engineering (14.1 CiteScore, 9.7 Impact Factor)
3. Journal of Building Engineering (8.3 CiteScore, 6.4 Impact Factor)
@Raminmousa
@paper4money
@Machine_learn
Title
Solar Energy Production Forecasting: A Comparative Study of LSTM, Bi-LSTM, and XGBoost Models with Activation Function Analysis
Abstract
This research focuses on the integration of Machine Learning (ML) methodologies and climatic parameters to predict solar panel energy generation, with a specific emphasis on addressing consumption-production imbalances. Leveraging a dataset sourced from the Kaggle platform, the study is conducted in the context of Estonia, aiming to optimize solar energy utilization in this geographic region. The dataset, obtained from Kaggle, encompasses comprehensive information on climatic variables, including sunlight intensity, temperature, and humidity, alongside corresponding solar panel energy output. Through the utilization of machine learning algorithms, such as XGBoost regression and neural networks, our predictive model endeavors to discern intricate patterns and correlations within these datasets. By tailoring the model to Estonia's climatic nuances, we seek to enhance the accuracy of energy production forecasts and, consequently, better manage the challenges associated with consumption-production imbalances. Furthermore, the research investigates the adaptability of the proposed model to diverse climatic conditions, ensuring its applicability for similar endeavors in other geographical locations. By utilizing Kaggle's rich dataset and employing advanced machine learning techniques, this study aims to contribute valuable insights that can inform sustainable energy policies and practices, ultimately promoting a more efficient and reliable renewable energy infrastructure.
Related Fields
Business, Marketing, Industrial Engineering, Computer Engineering.
Candidate Journals
1. Sustainability (5.8 CiteScore, 3.9 Impact Factor)
2. Archives of Computational Methods in Engineering (14.1 CiteScore, 9.7 Impact Factor)
3. Journal of Building Engineering (8.3 CiteScore, 6.4 Impact Factor)
@Raminmousa
@paper4money
@Machine_learn
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing
🖥 Github: https://github.com/wangkai930418/DPL
📕 Paper: https://arxiv.org/abs/2405.01496v1
🔥Dataset: https://neurips.cc/virtual/2023/poster/72801
@Machine_learn
🔥Dataset: https://neurips.cc/virtual/2023/poster/72801
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
🧬 AlphaFold 3 predicts the structure and interactions of all of life’s molecules
▪Blog: https://blog.google/technology/ai/google-deepmind-isomorphic-alphafold-3-ai-model/
▪Nature: https://www.nature.com/articles/s41586-024-07487-w
▪Two Minute Papers: https://www.youtube.com/watch?v=Mz7Qp73lj9o
@Machine_learn
▪Blog: https://blog.google/technology/ai/google-deepmind-isomorphic-alphafold-3-ai-model/
▪Nature: https://www.nature.com/articles/s41586-024-07487-w
▪Two Minute Papers: https://www.youtube.com/watch?v=Mz7Qp73lj9o
@Machine_learn
Forwarded from Papers
Automatic Image Annotation (AIA) of AlmondNet-20 Method for Almond Detection by Improved CNN-based Model
Machine Learning, Convolutional Neural Networks (CNNs), Image Annotation, Food Industry, Almond, Nuts Detection
Abstract:
In response to the global demand for high-quality agricultural products, especially in the competitive nut market, we present an innovative approach to enhance the grading of almonds and their shells. Leveraging Deep Convolutional Neural Networks (AlmondNet-20), we achieved over 99% accuracy through 20 layers of CNN, employing data augmentation for robust almond-shell differentiation. Our model, trained over 1000 epochs, demonstrated a remarkable accuracy of 99%, with a low loss function of 0.0567. Test evaluations revealed perfect precision, recall, and F1-score for almond detection. This advanced classification system not only boosts grading accuracy but also ensures reliability in distinguishing almonds from shells globally, benefiting both experts and non-experts. The application of deep learning algorithms opens avenues for product patents, contributing to the economic value of our country.
Field
Food Industry, Agricultural Engineering, Industrial Engineering, Computer Engineering.
1. Agronomy (3.7 CiteScore, 5.2 Impact Factor)
2. Biosystems Engineering (10.1 CiteScore, 5.1 Impact Factor)
3. Precision Agriculture (9.9 CiteScore, 6.2 Impact Factor)
@Raminmousa
@Machine_learn
@Paper4money
Please open Telegram to view this post
VIEW IN TELEGRAM
🔥 Say Goodbye to LoRA, Hello to DoRA 🤩🤩
DoRA consistently outperforms LoRA with various tasks (LLM, LVLM, etc.) and backbones (LLaMA, LLaVA, etc.)
[Paper] https://arxiv.org/abs/2402.09353
[Code] https://github.com/NVlabs/DoRA
😄 @Machine_learn
DoRA consistently outperforms LoRA with various tasks (LLM, LVLM, etc.) and backbones (LLaMA, LLaVA, etc.)
[Paper] https://arxiv.org/abs/2402.09353
[Code] https://github.com/NVlabs/DoRA
Please open Telegram to view this post
VIEW IN TELEGRAM
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling
🖥 Github: https://github.com/zgmin/snse-cot
📕 Paper: https://paperswithcode.com/dataset/scienceqa
@Machine_learn
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
GitHub
GitHub - zgMin/SNSE-CoT: Official implementation for "Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling"
Official implementation for "Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling" - zgMin/SNSE-CoT
💡 Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
▪Github: https://github.com/alpha-vllm/lumina-t2x
▪Paper: https://arxiv.org/abs/2405.05945
▪Demo: https://lumina.sylin.host/
@Machine_learn
▪Github: https://github.com/alpha-vllm/lumina-t2x
▪Paper: https://arxiv.org/abs/2405.05945
▪Demo: https://lumina.sylin.host/
@Machine_learn
Awesome-Text-to-Video-Generation Awesome
🖥 Github: https://github.com/soraw-ai/awesome-text-to-video-generation
📕 Paper: https://arxiv.org/abs/2405.10674v1
@Machine_learn
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
⚡️ Deblur-GS: 3D Gaussian Splatting from Camera Motion Blurred Images
▪Code: https://github.com/Chaphlagical/Deblur-GS
▪Paper: https://chaphlagical.icu/Deblur-GS/static/paper/Deblur_GS_author_version.pdf
▪Project: https://chaphlagical.icu/Deblur-GS/
@Machine_learn
▪Code: https://github.com/Chaphlagical/Deblur-GS
▪Paper: https://chaphlagical.icu/Deblur-GS/static/paper/Deblur_GS_author_version.pdf
▪Project: https://chaphlagical.icu/Deblur-GS/
@Machine_learn
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI
🖥 Github: https://github.com/935963004/labram
📕 Paper: https://arxiv.org/abs/2405.18765v1
@Machine_learn
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
This media is not supported in your browser
VIEW IN TELEGRAM
Images that Sound: Composing Images and Sounds on a Single Canvas
abs: https://arxiv.org/abs/2405.12221
project page: https://ificl.github.io/images-that-sound/
code: https://github.com/IFICL/images-that-sound
This paper introduces an inference-time procedure that generates images that are also spectrograms corresponding to the prompt. It uses a latent image and audio diffusion model with same latent space (Stable Diffusion v1.5 and Auffusion) and denoise the same latent with both.
@Machine_learn
abs: https://arxiv.org/abs/2405.12221
project page: https://ificl.github.io/images-that-sound/
code: https://github.com/IFICL/images-that-sound
This paper introduces an inference-time procedure that generates images that are also spectrograms corresponding to the prompt. It uses a latent image and audio diffusion model with same latent space (Stable Diffusion v1.5 and Auffusion) and denoise the same latent with both.
@Machine_learn
🔥🔥🔥 YOLOv10: Real-Time End-to-End Object Detection
▪Paper: arxiv.org/pdf/2405.14458
▪Github: https://github.com/THU-MIG/yolov10/
▪Demo :https://huggingface.co/spaces/kadirnar/Yolov10
▪Colab: https://colab.research.google.com/github/roboflow-ai/notebooks/blob/main/notebooks/train-yolov10-object-detection-on-custom-dataset.ipynb#scrollTo=SaKTSzSWnG7s
@machine_learn
▪Paper: arxiv.org/pdf/2405.14458
▪Github: https://github.com/THU-MIG/yolov10/
▪Demo :https://huggingface.co/spaces/kadirnar/Yolov10
▪Colab: https://colab.research.google.com/github/roboflow-ai/notebooks/blob/main/notebooks/train-yolov10-object-detection-on-custom-dataset.ipynb#scrollTo=SaKTSzSWnG7s
@machine_learn
Forwarded from Papers
با عرض سلام این مقاله رو می خواییم برای Nature بفرستیم جایگاه های ۱ تا ۴ اش خالیه از دوستان کسی نیاز داشت در خدمتیم
Title:
Detection of brain tumors from images using the UNet architecture, with a comparative analysis of transfer learning methods and CNNs.
——————————————————————--
Abstract:
Health is crucial for human life, especially brain health, which is vital for all executive functions. Diagnosing brain health issues is often done using magnetic resonance imaging (MRI) devices, which provide critical data for health decision-makers. Images from these devices serve as a significant source of big data for artificial intelligence applications. This big data facilitates high performance in image processing classification problems, a subfield of artificial intelligence. In this study, we aim to classify brain tumors such as glioma, meningioma, and pituitary tumors from brain MRI images using the UNet architecture. To compare the results and gain a better understanding, we also employed Convolutional Neural Networks (CNN) and CNN-based models like Inception-V3, EfficientNetB4, VGG19, along with transfer learning methods for classification tasks. The models were evaluated using F-score, recall, precision, and accuracy metrics. The best accuracy result was achieved with CNN-VGG16, reaching 97%. The same transfer learning model also showed an F-score of 96%, an Area Under the Curve (AUC) value of 98%, a recall value of 98%, and a precision value of 97%. The UNet architecture and CNN-based transfer learning models play a significant role in the early diagnosis and rapid treatment of brain tumors, which is vital for improving patient outcomes.
——————————————————————
Keywords:
Brain tumor detection, UNet, CNN, Transfer Learning.
——————————————————————
Journal:
Scientific Reports
@Raminmousa
@Machine_learn
@paper4money
Title:
Detection of brain tumors from images using the UNet architecture, with a comparative analysis of transfer learning methods and CNNs.
——————————————————————--
Abstract:
Health is crucial for human life, especially brain health, which is vital for all executive functions. Diagnosing brain health issues is often done using magnetic resonance imaging (MRI) devices, which provide critical data for health decision-makers. Images from these devices serve as a significant source of big data for artificial intelligence applications. This big data facilitates high performance in image processing classification problems, a subfield of artificial intelligence. In this study, we aim to classify brain tumors such as glioma, meningioma, and pituitary tumors from brain MRI images using the UNet architecture. To compare the results and gain a better understanding, we also employed Convolutional Neural Networks (CNN) and CNN-based models like Inception-V3, EfficientNetB4, VGG19, along with transfer learning methods for classification tasks. The models were evaluated using F-score, recall, precision, and accuracy metrics. The best accuracy result was achieved with CNN-VGG16, reaching 97%. The same transfer learning model also showed an F-score of 96%, an Area Under the Curve (AUC) value of 98%, a recall value of 98%, and a precision value of 97%. The UNet architecture and CNN-based transfer learning models play a significant role in the early diagnosis and rapid treatment of brain tumors, which is vital for improving patient outcomes.
——————————————————————
Keywords:
Brain tumor detection, UNet, CNN, Transfer Learning.
——————————————————————
Journal:
Scientific Reports
@Raminmousa
@Machine_learn
@paper4money
InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation
🖥 Github: https://github.com/jacobyhsi/InterpreTabNet
📕 Paper: https://arxiv.org/abs/2406.00426v1
@Machine_learn
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
سلام دوستان حداقل ماين مي كنينن NFT ماين كنين كه يه چيزي گيرتون بياد. به نظرم اساس كوين هارو بخونين بعد ماين كنين. پروژه پايين از تمامي مواردي كه فرستادين برام بهتر بوده.
https://www.tg-me.com/SpinnerCoin_bot/app?startapp=r_280673
https://www.tg-me.com/SpinnerCoin_bot/app?startapp=r_280673
Telegram
SpinnerCoin
P2E game powered by TON and based on unique NFT
This media is not supported in your browser
VIEW IN TELEGRAM
🎙 Real-time in-browser speech recognition
▪Сode: https://github.com/xenova/transformers.js/tree/v3/examples/webgpu-whisper
▪Hf: https://huggingface.co/spaces/Xenova/realtime-whisper-webgpu
@Machine_learn
▪Сode: https://github.com/xenova/transformers.js/tree/v3/examples/webgpu-whisper
▪Hf: https://huggingface.co/spaces/Xenova/realtime-whisper-webgpu
@Machine_learn
🚀 AgentGym: Evolving Large Language Model-based Agents across Diverse Environments
🖥 Github: https://github.com/woooodyy/agentgym
📕 Paper: https://arxiv.org/abs/2406.04151v1
🔥Project: https://agentgym.github.io/
⚡️Model (AgentEvol-7B): https://huggingface.co/AgentGym/AgentEvol-7B
@Machine_learn
🔥Project: https://agentgym.github.io/
⚡️Model (AgentEvol-7B): https://huggingface.co/AgentGym/AgentEvol-7B
@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM