Telegram Web Link
Pixart-Sigma, the first high-quality, transformer-based image generation training framework!

🖥 Github: https://github.com/PixArt-alpha/PixArt-sigma

🔥Demo: https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍3
LongEmbed: Extending Embedding Models for Long Context Retrieval

🖥 Github: https://github.com/dwzhu-pku/longembed

📕 Paper: https://arxiv.org/abs/2404.12096v1

⚡️ Project: https://6img-to-3d.github.io/

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍6
This media is not supported in your browser
VIEW IN TELEGRAM
⚡️ DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing

Microsoft представляет DesignEd it!

Github: https://github.com/design-edit/DesignEdit.git
Paper: https://arxiv.org/abs/2403.14487
Project: https://design-edit.github.io/

@Machine_learn
👍3
docker-jumpstart.pdf
820.6 KB
A quick introduction to Docker.

📓 book

@Machine_learn
3
📚 image InternVL Family: Closing the Gap to Commercial Multimodal Models with Open-Source Suites —— A Pioneering Open-Source Alternative to GPT-4V

🖥 Github: https://github.com/opengvlab/internvl

📕 Paper: https://arxiv.org/abs/2404.16821v1

🔥Dataset: https://paperswithcode.com/dataset/visual-genome

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍1
📃 A review on graph neural networks for predicting synergistic drug combinations

📕 Journal:  Artificial Intelligence Review (I.F=12)
🗓 Publish year: 2024

🧑‍💻Authors: Milad Besharatifard, Fatemeh Vafaee
👌 University: University of New South Wales (UNSW), Sydney, Australia

📎 Study the paper

Telegram: @Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍31
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

🖥 Github: https://github.com/zeroqiaoba/mertools

📕 Paper: https://arxiv.org/abs/2404.17113v1

🔥Dataset: https://paperswithcode.com/dataset/voxceleb2

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍10
Forwarded from Papers
نفرات ۱ تا ۴ مقاله ی زیر خالی می باشد از دوستان اگر کسی خواست در خدمتیم

Title

Solar Energy Production Forecasting: A Comparative Study of LSTM, Bi-LSTM, and XGBoost Models with Activation Function Analysis



Abstract
This research focuses on the integration of Machine Learning (ML) methodologies and climatic parameters to predict solar panel energy generation, with a specific emphasis on addressing consumption-production imbalances. Leveraging a dataset sourced from the Kaggle platform, the study is conducted in the context of Estonia, aiming to optimize solar energy utilization in this geographic region. The dataset, obtained from Kaggle, encompasses comprehensive information on climatic variables, including sunlight intensity, temperature, and humidity, alongside corresponding solar panel energy output. Through the utilization of machine learning algorithms, such as XGBoost regression and neural networks, our predictive model endeavors to discern intricate patterns and correlations within these datasets. By tailoring the model to Estonia's climatic nuances, we seek to enhance the accuracy of energy production forecasts and, consequently, better manage the challenges associated with consumption-production imbalances. Furthermore, the research investigates the adaptability of the proposed model to diverse climatic conditions, ensuring its applicability for similar endeavors in other geographical locations. By utilizing Kaggle's rich dataset and employing advanced machine learning techniques, this study aims to contribute valuable insights that can inform sustainable energy policies and practices, ultimately promoting a more efficient and reliable renewable energy infrastructure.

Related Fields
Business, Marketing, Industrial Engineering, Computer Engineering.

Candidate Journals
1. Sustainability (5.8 CiteScore, 3.9 Impact Factor)
2. Archives of Computational Methods in Engineering (14.1 CiteScore, 9.7 Impact Factor)
3. Journal of Building Engineering (8.3 CiteScore, 6.4 Impact Factor)

@Raminmousa
@paper4money
@Machine_learn
👍72
Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing

🖥 Github: https://github.com/wangkai930418/DPL

📕 Paper: https://arxiv.org/abs/2405.01496v1

🔥Dataset: https://neurips.cc/virtual/2023/poster/72801

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍41
🧬 AlphaFold 3 predicts the structure and interactions of all of life’s molecules


Blog: https://blog.google/technology/ai/google-deepmind-isomorphic-alphafold-3-ai-model/
Nature: https://www.nature.com/articles/s41586-024-07487-w
Two Minute Papers: https://www.youtube.com/watch?v=Mz7Qp73lj9o

@Machine_learn
👍42🔥2
Forwarded from Papers
Title:
Automatic Image Annotation (AIA) of AlmondNet-20 Method for Almond Detection by Improved CNN-based Model
Short title
Machine Learning, Convolutional Neural Networks (CNNs), Image Annotation, Food Industry, Almond, Nuts Detection

Abstract:
In response to the global demand for high-quality agricultural products, especially in the competitive nut market, we present an innovative approach to enhance the grading of almonds and their shells. Leveraging Deep Convolutional Neural Networks (AlmondNet-20), we achieved over 99% accuracy through 20 layers of CNN, employing data augmentation for robust almond-shell differentiation. Our model, trained over 1000 epochs, demonstrated a remarkable accuracy of 99%, with a low loss function of 0.0567. Test evaluations revealed perfect precision, recall, and F1-score for almond detection. This advanced classification system not only boosts grading accuracy but also ensures reliability in distinguishing almonds from shells globally, benefiting both experts and non-experts. The application of deep learning algorithms opens avenues for product patents, contributing to the economic value of our country.
Field
Food Industry, Agricultural Engineering, Industrial Engineering, Computer Engineering.
1. Agronomy (3.7 CiteScore, 5.2 Impact Factor)
2. Biosystems Engineering (10.1 CiteScore, 5.1 Impact Factor)
3. Precision Agriculture (9.9 CiteScore, 6.2 Impact Factor)

@Raminmousa
@Machine_learn
@Paper4money
Please open Telegram to view this post
VIEW IN TELEGRAM
👍1
🔥 Say Goodbye to LoRA, Hello to DoRA 🤩🤩

DoRA consistently outperforms LoRA with various tasks (LLM, LVLM, etc.) and backbones (LLaMA, LLaVA, etc.)

[Paper] https://arxiv.org/abs/2402.09353
[Code] https://github.com/NVlabs/DoRA

😄@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
👍2
💡 Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers


Github: https://github.com/alpha-vllm/lumina-t2x
Paper: https://arxiv.org/abs/2405.05945
Demo: https://lumina.sylin.host/

@Machine_learn
👍1
Please open Telegram to view this post
VIEW IN TELEGRAM
👍1
Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI

🖥 Github: https://github.com/935963004/labram

📕Paper: https://arxiv.org/abs/2405.18765v1

@Machine_learn
Please open Telegram to view this post
VIEW IN TELEGRAM
🔥3👍1
This media is not supported in your browser
VIEW IN TELEGRAM
Images that Sound: Composing Images and Sounds on a Single Canvas

abs: https://arxiv.org/abs/2405.12221
project page: https://ificl.github.io/images-that-sound/
code: https://github.com/IFICL/images-that-sound

This paper introduces an inference-time procedure that generates images that are also spectrograms corresponding to the prompt. It uses a latent image and audio diffusion model with same latent space (Stable Diffusion v1.5 and Auffusion) and denoise the same latent with both.

@Machine_learn
👍2
2025/07/13 21:21:09
Back to Top
HTML Embed Code: