SEE-2-SOUND - a method for generating complex spatial sound based on images and videos
pip install see2sound
GitHub
Hugging Face
Arxiv
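A minimal usage sketch in the spirit of the project's README; the `See2Sound` class, the `config_path` argument, and the `setup`/`run` methods are recalled from the README and may differ, so treat them as assumptions and verify against the GitHub repo:
```python
import see2sound  # pip install see2sound

# Assumed high-level API (names may differ; check the repo README).
model = see2sound.See2Sound(config_path="default_config.yaml")
model.setup()                                               # load model weights
model.run(path="scene.png", output_path="scene_audio.wav")  # image -> spatial audio
```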
@Machine_learn
Seq2Seq: Sequence-to-Sequence Generator
Github: https://github.com/fiy2w/mri_seq2seq
Paper: https://arxiv.org/abs/2407.02911v1
Task: https://paperswithcode.com/task/contrastive-learning
@Machine_learn
Hi everyone. Friends who have a paper and would like to submit it to this journal can do so; as a reviewer there, I can make the introduction.
@Machine_learn
Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling
Github: https://github.com/linghuyuhangyuan/m2s
Paper: https://arxiv.org/abs/2407.05875v1
Task: https://paperswithcode.com/task/denoising
@Machine_learn
LongVA: Long Context Transfer from Language to Vision
Github: https://github.com/EvolvingLMMs-Lab/LongVA
Paper: https://arxiv.org/abs/2406.16852
Project: https://lmms-lab.github.io/posts/longva/
Demo: https://longva-demo.lmms-lab.com/
@Machine_learn
Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation (ECCV 2024)
Github: https://github.com/fanghaook/ovformer
Paper: https://arxiv.org/abs/2407.07427v1
@Machine_learn
Multimodal contrastive learning for spatial gene expression prediction using histology images
Github: https://github.com/modelscope/data-juicer
Paper: https://arxiv.org/abs/2407.08583v1
Dataset: https://paperswithcode.com/dataset/coco
@Machine_learn
An Empirical Study of Mamba-based Pedestrian Attribute Recognition
Github: https://github.com/event-ahu/openpar
Paper: https://arxiv.org/pdf/2407.10374v1.pdf
Dataset: https://paperswithcode.com/dataset/peta
@Machine_learn
Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment
Github: https://github.com/kaistmm/SSLalignment
Paper: https://arxiv.org/abs/2407.13676v1
Dataset: https://paperswithcode.com/dataset/is3-interactive-synthetic-sound-source
@Machine_learn
MG-LLaVA - a multimodal LLM with advanced capabilities for working with visual information
Researchers from Shanghai University recently released MG-LLaVA, a multimodal LLM (MLLM) that extends visual processing with dedicated components for low- and high-resolution inputs.
MG-LLaVA integrates an additional high-resolution visual encoder to capture fine details, then merges those details with the base visual features through a Conv-Gate fusion network (sketched below).
Trained exclusively on publicly available multimodal data, MG-LLaVA achieves strong results.
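A toy PyTorch sketch of the gated low/high-resolution fusion described above. It assumes nothing about MG-LLaVA's real code beyond the idea of a convolutional gate mixing the two feature streams; all layer and variable names here are hypothetical:
```python
import torch
import torch.nn as nn

class ConvGateFusion(nn.Module):
    """Illustrative gated fusion of low- and high-resolution visual features.

    A 1x1 convolution over the concatenated streams produces a gate in [0, 1]
    that controls how much high-resolution detail is mixed into the base
    (low-resolution) features. Not MG-LLaVA's actual implementation.
    """
    def __init__(self, dim: int):
        super().__init__()
        self.gate = nn.Sequential(
            nn.Conv2d(2 * dim, dim, kernel_size=1),
            nn.Sigmoid(),
        )

    def forward(self, low_res: torch.Tensor, high_res: torch.Tensor) -> torch.Tensor:
        # Align the high-res feature map to the low-res grid if shapes differ.
        if high_res.shape[-2:] != low_res.shape[-2:]:
            high_res = nn.functional.interpolate(
                high_res, size=low_res.shape[-2:], mode="bilinear", align_corners=False
            )
        g = self.gate(torch.cat([low_res, high_res], dim=1))  # per-position gate
        return low_res + g * high_res                         # gated residual mix

# Toy shapes: batch of 2, 256-dim features on a 24x24 grid.
fused = ConvGateFusion(256)(torch.randn(2, 256, 24, 24), torch.randn(2, 256, 24, 24))
print(fused.shape)  # torch.Size([2, 256, 24, 24])
```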
MG-LLaVA page
GitHub
@Machine_learn
Dataset: https://paperswithcode.com/dataset/behave
@Machine_learn
EMO-Disentanger
Github: https://github.com/yuer867/emo-disentanger
Paper: https://arxiv.org/abs/2407.20955v1
Dataset: https://paperswithcode.com/dataset/emopia
@Machine_learn
How to Think Like a Computer Scientist: Interactive Edition
https://runestone.academy/ns/books/published/thinkcspy/index.html
@Machine_learn
No learning rates needed: Introducing SALSA - Stable Armijo Line Search Adaptation
Github: https://github.com/themody/no-learning-rates-needed-introducing-salsa-stable-armijo-line-search-adaptation
Paper: https://arxiv.org/abs/2407.20650v1
Dataset: https://paperswithcode.com/dataset/cifar-10
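For background, a sketch of the classic Armijo backtracking line search that SALSA builds on: shrink the step size until the sufficient-decrease condition holds. This shows the textbook deterministic rule, not the paper's stabilized stochastic variant:
```python
import numpy as np

def armijo_line_search(f, grad, x, direction, lr0=1.0, c=1e-4, shrink=0.5, max_backtracks=20):
    """Shrink step t until f(x + t*d) <= f(x) + c * t * <grad(x), d>."""
    fx = f(x)
    slope = np.dot(grad(x), direction)  # negative for a descent direction
    t = lr0
    for _ in range(max_backtracks):
        if f(x + t * direction) <= fx + c * t * slope:
            return t
        t *= shrink
    return t

# Usage on a toy quadratic f(x) = ||x||^2 with the steepest-descent direction.
f = lambda x: float(np.dot(x, x))
grad = lambda x: 2 * x
x = np.array([3.0, -4.0])
t = armijo_line_search(f, grad, x, direction=-grad(x))
print(t, f(x + t * (-grad(x))))  # accepted step and the reduced loss
```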
@Machine_learn
https://research.google/blog/scaling-hierarchical-agglomerative-clustering-to-trillion-edge-graphs/
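For readers unfamiliar with the primitive being scaled: a small hierarchical agglomerative clustering (HAC) example with SciPy. The blog post is about running this same bottom-up merge process on trillion-edge graphs; this sketch only illustrates the algorithm on toy data:
```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

# Two well-separated Gaussian blobs of 20 points each.
rng = np.random.default_rng(0)
points = np.vstack([rng.normal(0, 0.3, (20, 2)), rng.normal(3, 0.3, (20, 2))])

Z = linkage(points, method="average")            # bottom-up merge tree (dendrogram)
labels = fcluster(Z, t=2, criterion="maxclust")  # cut the tree into 2 flat clusters
print(labels)                                    # first 20 points in one cluster, rest in the other
```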
Pixart-Sigma - a high-quality, transformer-based text-to-image generation training framework!
Github: https://github.com/PixArt-alpha/PixArt-sigma
Demo: https://huggingface.co/spaces/PixArt-alpha/PixArt-Sigma
@Machine_learn
Recall-Oriented-CL-Framework
Github: https://github.com/bigdata-inha/recall-oriented-cl-framework
Paper: https://arxiv.org/pdf/2403.03082v1.pdf
Dataset: https://paperswithcode.com/dataset/cifar-10
Tasks: https://paperswithcode.com/task/continual-learning
@Machine_learn