Telegram Web Link
​​Uber AI Plug and Play Language Model (PPLM)

PPLM allows a user to flexibly plug in one or more simple attribute models representing the desired control objective into a large, unconditional language modeling (LM). The method has the key property that it uses the LM as is – no training or fine-tuning is required – which enables researchers to leverage best-in-class LMs even if they don't have the extensive hardware required to train them.

PPLM lets users combine small attribute models with an LM to steer its generation. Attribute models can be 100k times smaller than the LM and still be effective in steering it

PPLM algorithm entails three simple steps to generate a sample:
* given a partially generated sentence, compute log(p(x)) and log(p(a|x)) and the gradients of each with respect to the hidden representation of the underlying language model. These quantities are both available using an efficient forward and backward pass of both models;
* use the gradients to move the hidden representation of the language model a small step in the direction of increasing log(p(a|x)) and increasing log(p(x));
* sample the next word

more at paper: https://arxiv.org/abs/1912.02164

blogpost: https://eng.uber.com/pplm/
code: https://github.com/uber-research/PPLM
online demo: https://transformer.huggingface.co/model/pplm
@Machine_learn
#nlp #lm #languagemodeling #uber #pplm
Forwarded from بینام
Practical Computer Vision Applications Using Deep Learning with CNNs — Ahmed Fawzy Gad (en) 2018

@Machine_learn
YOLACT (You Only Look At CoefficienTs) - Real-time Instance Segmentation
Results are impressive, above 30 FPS on COCO test-dev
Forwarded from بینام
Machine Learning and Security (en).pdf
6.4 MB
Machine Learning and Security — C. Chio, D. Freeman (en) 2018
#book #ML
@Machine_learn
Forwarded from بینام
[Jojo_John_Moolayil]_Learn_Keras_for_Deep_Neural_N.pdf
2.7 MB
Learn Keras for Deep Neural Networks: A Fast-Track Approach to Modern Deep Learning with Python

@Machine_learn
@Machine_learn


Learning to See Transparent Objects

ClearGrasp uses 3 neural networks: a network to estimate surface normals, one for occlusion boundaries (depth discontinuities), and one that masks transparent objects

Google research: https://ai.googleblog.com/2020/02/learning-to-see-transparent-objects.html

Code: https://github.com/Shreeyak/cleargrasp

Dataset: https://sites.google.com/view/transparent-objects

3D Shape Estimation of Transparent Objects for Manipulation: https://sites.google.com/view/cleargrasp
Machine learning books and papers pinned «https://www.tg-me.com/Machine_learn»
Deep learning of dynamical attractors from time series measurements

Embed complex time series using autoencoders and a loss function based on penalizing false-nearest-neighbors.

Code: https://github.com/williamgilpin/fnn

Paper: https://arxiv.org/abs/2002.05909
2025/07/08 23:31:46
Back to Top
HTML Embed Code: