Ritvik Rastogi

- Papers Explained 159: XLM-RoBERTa (2d ago): XLM-RoBERTa combines RoBERTa techniques with XLM, excluding translation language modelling. Instead, it focuses on masked language…
- Papers Explained 158: XLM (4d ago): XLM is a transformer-based model built by Meta. It extends the approach of generative pretraining to multiple languages and shows the…
- Papers Explained 157: Gemma 2 (6d ago): Gemma 2 is a new addition to the Gemma family with several technical modifications, including interleaving local-global attentions and…
- Papers Explained 156: InstructBLIP (Jun 28): This paper conducts a systematic and comprehensive study on vision-language instruction tuning based on the pretrained BLIP-2 models. 26…
- Papers Explained 155: BLIP-2 (Jun 26): BLIP-2 is a generic and efficient pretraining strategy that bootstraps vision-language pre-training from off-the-shelf frozen pre-trained…
- Papers Explained 154: BLIP (Jun 24): BLIP is a new VLP framework which transfers flexibly to both vision-language understanding and generation tasks. BLIP effectively utilizes…
- Papers Explained 153: CTRL (Jun 21): CTRL is a 1.63 billion-parameter conditional transformer language model, trained to condition on control codes that govern style, content…
- Papers Explained 152: SigLIP (Jun 20): This paper proposes a simple pairwise Sigmoid loss for Language-Image Pre-training (SigLIP). Unlike standard contrastive learning with…
- Papers Explained 151: Aya 23 (Jun 17): Aya 23 is a family of multilingual language models that can serve 23 languages. It is an improvement over the previous model, Aya 101…