Pinned
Ritvik Rastogi: Thanks for the appreciation. It's surreal for me to be acknowledged by the author. (Feb 1)
Articles by Ritvik Rastogi:

Papers Explained 212: DataGemma
This work presents an approach for enhancing the accuracy of LLMs by integrating them with Data Commons, a vast, open-source repository of… (10h ago)

Papers Explained 211: o1
OpenAI o1 is a large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers — it can… (1d ago)

Papers Explained 210: MaxViT
MaxViT introduces an efficient and scalable attention model called multi-axis attention, consisting of two aspects: blocked local and… (4d ago)

Papers Explained 209: Minitron Approach in Practice
This work presents a comprehensive report on compressing the Llama 3.1 8B and Mistral NeMo 12B models to 4B and 8B parameters… (5d ago)

Papers Explained 208: Minitron
The study investigates whether pruning an existing Large Language Model (LLM) and re-training it with a fraction of the original training… (6d ago)

Papers Explained 207: Nemotron-4 340B
A family of 340B models, including a base model, an instruct model, and a reward model, intended to support various research studies and… (Sep 10)

Papers Explained 206: Nemotron-4 15B
Nemotron-4 15B is a large multilingual language model trained on 8T text tokens by Nvidia. It exhibits high downstream accuracies across a… (Sep 9)

Papers Explained 205: LeViT
LeViT is a hybrid neural network for fast-inference image classification. LeViT significantly outperforms existing convnets and vision… (Sep 8)

Papers Explained 204: Matryoshka Adaptor
Matryoshka-Adaptor is a framework designed to customize LLM embeddings for improved computational efficiency and cost-effectiveness. The… (Sep 6)