PinnedRitvik RastogiThanks for the appreciation, Its surreal for me to get acknowledged from the author itself.Feb 1Feb 1
Ritvik RastogiPapers Explained 215: Swin Transformer V2Swin Transformer v2 explores large-scale models in computer vision, addressing challenges like training stability, resolution gaps, and…19h ago19h ago
Ritvik RastogiPapers Explained 214: Florence-2While existing large vision models excel in transfer learning, they struggle to perform a diversity of tasks with simple instructions…1d ago1d ago
Ritvik RastogiPapers Explained 213: FlorenceWhile existing vision foundation models such as CLIP focus mainly on mapping images and textual representations to a cross-modal shared…2d ago12d ago1
Ritvik RastogiPapers Explained 212: DataGemmaThis work presents an approach for enhancing the accuracy of LLMs by integrating them with Data Commons, a vast, open-source repository of…3d ago23d ago2
Ritvik RastogiPapers Explained 211: o1OpenAI o1 is a large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers — it can…4d ago14d ago1
Ritvik RastogiPapers Explained 210: MaxViTMax ViT introduces an efficient and scalable attention model called multi-axis attention, consisting of two aspects: blocked local and…Sep 13Sep 13
Ritvik RastogiPapers Explained 209: Minitron Approach in PracticeThis work presents a comprehensive report on compressing the Llama 3.1 8B and Mistral NeMo 12B models to 4B and 8B parameters…Sep 12Sep 12
Ritvik RastogiPapers Explained 208: MinitronThe study investigates whether pruning an existing Large Language Model (LLM) and re-training it with a fraction of the original training…Sep 11Sep 11
Ritvik RastogiPapers Explained 207: Nemotron-4 340BA family of 340B models including a base model, instruct model and a reward model, aimed to benefit in various research studies and…Sep 10Sep 10