Pinned | Ritvik Rastogi | Thanks for the appreciation. It's surreal for me to be acknowledged by the author themselves. | Feb 1
Ritvik Rastogi | Papers Explained 246: BROS | The main Transformer structure of BROS is the same as BERT. BROS (BERT Relying On Spatiality) encodes relative positions of texts in 2D… | 22h ago
Ritvik Rastogi | Papers Explained 245: Layout Parser | LayoutParser is an open-source library designed to streamline the application of deep learning (DL) in document image analysis (DIA)… | 1d ago
Ritvik Rastogi | Papers Explained Review 06: Parameter Efficient FineTuning | Table of Contents | 2d ago
Ritvik Rastogi | Papers Explained 187e: Quantized Llama 3.2 | Llama 3 is a new set of foundation models, designed for multilinguality, coding, reasoning, and tool usage. | 4d ago
Ritvik Rastogi | Papers Explained 244: Gemma APS | This work focuses on the task of abstractive proposition segmentation: transforming text into simple, self-contained, well-formed… | 5d ago
Ritvik Rastogi | Papers Explained 243: ShieldGemma | ShieldGemma is a comprehensive suite of LLM-based safety content moderation models ranging from 2B to 27B built upon Gemma2. These models… | 6d ago
Ritvik Rastogi | Papers Explained 242: STORM | STORM is a writing system for the Synthesis of Topic Outlines through Retrieval and Multi-perspective Question Asking. STORM models the… | Oct 30
Ritvik Rastogi | Papers Explained 241: Pixmo and Molmo | Molmo (Multimodal Open Language Model) utilizes PixMo (Pixels for Molmo), a high-quality dataset of detailed image captions collected from… | Oct 29
Ritvik Rastogi | Papers Explained 240: NVLM | NVLM 1.0 is a family of multimodal large language models (LLMs) rivaling proprietary and open-access models. Notably, NVLM 1.0 shows… | Oct 28