PinnedI truly appreciate your kind words, It’s an honor to be acknowledged by the author, and I’m really…Jan 31Jan 31
PinnedThanks for the appreciation, Its surreal for me to get acknowledged from the author itself.Feb 1, 2024Feb 1, 2024
Papers Explained 330: Gemini EmbeddingGemini Embedding leverages the power of Gemini to produce highly generalizable embeddings for text spanning numerous languages and textual…2d ago2d ago
Papers Explained 329: Gemma 3Gemma 3 is a multimodal addition to the Gemma family, ranging in scale from 1 to 27 billion parameters. This version introduces vision…3d ago3d ago
Papers Explained 328: LIMOLIMO demonstrates unprecedented performance and efficiency in mathematical reasoning. With merely 817 curated training samples, LIMO…4d ago4d ago
Papers Explained 327: NeoBERTNeoBERT is a next-generation encoder that redefines the capabilities of bidirectional models by integrating state-of-the-art advancements…5d ago5d ago
Papers Explained 326: olmOCRolmOCR is an open-source Python toolkit for processing PDFs into clean, linearized plain text in natural reading order while preserving…6d ago6d ago
Papers Explained 325: Selective Self-to-Supervised Fine-Tuning (S3FT)Selective Self-to-Supervised Fine-Tuning (S3FT) is a fine- tuning approach that first identifies the correct model responses from the…Mar 7Mar 7
Papers Explained 324: Thinking Preference OptimizationThinking Preference Optimization (ThinkPO) utilizes readily available or easily obtainable short CoT reasoning responses as rejected…Mar 6Mar 6
Papers Explained 323: SysGenSysGen is a pipeline for generating system messages with better aligned assistant responses. This is achieved from the supervised…Mar 5Mar 5