PinnedI truly appreciate your kind words, It’s an honor to be acknowledged by the author, and I’m really…Jan 31Jan 31
PinnedThanks for the appreciation, Its surreal for me to get acknowledged from the author itself.Feb 1, 2024Feb 1, 2024
Papers Explained 355: OpenMath NemotronThis paper presents a winning submission to the AI Mathematical Olympiad — Progress Prize 2 (AIMO-2) competition.1d ago1d ago
Papers Explained 354: Does RL Incentivize Reasoning Capacity in LLMs Beyond the Base Model?It is widely believed that RLVR enables LLMs to continuously self-improve, thus acquiring novel reasoning abilities that exceed…2d ago2d ago
Papers Explained 353: s1This work curates a small dataset s1K of 1,000 questions paired with reasoning traces relying on three criteria validated through…3d ago3d ago
Papers Explained 352: Skywork-MathThis research investigates the underlying factors that potentially enhance the mathematical reasoning capabilities of large language models…4d ago4d ago
Papers Explained 351: MathFusionMathFusion is a novel framework that enhances mathematical reasoning through cross-problem instruction synthesis. MathFusion implements…5d ago5d ago
Papers Explained 350: GPT 4.5OpenAI GPT-4.5 is the largest and most knowledgeable model yet. Building on GPT-4o, GPT-4.5 scales pre-training further and is designed to…Apr 18Apr 18
Papers Explained 349: ReSearchReSearch is a novel framework that trains LLMs to Reason with Search via reinforcement learning without using any supervised data on…Apr 17Apr 17
Papers Explained 348: ReaderLM-v2A 1.5B language model specialized for efficient web content extraction, transforming HTML into clean Markdown or JSON formats, It utilizes…Apr 16Apr 16