List: Wizard Models | Curated by Ritvik Rastogi

Apr 26, 2024

3 stories

1 save

Wizard Models

Proposes Reinforcement Learning from Evol-Instruct Feedback (RLEIF) method, applied to Llama-2 to enhance the mathematical reasoning abilities.

Ritvik Rastogi

Papers Explained 129: WizardMath

WizardMath enhances the mathematical reasoning abilities of Llama-2, by applying the proposed Reinforcement Learning from Evol-Instruct…

Apr 26, 2024

Apr 26, 2024

Enhances the performance of the open-source Code LLM, StarCoder, through the application of Code Evol-Instruct.

Ritvik Rastogi

Papers Explained 128: WizardCoder

WizardCoder empowers Code LLMs (specifically StarCoder) with complex instruction fine-tuning, by adapting the Evol-Instruct method to the…

Apr 24, 2024

Apr 24, 2024

Introduces Evol-Instruct, a method to generate large amounts of instruction data with varying levels of complexity using LLM instead of humans to fine tune a Llama model

Ritvik Rastogi

Papers Explained 127: WizardLM

Wizard LM shows an avenue for creating large amounts of instruction data with varying levels of complexity using LLM instead of humans…

Apr 22, 2024

Apr 22, 2024