Jun 7, 2024
3 stories
Enables context extension for large language models, achieving significant computation savings through sparse local attention and parameter-efficient fine-tuning.
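Below is a minimal sketch of the sparse local attention idea: each token attends only within its own fixed-size block, so attention cost grows roughly linearly with sequence length rather than quadratically. This is illustrative only; the function name, block size, and tensor shapes are assumptions, not the exact attention scheme used by any specific paper.

```python
import torch
import torch.nn.functional as F


def block_local_attention(q, k, v, block_size=64):
    """q, k, v: (batch, seq_len, dim); seq_len must be divisible by block_size."""
    b, n, d = q.shape
    nb = n // block_size
    # Reshape so attention is computed independently inside each block.
    q = q.view(b, nb, block_size, d)
    k = k.view(b, nb, block_size, d)
    v = v.view(b, nb, block_size, d)
    scores = torch.matmul(q, k.transpose(-1, -2)) / d ** 0.5   # (b, nb, bs, bs)
    weights = F.softmax(scores, dim=-1)
    out = torch.matmul(weights, v)                              # (b, nb, bs, d)
    return out.view(b, n, d)


if __name__ == "__main__":
    x = torch.randn(2, 256, 32)
    print(block_local_attention(x, x, x).shape)  # torch.Size([2, 256, 32])
```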
Allows efficient fine-tuning of large models on limited GPU memory through innovations such as the 4-bit NormalFloat (NF4) data type, double quantization, and paged optimisers.
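The sketch below shows the block-wise absmax quantization that underlies 4-bit schemes of this kind: weights are split into blocks, each block is scaled by its absolute maximum, and values are rounded to a small set of levels. It uses uniform 4-bit levels rather than the NormalFloat code, and the block size and level count are illustrative assumptions; "double quantization" then refers to further compressing the per-block scales themselves.

```python
import torch


def blockwise_quantize(w, block_size=64, n_levels=16):
    # Flatten and pad so the tensor splits evenly into blocks.
    flat = w.flatten()
    pad = (-flat.numel()) % block_size
    flat = torch.cat([flat, flat.new_zeros(pad)])
    blocks = flat.view(-1, block_size)
    # One scale (absolute maximum) per block.
    absmax = blocks.abs().amax(dim=1, keepdim=True).clamp_min(1e-8)
    # Map each value to one of n_levels signed integer levels.
    q = torch.round(blocks / absmax * (n_levels // 2 - 1)).to(torch.int8)
    return q, absmax


def blockwise_dequantize(q, absmax, n_levels=16):
    return q.float() / (n_levels // 2 - 1) * absmax


if __name__ == "__main__":
    w = torch.randn(1000)
    q, scale = blockwise_quantize(w)
    w_hat = blockwise_dequantize(q, scale).flatten()[: w.numel()]
    print("max reconstruction error:", (w - w_hat).abs().max().item())
```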
Introduces trainable rank decomposition matrices into each layer of a pre-trained Transformer model, significantly reducing the number of trainable parameters for downstream tasks.
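A minimal sketch of such a low-rank adapter on a single linear layer is shown below: the pre-trained weight W is frozen and only a rank-r update B·A is trained, cutting trainable parameters from d_out × d_in to r × (d_in + d_out). The class name, rank, initialization, and scaling factor are illustrative assumptions, not a particular library's API.

```python
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    def __init__(self, d_in, d_out, r=8, alpha=16):
        super().__init__()
        # Frozen pre-trained weight W.
        self.weight = nn.Parameter(torch.randn(d_out, d_in), requires_grad=False)
        # Trainable low-rank factors: A is small random, B starts at zero so the
        # update is zero at initialization.
        self.lora_a = nn.Parameter(torch.randn(r, d_in) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(d_out, r))
        self.scale = alpha / r

    def forward(self, x):
        base = x @ self.weight.T
        update = (x @ self.lora_a.T) @ self.lora_b.T * self.scale
        return base + update


if __name__ == "__main__":
    layer = LoRALinear(512, 512)
    trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
    print("trainable params:", trainable)  # 8192, versus 262144 for the full weight
```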