Ritvik Rastogi

Sep 6, 2024

17 stories

Retrieval and Representation Learning

A framework for customizing LLM embeddings that enables substantial dimensionality reduction while maintaining comparable performance.
A framework that adapts Multimodal Large Language Models to produce universal multimodal embeddings by leveraging prompts and single-modality training on text pairs. It demonstrates strong multimodal embedding performance without fine-tuning and eliminates the need for costly multimodal training data collection.
A retrieval model based on PaliGemma that produces high-quality contextualized embeddings solely from images of document pages and employs late interaction, enabling efficient and effective visually rich document retrieval.
Introduces architectural innovations and a training recipe that significantly enhance the performance of LLMs on general-purpose text embedding tasks.
A versatile 1.2B-parameter text embedding model that achieves strong retrieval performance by distilling knowledge from LLMs into a retriever.
A 137M-parameter, open-source English text embedding model with an 8192-token context length that outperforms OpenAI's models on both short- and long-context tasks.
Leverages proprietary LLMs to generate diverse synthetic data for hundreds of thousands of text embedding tasks, which is then used to fine-tune open-source decoder-only LLMs.
A visual representation learning method that leverages generative models to synthesize large-scale curated datasets without relying on any real data.
A simple pairwise sigmoid loss for Language-Image Pre-training that operates solely on image-text pairs, without requiring a global view of pairwise similarities for normalization; this allows scaling to larger batch sizes while also performing better at smaller batch sizes (a loss sketch follows this list).
A family of text embeddings trained in a contrastive manner with weak supervision signals from CCPairs, a curated large-scale text pair dataset.
Encodes information at different granularities, allowing a single embedding to serve as a flexible representation that adapts to multiple downstream tasks with varying computational budgets (a truncation sketch follows this list).
Couples an aggressive residual compression mechanism with a denoised supervision strategy to simultaneously improve the quality and space footprint of late interaction.
A vision system that learns image representations from raw text-image pairs through pre-training, enabling zero-shot transfer to various downstream tasks.
A semi-supervised learning framework that uses unsupervised pre-training followed by supervised fine-tuning and distillation with unlabeled examples.
Introduces a late interaction architecture that adapts deep LMs (in particular, BERT) for efficient retrieval (a MaxSim scoring sketch follows this list).
Shows that retrieval can be practically implemented using dense representations alone, where embeddings are learned from a small number of questions and passages by a simple dual encoder framework.
A simplified framework for contrastive learning that optimizes the composition of data augmentations, introduces a learnable nonlinear transformation between the representation and the contrastive loss, and leverages larger batch sizes and more training steps (a contrastive-loss sketch follows below).
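
The sigmoid-loss sketch referenced above: a minimal NumPy version of a pairwise sigmoid loss over a batch of image-text pairs, assuming both sides are already encoded into L2-normalized embeddings. The scalar temperature and bias values here are illustrative placeholders, not the paper's trained parameters.

```python
import numpy as np

def pairwise_sigmoid_loss(img_emb, txt_emb, t=10.0, b=-10.0):
    """Pairwise sigmoid loss for image-text pre-training (illustrative sketch).

    img_emb, txt_emb: (n, d) L2-normalized embeddings; row i of each forms a positive pair.
    t, b: temperature and bias (learnable scalars in practice, fixed here).
    Every (i, j) pair is scored independently with a sigmoid, so no batch-wide
    softmax normalization is required.
    """
    n = img_emb.shape[0]
    logits = t * img_emb @ txt_emb.T + b        # (n, n) pairwise similarity logits
    labels = 2.0 * np.eye(n) - 1.0              # +1 on the diagonal (matches), -1 elsewhere
    # -log sigmoid(labels * logits) == softplus(-labels * logits), computed stably
    return float(np.sum(np.logaddexp(0.0, -labels * logits)) / n)

# Toy usage with random unit vectors
rng = np.random.default_rng(0)
img = rng.normal(size=(4, 16)); img /= np.linalg.norm(img, axis=1, keepdims=True)
txt = rng.normal(size=(4, 16)); txt /= np.linalg.norm(txt, axis=1, keepdims=True)
print(pairwise_sigmoid_loss(img, txt))
```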
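
The truncation sketch referenced above: with Matryoshka-style nesting, the same vector can be cut to a shorter prefix when compute or storage is tight. A minimal sketch assuming an already-trained nested embedding; the prefix lengths in NESTED_DIMS are illustrative.

```python
import numpy as np

NESTED_DIMS = [64, 128, 256, 512]  # illustrative nested prefix lengths (smallest to full)

def truncate(emb, dim):
    """Keep the first `dim` coordinates of an embedding and re-normalize."""
    sub = emb[..., :dim]
    return sub / np.linalg.norm(sub, axis=-1, keepdims=True)

def nested_cosine(query, doc):
    """Cosine similarity of the same query-document pair at every granularity.

    Because training supervises every prefix, a cheap low-dimensional pass can
    shortlist candidates before the full-length embedding reranks them.
    """
    return {d: float(truncate(query, d) @ truncate(doc, d)) for d in NESTED_DIMS}

rng = np.random.default_rng(0)
q, p = rng.normal(size=512), rng.normal(size=512)
print(nested_cosine(q, p))
```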
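
The MaxSim scoring sketch referenced above: late interaction (used by both the page-image retriever and the BERT-based retriever in this list) keeps one embedding per token and scores a document as the sum, over query tokens, of each token's best match among the document tokens. A minimal sketch assuming per-token embeddings have already been produced and L2-normalized.

```python
import numpy as np

def maxsim_score(query_tokens, doc_tokens):
    """Late-interaction (MaxSim) relevance score.

    query_tokens: (q, d) per-token query embeddings, L2-normalized.
    doc_tokens:   (m, d) per-token document embeddings, L2-normalized.
    """
    sim = query_tokens @ doc_tokens.T        # (q, m) token-to-token similarities
    return float(sim.max(axis=1).sum())      # best doc token per query token, then sum

def rank(query_tokens, encoded_docs):
    """Return document indices sorted by MaxSim score, best first."""
    scores = [maxsim_score(query_tokens, d) for d in encoded_docs]
    return sorted(range(len(encoded_docs)), key=scores.__getitem__, reverse=True)
```

Because document token embeddings can be computed offline, only the cheap matrix product above runs at query time.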
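
The contrastive-loss sketch referenced above: NT-Xent treats two augmented views of the same example as a positive pair and every other example in the batch as a negative. A minimal NumPy sketch assuming the two views have already been encoded and projected to L2-normalized vectors; the temperature value is illustrative.

```python
import numpy as np

def nt_xent_loss(z1, z2, temperature=0.5):
    """Normalized temperature-scaled cross-entropy (NT-Xent) loss.

    z1, z2: (n, d) L2-normalized projections of two augmented views;
    rows i of z1 and z2 come from the same example (a positive pair),
    while the remaining 2n - 2 vectors in the batch act as negatives.
    """
    n = z1.shape[0]
    z = np.concatenate([z1, z2], axis=0)              # (2n, d)
    sim = z @ z.T / temperature                       # (2n, 2n) scaled cosine similarities
    np.fill_diagonal(sim, -np.inf)                    # a view is never contrasted with itself
    pos = np.concatenate([np.arange(n) + n, np.arange(n)])  # index of each row's positive
    log_prob = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))
    return float(-np.mean(log_prob[np.arange(2 * n), pos]))

rng = np.random.default_rng(0)
a = rng.normal(size=(8, 32)); a /= np.linalg.norm(a, axis=1, keepdims=True)
b = rng.normal(size=(8, 32)); b /= np.linalg.norm(b, axis=1, keepdims=True)
print(nt_xent_loss(a, b))
```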