//
sign in
Profile
by @danabra.mov
Profile
by @dansshadow.bsky.social
Profile
by @jimpick.com
AviHandle
by @danabra.mov
AviHandle
by @dansshadow.bsky.social
AviHandle
by @katherine.computer
EventsList
by @katherine.computer
ProfileHeader
by @dansshadow.bsky.social
ProfileHeader
by @danabra.mov
ProfileMedia
by @danabra.mov
ProfilePlays
by @danabra.mov
ProfilePosts
by @danabra.mov
ProfilePosts
by @dansshadow.bsky.social
ProfileReplies
by @danabra.mov
Record
by @atsui.org
Skircle
by @danabra.mov
StreamPlacePlaylist
by @katherine.computer
+ new component
Profile
Loading...
Senior MLE at Meta. Trying to keep up with the Information Retrieval domain! Blog: https://blog.reachsumit.com/ Newsletter: https://recsys.substack.com/
Sumit









Loading...
Do Generative Recommenders Deepen the Information Cocoon? A Closed-Loop Simulation with LLM-powered User Simulators Introduces a framework with LLM-powered user agents to study information cocoons in generative recommenders. 📝 arxiv.org/abs/2606.17707 👨🏽‍💻 github.com/Dregen-Yor/R...
SproutRAG: Attention-Guided Tree Search with Progressive Embeddings for Long-Document RAG Introduces a framework organizing sentence chunks into a binary tree for multi-granularity retrieval without external LLMs. 📝 arxiv.org/abs/2606.18381 👨🏽‍💻 github.com/AmirAbaskohi...
MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval Introduces a framework enriching coarse chunks with topic metadata to improve paragraph-level retrieval efficiency and precision. 📝 arxiv.org/abs/2606.18508 👨🏽‍💻 github.com/AmirAbaskohi...
Lost in a Single Vector: Improving Long-Document Retrieval with Chunk Evidence Aggregation Presents a training-free method to improve long-document dense retrieval by aggregating chunk embeddings into a single vector. 📝 arxiv.org/abs/2606.18781 👨🏽‍💻 github.com/PunchlineAAA...
RankGraph-2: Lifecycle Co-Design for Billion-Node Graph Learning in Recommendation[cite: 2] Meta introduces RankGraph-2, co-designing graph construction, representation learning, and serving for billion-node similarity-based retrieval. 📝 arxiv.org/abs/2606.18379
On the Memorization Behavior of LLMs in Generative Recommendation: Observations, Implications, and Training Strategies Snap shows that LLM recommenders rely heavily on one-hop memorization and teaches them richer relations. 📝 arxiv.org/abs/2606.17276 👨🏽‍💻 github.com/snap-researc...
Querit-Reranker: Training Compact Multilingual Rerankers via Efficient Label-Free Distribution Adaptation Presents a multilingual cross-encoder reranker family trained via a data-centric pipeline for label-efficient adaptation. 📝 arxiv.org/abs/2606.19037 🤗 huggingface.co/Querit/Querit
Rescaling MLM-Head for Neural Sparse Retrieval Finds that pretrained encoders with large MLM-head scales face degradation in sparse retrieval, and introduces a zero-cost rescaling correction to stabilize training. 📝 arxiv.org/abs/2606.18811
RSRank: Learning Relevance from Representational Shifts Adobe introduces a lightweight reranking framework that learns relevance from the representational shift a document induces in a query's internal state, filtering irrelevant content at a natural zero threshold. 📝 arxiv.org/abs/2606.17468
Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search Diversifies the first-turn query across parallel search rollouts to avoid redundant retrieval, improving multi-hop QA at matched compute. 📝 arxiv.org/abs/2606.17209 👨🏽‍💻 github.com/cxcscmu/dive...
1d
6h
6h
6h
6h
1d
6h
6h
1d
1d
Recommender systems alleviate information overload, yet repeated feedback between recommendations and user interactions can reinforce existing preferences and narrow users' exposure, forming informati...
arxiv.org
Do Generative Recommenders Deepen the Information Cocoon? A Closed-Loop Simulation with LLM-powered User Simulators
Retrieval-augmented generation (RAG) systems must balance retrieval granularity with contextual coherence, a challenge that existing methods address through LLM-guided chunking, single-level context e...
arxiv.org
SproutRAG: Attention-Guided Tree Search with Progressive Embeddings for Long-Document RAG
Retrieval-augmented generation (RAG) systems depend critically on how documents are chunked and searched. Fine-grained chunks can improve retrieval precision but expand the search space, increasing la...
arxiv.org
MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval
Dense retrieval ranks one query vector against one document vector. On long documents, this interface can fail when a short but decisive span is weakened during document encoding before ranking. We st...
Lost in a Single Vector: Improving Long-Document Retrieval with Chunk Evidence Aggregation
arxiv.org
Graph-based retrieval at billion-node scale requires jointly solving three tightly coupled problems -- graph construction, representation learning, and real-time serving -- yet existing work addresses...
arxiv.org
RankGraph-2: Lifecycle Co-Design for Billion-Node Graph Learning in Recommendation
Generative recommendation (GR) has emerged as a promising direction for recommender systems. Recently, large language models (LLMs) have been increasingly adopted for GR, as their rich pretrained know...
arxiv.org
On the Memorization Behavior of LLMs in Generative Recommendation: Observations, Implications, and Training Strategies
Deployable multilingual rerankers must generalize across languages, domains, and target ranking tasks while remaining efficient enough for second-stage reranking. However, adapting them to new target ...
arxiv.org
Querit-Reranker: Training Compact Multilingual Rerankers via Efficient Label-Free Distribution Adaptation
Learned sparse retrieval (LSR) models such as SPLADE have traditionally used BERT-style masked language models as backbone encoders. A natural expectation is that replacing BERT with stronger pretrain...
arxiv.org
Rescaling MLM-Head for Neural Sparse Retrieval
As enterprises deploy RAG-based systems to provide grounded responses to user queries, reranking has become a critical component for the final filtering step that separates relevant from distracting o...
arxiv.org
RSRank: Learning Relevance from Representational Shifts
Test-time scaling for agentic search typically increases depth (i.e., more turns and tokens per trajectory) or breadth (i.e., more parallel rollouts). Here we focus on breadth scaling, showing that st...
arxiv.org
Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search
Sumit
Sumit
Sumit
Sumit
Sumit
Sumit
Sumit
Sumit
Sumit
Sumit