LLM-guided Hierarchical Retrieval

LATTICE recasts retrieval as an LLM-driven navigation problem over a semantic scaffold, which keeps search computationally tractable for large corpora.

arXiv · 5 min · Nilesh Gupta, Wei-Cheng Chang, Ngot Bui, Cho-Jui Hsieh, Inderjit S. Dhillon

Scalable In-context Ranking with Generative Models

BlockRank imposes blockwise sparse attention and leverages query-token attention signals for efficient in-context ranking.

NeurIPS 2025 · 2 min · Nilesh Gupta, Chong You, Srinadh Bhojanapalli, Sanjiv Kumar, Inderjit S. Dhillon, Felix Yu

Exploring Design Choices for Building Language-Specific LLMs

This paper examines how adapting LLMs through vocabulary extension and pretraining improves efficiency and performance across languages.

EMNLP 2024 · 1 min · Atula Tejaswi*, Nilesh Gupta*, Eunsol Choi