Publications

Research Papers

Deep experience in systems intelligence — autonomous agents, causal reasoning, reinforcement learning, large-scale personalization, and marketplace optimization.

NeurIPSICLRMICCAIKDDWWWSIGIRWSDMRecSysCIKM

Systems Intelligence & Autonomous Agents

Lifelong learning, multi-agent coordination, agent safety, and system ownership — the research foundations for compounding enterprise AI.

Pre-print

System Ownership as a Lifelong Learning Problem: A Formulation for Long-Horizon Software Maintenance(stealth)

P. Trochim, S. Pan, V. Chandela

Pre-print

Informational Individuality and Behavioural Consistency: A Theory Framework of Lifelong Learning for LLM Agents(stealth)

S. Pan, P. Trochim, V. Chandela, R. Mehrotra

OpenReview 2026

How Task Structure Limits Multi-Agent Success: An Information-Theoretic Analysis↗

S. Pan, M. Luo

Pre-print

Programmatic Process Rewards Improve the Reliability of Agent-Safety Reinforcement Learning(stealth)

S. Pan, R. Mehrotra

Pre-print

Mini-uber: Hold-Probe Evaluation for Multi-Regime Agent Tasks(stealth)

S. Pan, R. Mehrotra

Pre-print

Spectrum-Anchored Updates Preserve Plasticity in Continual Learning(stealth)

S. Pan, X. Guan

Pre-print

AutoGT: Distilling High Quality Evaluation Ground Truth from Heterogeneous Sources on Knowledge-Intensive Tasks(stealth)

S. Pan, S. Saket, R. Mehrotra

Pre-print

RCA Playbooks: Probabilistic Reconstruction of Diagnostic Workflows from SQL Query Graphs(stealth)

S. Saket, S. Dhar, R. Mehrotra

Pre-print

Prescriptive Cheatsheets: Structured Artifacts via Evidence-Aware Submodular Synthesis(stealth)

S. Saket, S. Dhar, R. Mehrotra

Pre-print

Semantic Factorization of Analytical SQL Workloads(stealth)

S. Saket, S. Dhar, R. Mehrotra

↑ Back to top

AI-Powered Engineering & Code Intelligence

Intelligent systems that understand codebases, retrieve context, and assist engineers with production-grade code recommendations.

WSDM 2025

Improving FIM Code Completions via Context & Curriculum Based Learning↗

H. Sagtani, R. Mehrotra, B. Liu

RecSys 2024

AI-assisted Coding with Cody: Lessons from Context Retrieval and Evaluation for Code Recommendations↗

J. Hartman, H. Sagtani, J. Tibshirani, R. Mehrotra

↑ Back to top

Intelligent Decision Systems & Reinforcement Learning

Causal reasoning, model-based planning, and bandit optimization — algorithmic foundations for systems that learn from decisions and compound over time.

NeurIPS 2022

Disentangling Causal Effects from Sets of Interventions in the Presence of Unobserved Confounders↗

O. Jeunen, C. Gilligan-Lee, R. Mehrotra, M. Lalmas

ICLR 2022

Evaluating Model-Based Planning and Planner Amortization for Continuous Control↗

A. Byravan, L. Hasenclever, P. Trochim, M. Mirza, A. Ialongo, Y. Tassa, J. Springenberg, A. Abdolmaleki, N. Heess, J. Merel, M. Riedmiller

MICCAI 2025

Teaching Pathology Foundation Models to Accurately Predict Gene Expression with Parameter Efficient Knowledge Transfer↗

S. Pan, J. Chen, M. Secrier

KDD 2020

Bandit based Optimization of Multiple Objectives on a Music Streaming Platform↗

R. Mehrotra, N. Xue, M. Lalmas

KDD 2020

Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions↗

J. McInerney, B. Brost, P. Chandar, R. Mehrotra, B. Carterette

WSDM 2024

Ad-load Balancing via Off-policy Learning in a Content Marketplace↗

H. Sagtani, M. Jhawar, R. Mehrotra, O. Jeunen

WWW 2019

Deriving User- and Content-specific Rewards for Contextual Bandits↗

P. Dragone, R. Mehrotra, M. Lalmas

RecSys 2018

Explore, Exploit, and Explain: Personalizing Explainable Recommendations with Bandits↗

J. McInerney, B. Lacker, S. Hansen, K. Higley, H. Bouchard, A. Gruson, R. Mehrotra

arXiv 2022 (DeepMind)

Semi-analytical Industrial Cooling System Model for Reinforcement Learning↗

Y. Chervonyi, P. Dutta, P. Trochim, O. Voicu, C. Paduraru, C. Qian, E. Karagozler, J. Davis, R. Chippendale, G. Bajaj, S. Witherspoon, J. Luo

↑ Back to top

Large-Scale Personalization & Recommendation

Architecting ML systems at 100M+ user scale — embeddings, ranking, sequencing, and real-time serving across Spotify, ShareChat, and Seekho.

WWW 2025

Dimension Mask Layer: Optimizing Embedding Efficiency for Scalable ID-based Models↗

S. Saket, I. Ihara, V. Sharma, D. Kalim

SIGIR 2024

Monitoring the Evolution of Behavioural Embeddings in Social Media Recommendation↗

S. Saket, O. Jeunen, D. Kalim

FIRE 2023

On Gradient Boosted Decision Trees and Neural Rankers: Short-Video Recommendations at ShareChat↗

O. Jeunen, H. Sagtani, H. Doi, R. Karimov, N. Pokharna, D. Kalim, A. Ustimenko, C. Green, R. Mehrotra, W. Shi

WWW 2023

MEMER — Multimodal Encoder for Multi-signal Early-stage Recommendations↗

M. Agarwal, S. Saket, R. Mehrotra

LERI@RecSys 2023

Formulating Video Watch Success Signals for Recommendations on Short Video Platforms↗

S. Saket, S. Velugoti, R. Mehrotra

CIKM 2023

Exploiting Sequential Music Preferences via Optimisation-Based Sequencing↗

D. Moor, Y. Yuan, R. Mehrotra, Z. Dai, M. Lalmas

RecSys 2020

Contextual and Sequential User Embeddings for Large-Scale Music Recommendation↗

C. Hansen, C. Hansen, L. Maystre, R. Mehrotra, B. Brost, F. Tomasi, M. Lalmas

CIKM 2021

Algorithmic Balancing of Familiarity, Similarity, & Discovery in Music Recommendations↗

R. Mehrotra

WSDM 2021

Shifting Consumption towards Diverse Content on Music Streaming Platforms↗

C. Hansen, R. Mehrotra, C. Hansen, B. Brost, L. Maystre, M. Lalmas

arXiv 2024

Crafting Tomorrow: The Influence of Design Choices on Fresh Content in Social Media Recommendation↗

S. Saket, M. Agarwal, R. Mehrotra

↑ Back to top

Multi-Stakeholder Optimization & Marketplace Intelligence

Balancing competing objectives across users, creators, and platforms — fairness, multi-sided value, and system-level trade-offs.

WWW 2022

Mostra: A Flexible Balancing Framework to Trade-off User, Artist and Platform Objectives for Music Sequencing↗

E. Bugliarello, R. Mehrotra, J. Kirk, M. Lalmas

CIKM 2018

Towards a Fair Marketplace: Counterfactual Evaluation of the Trade-off between Relevance, Fairness & Satisfaction↗

R. Mehrotra, J. McInerney, H. Bouchard, M. Lalmas, F. Diaz

WWW 2019

Jointly Leveraging Intent and Interaction Signals to Predict User Satisfaction with Slate Recommendations↗

R. Mehrotra, M. Lalmas, D. Kenney, T. Lim-Meng, G. Hashemian

AI Magazine 2022

The Multisided Complexity of Fairness in Recommender Systems↗

N. Sonboli, R. Burke, M. Ekstrand, R. Mehrotra

SIGIR 2023

Quantifying and Leveraging User Fatigue for Interventions in Recommender Systems↗

H. Sagtani, M. Jhawar, A. Gupta, R. Mehrotra

RecSys 2020

Inferring the Causal Impact of New Track Releases on Music Recommendation Platforms through Counterfactual Predictions↗

R. Mehrotra, P. Bhattacharya, M. Lalmas

WWW 2020

Algorithmic Effects on the Diversity of Consumption on Spotify↗

A. Anderson, L. Maystre, I. Anderson, R. Mehrotra, M. Lalmas

KDD 2020

Learning with Limited Labels via Momentum Damped & Differentially Weighted Optimization↗

R. Mehrotra, A. Gupta

↑ Back to top

← Back to Home