|
Sagalpreet Singh
I am a Pre-Doctoral Researcher at Google DeepMind in the Agents team, advised by Dr. Rishi Saket and Dr. Aravindan Raghuveer. Previously, I worked as a Member of Technical Staff at Oracle (AI Services) focusing on speech generation. Before that, I completed my B.Tech. in Computer Science and Engineering from IIT Ropar (2019-2023) with a concentration in Artificial Intelligence.
My research goal is to develop fundamental algorithms that enable agents to generalize and adapt continually. I am interested in building systems that don't just memorize trajectories but achieve robust generalization to unseen physics or scenarios. My work anchors in Reinforcement Learning while actively leveraging insights from optimization and learning theory to overcome methodological bottlenecks in dynamic and data-scarce environments.
Email /
Scholar /
Github /
LinkedIn /
CV
|
News
|
- 2025Presented work at GDM's NeurIPS Booth on goal coverage in RL at San Diego.
- 2025Paper accepted to UAI: "Learning from Label Proportions and Covariate-shifted Instances" at Rio.
- 2024Joined Google DeepMind as a Pre-Doctoral Researcher.
- 2024Oracle's flagship TTS model supporting natural voices and voice cloning is live.
- 2023Joined Oracle Cloud AI Services - Speech team.
- 2023Oral presentation at AAMAS for human-AI team selection work at London.
- 2023Graduated B.Tech in CS (9.31/10) with a Concentration in AI (10/10) from IIT Ropar.
- 2023Offered Amazon Applied Scientist Internship in Alexa team. Declined due to academic commitments.
- 2023Awarded the AAMAS Student Scholarship worth 650 GBP.
- 2023Awarded Google Research Travel Grant worth 2k USD.
- 2023Awarded Microsoft Research Travel Grant worth 120k INR. Declined in favor of Google Research travel grant.
- 2022Among top 20 in India in Oppo Inspiration Cup. Invited on a fully paid trip to Hyderabad for on-site finale.
- 2022Elected as the Academic Council Representative for CS 2019 batch.
- 2022Finished 2 month SWE internship in Oracle Integration Cloud with a full time return offer.
- 2022Received Best B.Tech. Project Award during Technology Day at IIT Ropar for SAMPAN apps.
- 2022Achieved global rank 238 in Google Kickstart Round C.
- 2022Offered Summer Analyst internship at Goldman Sachs. Declined in favor of Oracle.
- 2022Offered Quant Developer internship at Barclays UK. Declined in favor of Oracle.
- 2021Our team achieved an All India Rank 18 and Global Rank 198 in Google Hash Code.
- 2020Elected as the Representative of Coding Club at IIT Ropar.
- 2019Joined IIT Ropar for B.Tech. in Computer Science and Engineering.
- 2019Secured 99.8%ile in JEE Mains and All India Rank 1108 in JEE Advanced.
- 2019National top 0.1% in Senior Secondary exam for Physics, CBSE board.
- 2018National top 1% in National Standard Examination in Physics (NSEP).
- 2018National top 1% in National Standard Examination in Chemistry (NSEC).
- 2018Cleared PRMO and selected for RMO (Regional Mathematics Olympiad).
- 2017Cleared PRMO and selected for RMO (Regional Mathematics Olympiad).
- 2017Awarded National Talent Search Examination (NTSE) scholarship to support educational finances till PhD.
- 2017State top 1% in National Standard Examination in Junior Science (NSEJS).
- 2017Achieved top ranks, scholarships and cash awards in several competitions - link.
|
|
|
Dense and Diverse Goal Coverage in Multi Goal Reinforcement Learning
Sagalpreet Singh,
Rishi Saket,
Aravindan Raghuveer
Google DeepMind Booth @ NeurIPS, 2025
Under Submission
arXiv / demo
Addressed entropy/mode collapse in RL policies by formulating a regularized optimization objective. We utilized a Frank-Wolfe based algorithm to maximize the density and dispersion of marginal state distributions, providing provable performance guarantees and preventing the agent from learning only a subset of goals.
|
|
|
Learning from Label Proportions and Covariate-shifted Instances
Sagalpreet Singh,
Navodita Sharma,
Shreyas Havaldar,
Rishi Saket,
Aravindan Raghuveer
Uncertainty in Artificial Intelligence (UAI), 2025
code /
paper
Developed a novel loss function for domain adaptation where the target domain only offers weakly supervised data (aggregate labels). We proved theoretical guarantees bounding the generalization error, enabling efficient learning of domain-invariant representations.
|
|
|
On Subset Selection of Multiple Humans to Improve Human-AI Team Accuracy
Sagalpreet Singh,
Shweta Jain,
Shashi Shekhar Jha
Autonomous Agents and Multiagent Systems (AAMAS), 2023   (Oral Presentation)
code /
paper
Designed a greedy strategy for optimal subset selection to combine human experts with AI models. We formulated the lower bound maximization as a submodular problem, achieving superior accuracy by identifying diminishing returns in human inputs.
|
Unpublished Research Contributions
|
|
|
Retrieval for Tool Aware Planning
Google DeepMind (Ongoing)
Developing a tool retrieval agent capable of efficiently fetching tools from a large library to solve tasks. This project explores giving agents external memory that can be updated without drastic model weight updates, inspired by Hierarchical RL.
|
|
|
Verifiable Problem Discovery & Unsaturating Benchmarks
Google DeepMind
Investigated autonomous generation of verifiable problems in Math and CS domains using rejection sampling and RL with an aim of recursive self improvement. The generated curricula are used to improve Gemini model performance post-training.
|
Software & Applied AI Projects
|
|
|
Natural Text-to-Speech System
Oracle (OCI AI Services - Speech)
blog
Built Oracle's flagship natural TTS system, enabling real-time generation on CPU and voice cloning with 5-second reference audio. Implemented SSML tag support for fine-grained control over prosody and pitch.
|
|
|
SAMPAN Android App
IIT Ropar
play Store /
news coverage /
video /
blog
Co-developed an app for Anganwadi workers to record and analyze child malnourishment data. Recognized with the Best B.Tech Project Award at IIT Ropar.
|
🏆 Awards
|
- Institute Merit Scholarship, IIT Ropar (Academic Performance)
- Best Poster Presentation Award, Research Scholar's Day 2023, IIT Ropar
- Google (USD 2k) & Microsoft Research (INR 120k) Travel Grants (AAMAS 2023)
- Best B.Tech. Project Award, National Technology Day 2022, IIT Ropar
- NTSE Scholarship, Govt. of India (Top 1000 nationwide)
|
🛠️ Skills
|
- Languages: Python, C/C++, Bash, RISC-V Assembly, SQL, LaTeX
- Frameworks: Jax, PyTorch, Tensorflow, FastAPI, JupyterLab, Triton
- DevOps: Git, Docker, Kubernetes
|
🤝 Service
|
- Reviewer: AISTATS 2026, ACL ARR (May 2025)
- Volunteer: AAMAS 2023
- Problem Setter: Competitive Programming, TechFest, IIT Ropar
|
|