Bio
I am a Pre-Doctoral Researcher at Google DeepMind in the Autonomous Agents team, advised by Dr. Rishi Saket and Dr. Aravindan Raghuveer. Previously, I worked as a Member of Technical Staff at Oracle (AI Services) focusing on speech generation. Before that, I completed my B.Tech. in Computer Science and Engineering from IIT Ropar (2019-2023) with a concentration in Artificial Intelligence, where I was fortunate to be advised by Prof. Shweta Jain, Prof. Shashi Shekhar Jha and Prof. Abhinav Dhall.

My research interests lie in developing principled algorithms that enable agents to generalize and adapt continually. I am interested in building systems that don't just memorize trajectories but achieve robust generalization to unseen physics or scenarios. My work anchors in Reinforcement Learning while actively leveraging insights from optimization and learning theory to overcome methodological bottlenecks in dynamic and data-scarce environments.

Keywords: Reinforcement Learning, Deep Learning and Machine Learning

For more details about my background, refer to my Resume / CV. I am always happy to discuss research, potential collaborations, or just chat about ML. Feel free to reach out via Email or LinkedIn!

Last Updated: April 2026
What's New
  • December, 2025
    Presented work at Google DeepMind's NeurIPS Booth on goal coverage in RL (San Diego).
  • October, 2025
    Our paper "Dense and Diverse Goal Coverage in Reinforcement Learning" is now on arXiv! [Paper]
  • July, 2025
    Our work on "Learning from Label Proportions and Covariate-shifted Instances" has been accepted to UAI (Rio)! [Paper]
  • December, 2024
    Our paper "Learning from Label Proportions and Covariate-shifted Instances" is now on arXiv! [Paper]
  • August, 2024
    Joined Google DeepMind as a Pre-Doctoral Researcher.
  • July, 2024
    Oracle's flagship TTS model supporting natural voices and voice cloning is live.
  • August, 2023
    Joined Oracle Cloud AI Services - Speech team.
  • May, 2023
    Gave oral presentation at AAMAS about our work on combining human-AI predictions (London).
  • May, 2023
    Graduated from IIT Ropar with a CGPA of 9.31/10 (AI Concentration: 10/10)
  • May, 2023
    Awarded the AAMAS Student Scholarship (£650), Google Research Travel Grant ($2k) and Microsoft Research Travel Grant (₹120k).
  • February, 2023
    Offered Amazon Applied Scientist Internship based on performance in Amazon ML Summer School and interviews (Alexa team).
  • December, 2022
    Top 20 in India in Oppo Inspiration Cup; invited to finale in Hyderabad.
  • August, 2022
    Finished SWE internship at Oracle with a full-time return offer.
  • May, 2022
    Received Best B.Tech. Project Award during Technology Day at IIT Ropar.
  • May, 2022
    Achieved global rank 238 in Google Kickstart Round C.
  • March, 2021
    Achieved All India Rank 18 and Global Rank 198 in Google Hash Code.
Projects

Verifiable Problem Generation for Gemini Thinking
Engineered a fully-verifiable curriculum generation engine for improving thinking in Gemini models. Contributed 38% of the data in the Gemini thinking data revision, leading to >10% improvement on downstream tasks like Math and NL Reasoning.

Transformer-based Text-to-Speech (TTS)
Built Oracle's in-house TTS model supporting few-shot voice cloning. Handled high-fidelity data procurement, distributed training on Slurm clusters, deployment via Triton inference server, and developer experience.
Blog: OCI Speech Update

SAMPAN: Tracking District Level Malnutrition
Developed an Android app and Web dashboard used by 1200+ workers to log 50,000+ data points monthly.
Media Coverage: The Tribune Play Store Video

Publications

C=Conference, P=Preprint, *=Equal Contribution

[P.1] Dense and Diverse Goal Coverage in Reinforcement Learning
Sagalpreet Singh, Rishi Saket, Aravindan Raghuveer
Demoed at NeurIPS 2025 (DeepMind Kiosk) Paper Demo


[C.2] Learning from Label Proportions and Covariate-shifted Instances
Sagalpreet Singh, Navodita Sharma, Shreyas Havaldar, Rishi Saket, Aravindan Raghuveer
The 41st Conference on Uncertainty in Artificial Intelligence
UAI 2025 Paper Poster


[C.1] On Subset Selection of Multiple Humans to Improve Human-AI Team Accuracy
Sagalpreet Singh, Shweta Jain, Shashi Shekhar Jha
The 22nd International Conference on Autonomous Agents and Multiagent Systems
AAMAS 2023 (Oral) Paper Poster

IIT Ropar
2019 - 2023
Oracle AI Services
2023 - 2024
Google DeepMind
2024 - Present