I am a Pre-Doctoral Researcher at
Google DeepMind in the Autonomous Agents team, advised by
Dr. Rishi Saket
and
Dr.
Aravindan Raghuveer. Previously, I worked as a Member of Technical Staff at
Oracle (AI Services) focusing on speech
generation. Before that, I completed my B.Tech. in Computer Science and Engineering from
IIT Ropar (2019-2023) with a
concentration in Artificial Intelligence, where I was fortunate to be advised by
Prof. Shweta Jain,
Prof. Shashi
Shekhar Jha and
Prof. Abhinav Dhall.
My research interests lie in developing principled algorithms that enable agents to generalize and adapt
continually. I am interested in building systems that don't just memorize trajectories but achieve
robust generalization to unseen physics or scenarios. My work anchors in Reinforcement Learning while
actively leveraging insights from optimization and learning theory to overcome methodological
bottlenecks in dynamic and data-scarce environments.
Keywords:
Reinforcement Learning, Deep Learning and Machine Learning
For more details about my background, refer to my
Resume /
CV. I am always happy to discuss
research, potential collaborations, or just chat about ML. Feel free to reach out via
Email or
LinkedIn!
Last Updated: April 2026
Verifiable Problem Generation for Gemini Thinking
Engineered a fully-verifiable curriculum generation engine for improving
thinking in Gemini models.
Contributed 38% of the data in the Gemini thinking data revision, leading to >10%
improvement on downstream tasks like Math and NL Reasoning.
Transformer-based Text-to-Speech (TTS)
Built Oracle's in-house TTS model supporting few-shot voice cloning.
Handled high-fidelity data procurement, distributed training on Slurm clusters,
deployment via Triton inference server, and developer experience.
Blog:
OCI Speech Update
SAMPAN: Tracking District Level Malnutrition
Developed an Android app and Web dashboard used by 1200+ workers to log 50,000+ data points
monthly.
Media Coverage:
The Tribune
Play Store
Video
C=Conference, P=Preprint, *=Equal Contribution
[P.1] Dense and Diverse Goal Coverage in Reinforcement Learning
Sagalpreet Singh, Rishi Saket, Aravindan Raghuveer
Demoed at NeurIPS 2025 (DeepMind Kiosk)
Paper
Demo
[C.2] Learning from Label Proportions and Covariate-shifted Instances
Sagalpreet Singh, Navodita Sharma, Shreyas Havaldar, Rishi
Saket, Aravindan Raghuveer
The 41st Conference on Uncertainty in Artificial Intelligence
UAI 2025
Paper
Poster
[C.1] On Subset Selection of Multiple Humans to Improve Human-AI Team Accuracy
Sagalpreet Singh, Shweta Jain, Shashi Shekhar Jha
The 22nd International Conference on Autonomous Agents and Multiagent Systems
AAMAS 2023 (Oral)
Paper
Poster
IIT Ropar
2019 - 2023
Oracle AI Services
2023 - 2024
Google DeepMind
2024 - Present