I am a PhD student in Computer Science at Mila and McGill, where I am supervised by Prof. Dzmitry Bahdanau and Prof. Siva Reddy. Previously, I spent 2.5 amazing years as a predoctoral Research Fellow at Microsoft Research India, where I worked with Dr. Navin Goyal. I also interned with the AllenNLP team at the Allen Institute for Artificial Intelligence (AI2), where I worked with Pradeep Dasigi on evaluating code generation in LLMs.

I do research in Machine Learning, studying Large Language Models (LLMs). My work focuses on building a principled and predictive understanding of how LLMs behave across varying data and training regimes. My goal is to uncover general laws that help explain when and why capabilities emerge, how they transfer or fail out of distribution, and which signals reliably anticipate downstream performance. I aim to leverage these insights for forecasting model behaviour and for informing the design of more reliable LLMs.

News

Jan 12, 2026

Our Thoughtology paper, which investigates the reasoning chains of thought of Large Reasoning Models like DeepSeek-R1, has been published in TMLR!

May 01, 2025

Our paper proposing SafeArena, a benchmark for evaluating the safety of autonomous web agents, has been accepted at ICML 2025!