Vansh Kapoor

Graduate Student, Machine Learning @ CMU School of Computer Science

Hey! I’m Vansh Kapoor, a second-year Master’s student in Machine Learning at Carnegie Mellon University. I completed my undergrad with Honors in Electrical Engineering at the Indian Institute of Technology Bombay (IIT Bombay). I have also interned in RL & LLM reasoning @ Amazon AI as an Applied Science (AI) Intern, and in hardware systems @ Google as a Hardware Intern.

I am passionate about developing principled and scalable ML algorithms for domains such as reinforcement learning and generative AI. Specifically, I am interested in research that integrates theoretical insights with practical applications in RL and its extensions to LLM reasoning/RLHF, Generative Modeling, Multi-Agent Learning & Game Theory.

I currently work with Prof. Aviral Kumar. During my undergrad at IIT Bombay, I worked with Prof. Jayakrishnan Nair, Prof. Vivek Borkar and Prof. Nikhil Karamchandani. My research there focused on a special case of discounted-cost Partially Observable Markov Decision Processes (POMDPs). This work earned me the Undergraduate Research Award from IIT Bombay and was submitted as a first-author paper to AISTATS’26.

I love solving brain teasers and puzzles, and I try to fit them into my daily routine. I also enjoy playing table tennis and watching cricket (I'm more of a cricket follower than a player).

If you have similar interests, feel free to drop me an email here. I would love to chat.