I am a Senior Research Scientist at Meta Reality Labs. I completed my PhD at the University of Southern California (USC), Los Angeles under the supervision of Prof. Ram Nevatia. My primary field of research was at the intersection of Computer Vision and Natural Language Processing. Specifically, I focused on grounding language in vision, with an emphasis on videos.
My research broadly lies at the intersection of vision and language, with a focus on grounding
language in images and videos. Such visual-linguistic associations encompass objects, actions, and
their relations, and are important for richer image and video understanding.
Prior to this, I completed my undergraduate from the Department of Electrical Engineering (EE), Indian Institute of Technology Bombay in 2018. I did my BTech project with Prof. Subhasis Chaudhuri on Graph CNN for disease detection using ECG signals.
Throughout my academic journey, I have gained valuable experience through several internships,
including Meta AI, PRIOR@AI2, Wadhwani AI, USC, and Aalto University.