Welcome

I am Arka Sadhu, currently a first second third year PhD student at USC. I work with Prof. Ram Nevatia, broadly at the intersection of Computer Vision and Natural Language Processing with a focus on grounding language in vision. Previously, I was an undergraduate student at IIT Bombay.

In my spare time, I maintain Awesome-Grounding which is a curated list of papers in the field of grounding language in vision.

About Me | CV | Github | Email

News and Updates

Sep 2020 I will serve as a reviewer for AAAI21
Aug 2020 Recognized as Outstanding Reviewer at BMVC'20
June 2020 Presented "Video Object Grounding using Semantic Roles in Language Description" at CVPR20. [5min Video].
May 2020 Started internship at AI2 in the Prior group with Ani Kembhavi, Tanmay Gupta, Mark Yatskar.
May 2020 I will serve as a reviewer for BMVC20.
March 2020 Our CVPR20 paper Video Object Grounding using Semantic Roles in Language Description is avaible on [Arxiv] and [Github]
March 2020 Open-sourced a repository with research advices.
Feb 2020 Our paper "Video Object Grounding using Semantic Roles in Language Description" has been accepted to CVRP20. Arxiv/Github coming soon.
Feb 2020 I will serve as a reviewer for ICPR20.
Jan 2020 I will join PRIOR Team (AI2) for an internship in Summer 2020.
Jan 2020 Attended Google LA PhD summit.
Jan 2020 Gave a talk on "Need for Language in Vision" as a part of USC-WiSE seminar. [Slides]
Nov 2019 Attended AI symposium at USC.
Oct 2019 Presented "Zero-Shot Grounding of Objects from Natural Language Queries" at ICCV 2019. [Video], [Slides], [Poster].
Aug 2019 Our paper "Zero-Shot Grounding of Objects from Natural Language Queries" is now available at Arxiv and Github.
Aug 2019 I am serving as a reviewer for WACV'2020.
July 2019 Our paper "Zero-Shot Grounding of Objects from Natural Language Queries" is accepted to ICCV'19 as an Oral paper. Arxiv, github code and a blog post coming soon.
June 2019 Delivered a key-note at Media-Forensics workshop at CVPR'19 summarizing the key-takeaways from the Audio-Visual Fakes Workshop. [Slides]
June 2019 Successfully co-organized the first iteration of the workshop Synthetic-Realities: Audio-Visual Fakes at ICML'19. Live Recording of the workshop is also available.
May 2019 Mentoring Varun Sunar, a Viterbi-India fellow who will be interning at our lab over Summer'19.
April 2019 Presented Rhythmic trajectories in Annenberg Symposium. Joint work with Szilvia Ruszev.
March 2019 Co-organizing a workshop on detecting Audio-Visual Fakes at ICML'19 . The call for papers is out . Submission deadline is 1 May 2019 (anywhere on earth).
Jan 2019 Released subreddit-classification dataset on Kaggle here. Also find corresponding github repository
Dec 2018 Released VCR-Bert: Using BERT for QA on the VCR dataset. Achieves 6% more than the scores reported in the paper.
Oct 2018 Released Awesome Grounding: A collection of papers on visual grounding which came out in the recent years.
Sep 2018 I will be attending the Pytorch Dev Conference at San Fransisco on Oct 2nd.
Aug 2018 I have joined University of Southern California as a PhD student. I am working in Prof. Ram Nevatia's lab in PHE234.
July 2018 I have written a blog on AI courses offered at IITB. Check it out here
June 2018 We placed third in the iFood Challenge. Our implementation is available here
April 2018 I have accepted PhD offer at University of Southern California and I will start in Fall 2018 under Prof. Ram Nevatia