He conducts research at the intersection of Computer Vision, and Natural Language Understanding. He is especially interested in analyzing, and mining useful human insights from enormous amounts of images with associated text.Also, he is interested in big scale visual analytics by learning models that can perform high-level perceptual tasks for applications in social media, urban computing, and everyday activities. More recently, he is also involved in research on fairness and accountability in machine learning applications.
Now he is a tenure-track Assistant Professor in the Department of Computer Science at the University of Virginia where he leads the Vision, Language, and Learning lab. He had also spent time as visiting professor at Adobe Research and as visiting researcher at the Allen Institute for Artificial Intelligence (AI2).