I am a Ph.D. candidate at the University of Michigan. My advisor is Prof. Jason Corso in the Department of EECS. I received my bachelor's degree in Engineering from Nanjing University in 2015.
My research focuses on vision-language embedding, such as captioning and question answering, and video action understanding. My work relies heavily on deep learning and general machine learning and optimization algorithms. My most recent efforts are on automatic video understanding; featured projects include the large-scale cooking video dataset YouCook2, grounded video description, and dense video captioning. Previously, I worked on multi-agent reinforcement learning at Nanjing University. I have completed summer research internships at MSR, FAIR, and Salesforce.