I am a first-year Ph.D. student in the Computer Science Department at University of California, Santa Barbara. I am advised by Professor William Y. Wang and affiliated with the UCSB NLP Group. My research interests lie on the intersection of natural language processing, reinforcement learning and deep learning. Before coming to UCSB, I received my B.Sc. from Tsinghua University.
|[Feb 2018]||One paper gets accpeted by CVPR 2018! Check the preprint version [here].|
|[Feb 2018]||One paper gets accpeted by NAACL-HLT 2018! The preprint version will be released soon.|
|[Jan 2018]||I will serve as a student co-chair for SoCal NLP 2018.|
|[Sep 2017]||I will attend Socalml 2017 and NIPS 2017. See you there!|
|[Sep 2017]||Start my exciting Ph.D. journey at UC Santa Barbara!|
Co-training methods exploit predicted labels on the unlabeled data and select samples based on prediction confidence to augment the training. However, the sample selection in existing co-training methods is based on a predetermined policy, which ignores the sampling bias between the unlabeled and the labeled, and fails to explore data space. We propose a reinforcement learning method to select high-quality unlabeled samples to better co-train on.
Reinforced Co-Training (NAACL-HLT 2018)
Although previous work (e.g. sequence-to-sequence model) has shown promising results in abstracting a coarse description of a short video, it is still challenging to caption a video containing multiple fine-grained actions with a detailed description. We propose a novel hierarchical reinforcement learning framework for video captioning, where a high-level Manager module learns to design sub-goals and a low-level Worker module recognizes the primitive actions to fulfill the sub-goal.
Video Captioning via Hierarchical Reinforcement Learning (CVPR 2018)
Copyright © Jiawei Wu 2015-2018