Publications [Google Scholar]

  • AdaFrame: Adaptive Frame Selection for Fast Video Recognition. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, June, 2019.
  • Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S. Davis
  • The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, June, 2019.
  • Chih-Yao Ma, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira
  • Self-Monitoring Navigation Agent via Auxiliary Progress Estimation. [pdf]
  • International Conference on Learning Representations (ICLR), New Orleans, USA, May, 2019.
  • Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong
  • DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation. [pdf][code]
  • European Conference on Computer Vision (ECCV), Munich, Germany, September, 2018.
  • Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gökhan Uzunbas, Tom Goldstein, Ser Nam Lim, Larry S. Davis
  • BlockDrop: Dynamic Inference Paths in Residual Networks. [pdf][code]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, USA, June, 2018. (Spotlight)
  • Zuxuan Wu*, Tushar Nagarajan*, Abhishek Kumar, Steven Rennie, Larry S. Davis, Kristen Grauman, Rogerio Feris
    (* denotes equal contribution)
  • VITON: An Image-based Virtual Try-on Network. [pdf][code]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, USA, June, 2018. (Spotlight)
  • Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry S. Davis
  • Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks. [pdf]
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Vol. 40, Issue 2, pp. 352-364, 2018.
  • Yu-Gang Jiang, Zuxuan Wu, Jun Wang, Xiangyang Xue, Shih-Fu Chang
  • Fudan-Columbia Video Dataset (FCVID), one of the largest public Web video datasets with manual annotations.
  • Deep Learning for Video Classification and Video Captioning. [pdf]
  • In Frontiers of Multimedia Research, Shih-Fu Chang (Ed.), ACM Morgan & Claypool, New York, NY, USA, pp. 3-29, 2018.
  • Zuxuan Wu, Ting Yao, Yanwei Fu, Yu-Gang Jiang
  • A survey of more than 100 recent papers on video classification and captioning with deep learning.
  • Weakly-Supervised Spatial Context Networks. [pdf]
  • arXiv preprint arXiv:1704.02998
  • Zuxuan Wu, Larry S. Davis, Leonid Sigal
  • Automatic Spatially-aware Fashion Concept Discovery. [pdf]
  • International Conference on Computer Vision (ICCV), Venice, Italy, Oct., 2017.
  • Xintong Han, Zuxuan Wu, Phoenix Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis
  • Learning Fashion Compatibility with Bidirectional LSTMs. [pdf]
  • ACM Multimedia (ACM MM), Mountain View, USA, Oct., 2017.
  • Xintong Han, Zuxuan Wu, Yu-Gang Jiang, Larry S. Davis
  • Harnessing Object and Scene Semantics for Large-Scale Video Understanding. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, USA, June, 2016. (Spotlight)
  • Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal
  • Featured in Tech2 and ACM TechNews
  • Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification. [pdf]
  • ACM Multimedia (ACM MM), Amsterdam, the Netherlands, Oct., 2016. (Oral Paper)
  • Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue
  • Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification. [pdf]
  • ACM Multimedia (ACM MM), Brisbane, Australia, Oct., 2015. (Oral Paper)
  • Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue
  • Achieves 91.3% accuracy on the UCF-101 dataset.
  • Evaluating Two-Stream CNN for Video Classification. [pdf][motion CNN model]
  • ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, China, June, 2015.
  • Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang, Xiangyang Xue
  • Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification. [pdf]
  • ACM Multimedia (ACM MM), Orlando, USA, Nov., 2014. (Oral Paper)
  • Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue


  • Aggregating Frame-level Features for Large-Scale Video Classification.
  • Google Cloud & YouTube-8M Video Understanding Challenge, July 2017. [pdf]
  • Shaoxiang Chen, Xi Wang, Yongyi Tang, Xinpeng Chen, Zuxuan Wu, Yu-Gang Jiang
  • Fourth place among 600 teams
  • Fudan-Huawei at MediaEval 2015: Detecting Violent Scenes and Affective Impact in Movies with Deep Learning.
  • MediaEval 2015 Workshop, Wurzen, Germany, Sept. 2015. [pdf]
  • Qi Dai, Rui-Wei Zhao, Zuxuan Wu, Xi Wang, Zichen Gu, Wenhai Wu, Yu-Gang Jiang
  • Top Performance
  • Challenge Huawei Challenge: Fusing Multimodal Features with Deep Neural Networks for Mobile Video Annotation.
  • IEEE International Conference on Multimedia and Expo (ICME), Chengdu, China, July, 2014. (Grand Challenge Session)
  • Jian Tu, Zuxuan Wu, Qi Dai, Yu-Gang Jiang, Xiangyang Xue
  • Top Accuracy Award
  • Fudan-NJUST at MediaEval 2014: Violent Scenes Detection Using Deep Neural Networks.
  • MediaEval 2014 Workshop, Barcelona, Spain, Oct., 2014. [pdf]
  • Qi Dai, Zuxuan Wu, Yu-Gang Jiang, Xiangyang Xue, Jinhui Tang
  • Top Performance


2017.12  Snap Inc. Fellowship
2016.08  Dean's Fellowship
2015.10  National Graduate Scholarship of China
2015.08  Google Excellent Student Award
2015.07  ACM Student Travel Grant
2014.10  National Graduate Scholarship of China
2014.08  ACM Student Travel Grant
2013.06  Shanghai Outstanding Graduates
2012.12  Shanghai Scholarship