Publication [Google Scholar]

  • Semi-Supervised Vision Transformers. [pdf][code]
  • European Conference on Computer Vision (ECCV), Tel Aviv, October, 2022.
  • Zejia Weng, Xitong Yang, Ang Li, Zuxuan Wu, Yu-Gang Jiang
  • Efficient Video Transformers with Spatial-Temporal Token Selection. [pdf][code]
  • European Conference on Computer Vision (ECCV), Tel Aviv, October, 2022.
  • Junke Wang, Xitong Yang, Hengduo Li, Li Liu, Zuxuan Wu, Yu-Gang Jiang
  • Semi-Supervised Single-View 3D Reconstruction via Prototype Shape Priors. [pdf][code]
  • European Conference on Computer Vision (ECCV), Tel Aviv, October, 2022.
  • Zhen Xing, Hengduo Li, Zuxuan Wu, Yu-Gang Jiang
  • BEVT: BERT Pretraining of Video Transformers. [pdf][code]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, June, 2022
  • Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Yu-Gang Jiang, Luowei Zhou, Lu Yuan
  • Cross-Modal Transferable Adversarial Attacks from Images to Videos. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, June, 2022
  • Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
  • AdaViT: Adaptive Vision Transformers for Efficient Image Recognition. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, June, 2022
  • Lingchen Meng, Hengduo Li, Bor-Chun Chen, Shiyi Lan, Zuxuan Wu, Yu-Gang Jiang, Ser-Nam Lim
  • ObjectFormer for Image Manipulation Detection and Localization. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, June, 2022
  • Junke Wang, Zuxuan Wu, Jingjing Chen, Xintong Han, Abhinav Shrivastava, Ser-Nam Lim, Yu-Gang Jiang
  • Flag: Adversarial data augmentation for graph neural networks. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, USA, June, 2022
  • Kezhi Kong, Guohao Li, Mucong Ding, Zuxuan Wu, Chen Zhu, Bernard Ghanem, Gavin Taylor, Tom Goldstein
  • Boosting the Transferability of Video Adversarial Examples via Temporal Translation. [pdf]
  • The AAAI Conference on Artificial Intelligence (AAAI), Virtual, Feb., 2022
  • Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
  • Attacking Video Recognition Models with Bullet-Screen Comments. [pdf]
  • The AAAI Conference on Artificial Intelligence (AAAI), Virtual, Feb., 2022
  • Kaichen, Zhipeng Wei, Jingjing Chen, Zuxuan Wu, Yu-Gang Jiang
  • Towards transferable adversarial attacks on vision transformers. [pdf]
  • The AAAI Conference on Artificial Intelligence (AAAI), Virtual, Feb., 2022
  • Zhipeng Wei, Jingjing Chen, Micah Goldblum, Zuxuan Wu, Tom Goldstein, Yu-Gang Jiang
  • Rethinking Pseudo Labels for Semi-Supervised Object Detection. [pdf]
  • The AAAI Conference on Artificial Intelligence (AAAI), Virtual, Feb., 2022
  • Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry Davis
  • Encoding Robustness to Image Style via Adversarial Feature Perturbations. [pdf]
  • Advances in Neural Information Processing Systems (NeurIPS), Virtual, Dec., 2021.
  • Manli Shu, Zuxuan Wu, Micah Goldblum, Tom Goldstein
  • Deep Video Inpainting Detection. [pdf]
  • British Machine Vision Conference (BMVC), Virtual, Oct., 2021
  • Peng Zhou, Ning Yu, Zuxuan Wu, Larry Davis, Abhinav Shrivastava, Ser-Nam Lim
  • GTA: Global Temporal Attention for Video Action Understanding. [pdf]
  • British Machine Vision Conference (BMVC), Virtual, Oct., 2021
  • Bo He, Xitong Yang, Zuxuan Wu, Hao Chen, Ser-Nam Lim, Abhinav Shrivastava
  • VideoLT: Large-scale Long-tailed Video Recognition. [pdf]
  • International Conference on Computer Vision (ICCV), Virtual, Oct., 2021
  • Xing Zhang, Zuxuan Wu, Zejia Weng, Huazhu Fu, Jingjing Chen, Yu-Gang Jiang, Larry Davis
  • Exploring Visual Engagement Signals for Representation Learning. [pdf]
  • International Conference on Computer Vision (ICCV), Virtual, Oct., 2021
  • Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim
  • 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, June, 2021
  • Hengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis
  • Intentonomy: a Dataset and Study towards Human Intent Understanding [pdf][code]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, June, 2021 (Oral)
  • Menglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim
  • Efficient Object Embedding for Manipulated Image Retrieval [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, June, 2021
  • Bor-Chun Chen, Zuxuan Wu, Larry S. Davis, Ser-Nam Lim
  • Making an Invisibility Cloak: Real World Adversarial Attacks on Object Detectors. [pdf][code]
  • European Conference on Computer Vision (ECCV), Virtual, August, 2020.
  • Zuxuan Wu, Ser-Nam Lim, Larry S. Davis, Tom Goldstein
  • Learning from Noisy Anchors for One-stage Object Detection. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, June, 2020
  • Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis
  • LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition. [pdf][code]
  • Advances in Neural Information Processing Systems (NeurIPS), Vancouver, Canada, Dec., 2019.
  • Zuxuan Wu, Caiming Xiong, Yu-Gang Jiang, Larry S. Davis
  • FiNet: Compatible and Diverse Fashion Image Inpainting. [pdf]
  • International Conference on Computer Vision (ICCV), Seoul, Korea, Oct., 2019. (Oral)
  • Xintong Han, Zuxuan Wu, Weilin Huang, Matthew R. Scott, Larry S. Davis
  • ACE: Adapting to Changing Environments for Semantic Segmentation. [pdf]
  • International Conference on Computer Vision (ICCV), Seoul, Korea, Oct., 2019.
  • Zuxuan Wu, Xin Wang, Joseph E. Gonzalez, Tom Goldstein, Larry S. Davis
  • AdaFrame: Adaptive Frame Selection for Fast Video Recognition. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, June, 2019
  • Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S Davis
  • The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, USA, June, 2019.
  • Chih-Yao Ma, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira
  • Visual Content Recognition by Exploiting Semantic Feature Map with Attention and Multi-task Learning.[pdf]
  • ACM Trans. Multimedia Comput. Commun (ACM TOMM), vol. 15, issue 1, pp. 6:1-6:22, 2019.
  • Rui-Wei Zhao, Qi Zhang, Zuxuan Wu, Jianguo Li, Yu-Gang Jiang
  • Self-Monitoring Navigation Agent via Auxiliary Progress Estimation. [pdf]
  • International Conference on Learning Representations (ICLR), New Orleans, USA, May, 2019.
  • Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong
  • DCAN: Dual Channel-wise Alignment Networks for Unsupervised Scene Adaptation. [pdf][code]
  • European Conference on Computer Vision (ECCV), Munich, Germany, September, 2018.
  • Zuxuan Wu, Xintong Han, Yen-Liang Lin, Mustafa Gkhan Uzunbas, Tom Goldstein, Ser Nam Lim, Larry S. Davis
  • BlockDrop: Dynamic Inference Paths in Residual Networks. [pdf][code]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, USA, June, 2018. (Spotlight)
  • Zuxuan Wu*, Tushar Nagarajan*, Abhishek Kumar, Steven Rennie, Larry S. Davis, Kristen Grauman, Rogerio Feris
    (* denotes equal contribution)
  • VITON: An Image-based Virtual Try-on Network. [pdf][code]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, USA, June, 2018. (Spotlight)
  • Xintong Han, Zuxuan Wu, Zhe Wu, Ruichi Yu, Larry S. Davis
  • Exploiting Feature and Class Relationships in Video Categorization with Regularized Deep Neural Networks. [pdf]
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Vol. 40, Issue 2, pp. 352-364, 2018.
  • Yu-Gang Jiang, Zuxuan Wu, Jun Wang, Xiangyang Xue, Shih-Fu Chang
  • Fudan-Columbia Video Dataset (FCVID), one of the largest public Web video datasets with manual annotations.
  • Deep Learning for Video Classification and Video Captioning. [pdf]
  • In Frontiers of Multimedia Research, Shih-Fu Chang (Ed.), ACM Morgan & Claypool, New York, NY, USA, pp. 3-29, 2018
  • Zuxuan Wu, Ting Yao, Yanwei Fu, Yu-Gang Jiang
  • Surveying 100+ recent literatures on video classification and captioning with deep learning.
  • Weakly-Supervised Spatial Context Networks. [pdf]
  • arXiv preprint arXiv:1704.02998
  • Zuxuan Wu, Larry S. Davis, Leonid Sigal
  • Automatic Spatially-aware Fashion Concept Discovery. [pdf]
  • International Conference on Computer Vision (ICCV), Venice, Italy, Oct., 2017.
  • Xintong Han, Zuxuan Wu, Phoenix Huang, Xiao Zhang, Menglong Zhu, Yuan Li, Yang Zhao, Larry S. Davis
  • Learning Fashion Compatibility with Bidirectional LSTMs. [pdf]
  • ACM Multimedia (ACM MM), Mountain View, USA, Oct., 2017.
  • Xintong Han, Zuxuan Wu, Yu-Gang Jiang, Larry S. Davis
  • Learning Semantic Feature Map for Visual Content Recognition. [pdf]
  • ACM Multimedia (ACM MM), Mountain View, USA, Oct., 2017.
  • Rui-Wei Zhao, Zuxuan Wu, Jianguo Li, Yu-Gang Jiang
  • Harnessing Object and Scene Semantics for Large-Scale Video Understanding. [pdf]
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, USA, June, 2016. (Spotlight)
  • Zuxuan Wu, Yanwei Fu, Yu-Gang Jiang, Leonid Sigal
  • Featured in Tech2, ACM Technews
  • Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification. [pdf]
  • ACM Multimedia (ACM MM), Amsterdam, the Netherlands, Oct., 2016. (Oral Paper)
  • Zuxuan Wu, Yu-Gang Jiang, Xi Wang, Hao Ye, Xiangyang Xue
  • Modeling Spatial-Temporal Clues in a Hybrid Deep Learning Framework for Video Classification. [pdf]
  • ACM Multimedia (ACM MM), Brisbane, Australia, Oct., 2015. (Oral Paper)
  • Zuxuan Wu, Xi Wang, Yu-Gang Jiang, Hao Ye, Xiangyang Xue
  • Obtain 91.3% accuracy on the UCF-101 dataset.
  • Evaluating Two-Stream CNN for Video Classification. [pdf][motion CNN model]
  • ACM International Conference on Multimedia Retrieval (ICMR), Shanghai, China, June, 2015
  • Hao Ye, Zuxuan Wu, Rui-Wei Zhao, Xi Wang, Yu-Gang Jiang, Xiangyang Xue
  • Exploring Inter-feature and Inter-class Relationships with Deep Neural Networks for Video Classification. [pdf]
  • ACM Multimedia (ACM MM), Orlando, USA, Nov., 2014. (Oral Paper)
  • Zuxuan Wu, Yu-Gang Jiang, Jun Wang, Jian Pu, Xiangyang Xue

Honors

2017.12Snap Inc. Fellowship
2016.08Dean's Fellowship
2015.10National Graduate Scholarship of China
2015.08Google Excellent Student Award
2015.07ACM Student Travel Grant
2014.10National Graduate Scholarship of China
2014.08ACM Student Travel Grant
2013.06Shanghai Outstanding Graduates
2012.12Shanghai Scholarship