I am currently a research lead at TikTok. I was previously a research scientist at Facebook AI Research and an early member of the Amazon Go team that built the computer vision system to replace human cashiers in retail. Before moving to the US, I was a postdoc in the LEAR team at INRIA with Cordelia Schmid. I received my Ph.D. in Computer Vision from the Chinese Academy of Sciences and my B.S. in Electrical Engineering from Harbin Institute of Technology.
My research interests range from low-level to high-level vision, with a focus on video understanding. You can find more detailed information in my CV and on my old homepage. The best way to contact me is via my e-mail: .
- Released the code & model for our CVPR 2022 paper on open-world instance segmentation.
- Released the UVO dataset and organized a challenge for Open-World Segmentation @ ICCV 2021.
- is released! Check out the code on GitHub and the official website.
- Open-sourced the code & model for TimeSformer.
| Open-World Instance Segmentation: Exploiting Pseudo Ground Truth Learned from Pairwise Affinity. |
Weiyao Wang, Matt Feiszli, Heng Wang, Jitendra Malik, Du Tran. CVPR, 2022.
Paper, Project page, Code
| PyTorchVideo: A Deep Learning Library for Video Understanding. |
Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, Christoph Feichtenhofer. ACM International Conference on Multimedia, 2021.
Paper, Project page, Code, Facebook AI Blog
| Unidentified Video Objects: A Benchmark for Dense, Open-World Segmentation. |
Weiyao Wang, Matt Feiszli, Heng Wang, Du Tran. ICCV, 2021.
Paper, Dataset, Workshop, Challenge, Facebook AI Blog
| Searching for Two-Stream Models in Multivariate Space for Video Recognition. |
Xinyu Gong, Heng Wang, Zheng Shou, Matt Feiszli, Zhangyang Wang, Zhicheng Yan. ICCV, 2021.
| Interactive Prototype Learning for Egocentric Action Recognition. |
Xiaohan Wang, Linchao Zhu, Heng Wang, Yi Yang. ICCV, 2021.
| Is Space-Time Attention All You Need for Video Understanding? |
Gedas Bertasius, Heng Wang, Lorenzo Torresani. ICML, 2021.
Paper, Code, Facebook AI Blog
| Beyond Short Clips: End-to-End Video-Level Learning with Collaborative Memories. |
Xitong Yang, Haoqi Fan, Lorenzo Torresani, Larry Davis, Heng Wang. CVPR, 2021.
Paper, Poster, Slides
- Area Chair: BMVC 2021
- Reviewer: CVPR’13-21, ICCV’13-21, ECCV’14-20, T-PAMI, IJCV, etc.