Chao Feng

I am a master's student at the University of Michigan (UMich).

I got my Bachelor's degree from University of Electronic Science and Technology of China (UESTC).

Email: chfeng at umich dot edu

Email  /  CV  /  Google Scholar  /  Github

profile photo
  • 2023/03: Our paper "Self-Supervised Video Forensics by Audio-Visual Anomaly Detection" is seleted as a highlight by CVPR 2023.

  • 2023/02: Our paper "Self-Supervised Video Forensics by Audio-Visual Anomaly Detection" is accepted by CVPR 2023.


I'm interested in computer vision and machine learning.

unitouch  GPS-to-3D: Lifting Tourist Photos to 3D Using 2D Diffusion
Chao Feng, Ziyang Chen, Aleksander Holynski, Alexei A. Efros, Andrew Owens,
In submission

We produce 3D reconstruction for landmarks from unordered collections of tourist photos by GPS conditioned diffusion model and score distillation sampling.

unitouch  Binding Touch to Everything: Learning Unified Multimodal Tactile Representations
Fengyu Yang*, Chao Feng*, Ziyang Chen*, Hyoungseob Park, Daniel Wang, Yiming Dou,
Ziyao Zeng, Xien Chen, Rit Gangopadhyay, Andrew Owens, Alex Wong
In submission

We introduce UniTouch, a unified tactile representation for vision-based tactile sensors aligned with multiple modalities. We show we can now use powerful models trained on other modalities (e.g. CLIP, LLM) to conduct tactile sensing tasks zero shot.

b3do Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
Chao Feng, Ziyang Chen, Andrew Owens,
CVPR, 2023   (Highlight -- 2.5% accept rate)
project page / arXiv / code

We learn several feature sets in a self-supervised manner by using audio-visual synchronization task and utilize autoregressive model to do anomaly detection on top of each feature set for video forensics detection.

b3do ACL: Augmented Competitive Learning for Image Ordinal Classification
Chao Zhang*, Chao Feng*, Jianmei Cheng, Shuaicheng Liu, Ce Zhu,
In submission
project page / arXiv / code

We design a novel loss to embed ordinal information into training procedure and use augmented way to learn discriminative representation for image ordinal classification (IOC) task .

b3do AVA-AVD: Audio-Visual Speaker Diarization in the Wild
Eric Zhongcong Xu, Zeyang Song, Satoshi Tsutsui, Chao Feng, Mang Ye, Mike Zheng Shou,
ACM Multimedia, 2022
project page / arXiv / code

We create the AVA Audio-Visual Diarization (AVA-AVD) dataset to develop diarization methods for in-the-wild videos.