Bdbtoken Exchange

Chen GENG „ÄåËÄø Êô® ? My first name is Chen, and my last name is Geng.
I prefer to be addressed by my first name Chen.
Possible pronunciation: Chen(ch-uhn) Geng(guh-ng). „Äç

Hiüëã! I'm a first-year CS Ph.D. student at Stanford. I am fortunate to be advised by Prof. Jiajun Wu and affiliated with the Stanford Vision and Learning Lab. My research lies at the intersection between Graphics, 3D Vision, and Machine Learning. Specifically, I'm currently interested in physical scene understanding by inverting graphics engines.

Previously, I got my bachelor's degree in Computer Science from Zhejiang University in 2023, with an honors degree from Chu Kochen Honors College. During my undergraduate, I was privileged to work closely with Prof. Xiaowei Zhou and Prof. Sida Peng on several research projects. I spent a wonderful summer at Stanford, also with the CogAI group, in 2022.

If you have shared research interests or have any topics you'd like to chat about ‚Äî especially if you're from underrepresented groups ‚Äî don't hesitate to shoot me an email. I'm always up for exploring potential collaborations and/or engaging in insightful conversations.

Email: X √ó Y, where X = {gengchen}, Y = {@cs.stanford.edu}

Google Scholar &nbsp/&nbsp Twitter &nbsp/&nbsp GitHub &nbsp/&nbsp LinkedIn

News üì∞

02/2024: One paper is accepted to CVPR 2024.
01/2024: One paper is accepted to ICLR 2024.
01/2024: This year I will co-organize Stanford Graphics Cafe (Lunch Seminar). Subscribe to our mailing list here!
07/2023: One paper is accepted to ICCV 2023.
04/2023: I'll be joining Stanford University as a CS PhD student.
02/2023: One paper is accepted to CVPR 2023.
02/2023: One paper is accepted to TPAMI.
04/2022: One paper is accepted to SIGGRAPH 2022.

Selected Research üî¨

&nbsp&nbsp&nbsp&nbsp (* denotes equal contribution)

&nbsp&nbsp&nbsp&nbsp For the comprehensive list, check out my Google Scholar page.

	Relightable and Animatable Neural Avatar from Sparse-View Video Zhen Xu, Sida Peng, Chen Geng, Linzhan Mou, Zihan Yan, Jiaming Sun, Hujun Bao, Xiaowei Zhou CVPR 2024 (Highlight) [arXiv] [Project Page] [Video] tl;dr: We estimate physically based intrinsics of dynamic characters from monocular videos.
	Neural Polynomial Gabor Fields for Macro Motion Analysis Chen Geng, Hong-Xing "Koven" Yu, Sida Peng, Xiaowei Zhou, Jiajun Wu ICLR 2024 [Paper] [OpenReview] [Project Page] tl;dr: We discover a low-dimensional interpretable motion representation for dynamic scenes with macro motion.
	Tree-Structured Shading Decomposition Chen Geng, Hong-Xing "Koven" Yu, Sharon Zhang, Maneesh Agrawala, Jiajun Wu ICCV 2023 [Paper] [Project Page] [Video] [Code] [MarkTechPost] tl;dr: We decompose the shading of objects into a tree-structured representation, which can be edited or interpreted by users easily. Abstract: We study the problem of obtaining a tree-structured representation for shading objects. Prior work typically uses the parametric or measured representation to model shading, which is neither interpretable nor easily editable. Our method uses the shade tree representation, which combines basic shading nodes and compositing methods, to model and decomposes the material shading. Such a representation enables users to edit previously rigid material appearances in an efficient and intuitive manner. In particular, novice users who are unfamiliar with the construction of such shade trees can quickly obtain such a representation. The extraction of such a representation enables the editing and understanding of object shading, even for novice users. The biggest challenge in this task is that the discrete structure of the shade tree is not differentiable. We propose a hybrid algorithm to address this issue. First, given an input image, a recursive amortized inference model is leveraged to initialize a guess of the tree structure and corresponding leaf node parameters. Then, we apply an optimization-based method to fine-tune the result. Experiments show that our method works well on synthetic images, realistic images, and non-realistic vector drawings, surpassing the baselines significantly.
	Learning Neural Volumetric Representations of Dynamic Humans in Minutes Chen Geng, gate io, Zhen Xu, Hujun Bao, gateio CVPR 2023 [Paper] [Project Page] [Code] tl;dr: We accelerate the learning of neural volumetric videos of dynamic humans by over 100 times. Abstract: This paper addresses the challenge of quickly reconstructing free-viewpoint videos of dynamic humans from sparse multi-view videos. Some recent works represent the dynamic human as a canonical neural radiance field (NeRF) and a motion field, which are learned from videos through differentiable rendering. They generally require a lengthy optimization process. Other generalization methods leverage learned prior from datasets and reduce the optimization time by only finetuning on new scenes, at the cost of visual fidelity. In this paper, we propose a novel method for creating viewpoint-free human performance synthesis from sparse view videos in minutes with competitive visual quality. Specifically, we leverage the human body prior to define a novel part-based voxelized NeRF representation, which distributes the representational power of the canonical human model efficiently. Furthermore, we propose a novel dimensionality reduction 2D motion parameterization scheme to increase the convergence rate of the human deformation field. Experiments demonstrate that our approach can be trained 100 times faster than prior per-scene optimiztion methods while being competitive in the rendering quality. We show that given a video capturing a human performer of 100 frames, our model typically takes about 5 minutes for training to produce photorealistic free-viewpoint videos on a single RTX 3090 GPU. The code will be released for reproducibility.
	Implicit Neural Representations with Structured Latent Codes for Human Body Modeling Sida Peng, Chen Geng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Xiaowei Zhou, Hujun Bao TPAMI 2023 [Paper] [www] tl;dr: Our approach reconstruct geometry and appearance of human performers with high accuracy from sparse observations.
	Novel View Synthesis of Human Interactions from Sparse Multi-view Videos Qing Shuai, Chen Geng, Qi Fang, Sida Peng, Wenhao Shen, Xiaowei Zhou, Hujun Bao SIGGRAPH 2022 (Featured in the technical paper trailer) [Paper] gate io [Code] [Project Page] tl;dr: Given sparse multi-view videos of crowded scenes with multiple human performers, our approach is able to generate high-fidelity novel views and accurate instance masks. @inproceedings{multinb, &nbsp&nbsp&nbsp&nbsp title = {Novel View Synthesis of Human Interactions from Sparse Multi-view Videos}, &nbsp&nbsp&nbsp&nbsp author = {Qing, Shuai and Chen, Geng and Qi, Fang and Sida, Peng and Wenhao, Shen and Xiaowei, Zhou and Hujun, Bao}, &nbsp&nbsp&nbsp&nbsp booktitle = {SIGGRAPH Conference Proceedings}, &nbsp&nbsp&nbsp&nbsp year = {2022}, }

Experience üßë‚Äçüéì

	Stanford University 2023 - Present, Stanford, California PhD Student Advisor: Prof. gateio
	Zhejiang University 2019 - 2023, Hangzhou, China B.Eng.(Honours) in Computer Science Cumulative GPA: 94.38/100, 3.99/4.0 Major GPA: 96.67/100, 4.0/4.0 Advisor: Prof. gateio login

Academic Service üèõÔ∏è

Reviewer: CVPR, ECCV, SIGGRAPH Asia, ICML, ICLR, AISTATS, NeurIPS, T-PAMI, ICRA, NeurIPS D&B, 3DV, AAAI, Pacific Graphics.
Co-organizer: Stanford GCaf√© (Graphics Lunch Seminar) 2024, 3DV 2025.

This website is adapted from this awesome template
Last updated: Jun 2024