Title:
Consistent 3D Human Reconstruction From Monocular Video: Learning Correctable Appearance and Temporal Motion Priors.
Source:
IEEE Transactions on Visualization and Computer Graphics [IEEE Trans Vis Comput Graph] 2026 Feb; Vol. 32 (2), pp. 1895-1910.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: IEEE Computer Society Country of Publication: United States NLM ID: 9891704 Publication Model: Print Cited Medium: Internet ISSN: 1941-0506 (Electronic) Linking ISSN: 1077-2626 NLM ISO Abbreviation: IEEE Trans Vis Comput Graph Subsets: MEDLINE
Imprint Name(s):
Original Publication: New York, NY : IEEE Computer Society, c1995-
Entry Date(s):
Date Created: 20251031 Date Completed: 20260206 Latest Revision: 20260209
Update Code:
20260210
DOI:
10.1109/TVCG.2025.3626741
PMID:
41171674
Database:
MEDLINE

Abstract:

*Recent advances in rendering dynamic humans with NeRF and 3D Gaussian splatting have leveraged implicit geometry learning and image appearance rendering to create digital humans. However, monocular video rendering still struggles to reproduce subtle and complex motion across different viewpoints and states, primarily because the viewpoints in a single video are imbalanced. Ensuring continuity between adjacent frames when rendering from novel, free viewpoints also remains difficult. To address these challenges, we first propose a pixel-level motion correction module that corrects errors in the learned representation across viewpoints. We also introduce a temporal-information-based model that improves motion continuity by leveraging adjacent frames. Experiments on dynamic human rendering with the NeuMan, ZJU-MoCap, and People-Snapshot datasets demonstrate that our method outperforms state-of-the-art techniques both quantitatively and qualitatively.*
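The abstract does not give the concrete form of the temporal model, but the idea of enforcing continuity between adjacent frames can be illustrated with a minimal sketch. The function names, the additive per-pixel correction, and the mean-squared adjacent-frame penalty below are all assumptions chosen for illustration, not the paper's actual method.

```python
import numpy as np

def pixel_correction(rendered, residual):
    """Hypothetical stand-in for a pixel-level motion correction module:
    add a learned per-pixel residual to the rendered frame."""
    return rendered + residual

def temporal_continuity_loss(frames):
    """Generic temporal smoothness penalty (assumed form): mean squared
    difference between each pair of adjacent frames."""
    diffs = [np.mean((frames[t + 1] - frames[t]) ** 2)
             for t in range(len(frames) - 1)]
    return float(np.mean(diffs))

# Toy example: three 4x4 grayscale "frames" with constant intensity.
frames = [np.full((4, 4), v, dtype=np.float64) for v in (0.0, 0.1, 0.2)]
loss = temporal_continuity_loss(frames)  # each adjacent diff is 0.1**2 = 0.01
```

A real implementation would compute this loss on rendered outputs during training so that gradients encourage consecutive novel-view frames to change smoothly.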