Multiple View Neural Regression of a Facial Shape Model

Salvato in:
Dettagli Bibliografici
Pubblicato in:ProQuest Dissertations and Theses (2025)
Autore principale: Li, Xiang
Pubblicazione:
ProQuest Dissertations & Theses
Soggetti:
Accesso online:Citation/Abstract
Full Text - PDF
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!

MARC

LEADER 00000nab a2200000uu 4500
001 3266812676
003 UK-CbPIL
020 |a 9798297665835 
035 |a 3266812676 
045 2 |b d20250101  |b d20251231 
084 |a 66569  |2 nlm 
100 1 |a Li, Xiang 
245 1 |a Multiple View Neural Regression of a Facial Shape Model 
260 |b ProQuest Dissertations & Theses  |c 2025 
513 |a Dissertation/Thesis 
520 3 |a Creating re-topologized 3D facial meshes is a critical step in high-quality facial animation pipelines, yet it remains a labor-intensive and time-consuming task. Traditional approaches typically rely on multiview stereo reconstruction and specialized photometric environments to acquire accurate geometric and reflectance data under controlled conditions. This dissertation presents work toward more efficient capture of production-ready meshes including (1) developmental aspects of VarIS, a custom-designed light sphere capable of capturing high-resolution stereo geometry and reflectance maps—including diffuse, specular, and normal components under programmable illumination; (2) a study of the effects of camera parameters on automatic 2D and 3D landmarking methods, (3) methods for using synthetic data to train neural face regression, and (4) techniques proposed to improve neural multi-view regression of face shape.While VarIS enables photorealistic face capture, its operational cost and the need for manual processing of its acquired data highlight the need for a more scalable solution. To address this, a deep learning–based framework is proposed, enabling direct prediction of re-topologized facial meshes from synthetic multiview images. Training data was generated using Visage Craft, an in-house rendering system built upon a physically based Appearance 3D Morphable Model (A3DMM). The method infers dense mesh geometry in a standardized format ready to rig and animate. Results demonstrate that integrating precise camera intrinsics and extrinsics during training markedly improves landmark accuracy and geometric consistency, and incorporating 3D landmarks themselves in the regularization of the network also improves results. The final system presents a robust, data-driven alternative to conventional face analysis/synthesis workflows, capable of producing facial meshes with minimal human supervision. 
653 |a Deep learning 
653 |a Computer graphics 
653 |a Computer vision 
653 |a Lighting 
653 |a Animation 
653 |a Geometry 
653 |a Neural networks 
653 |a Artificial intelligence 
653 |a Computer science 
773 0 |t ProQuest Dissertations and Theses  |g (2025) 
786 0 |d ProQuest  |t ProQuest Dissertations & Theses Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3266812676/abstract/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3266812676/fulltextPDF/embedded/7BTGNMKEMPT1V9Z2?source=fedsrch