ScanTalk: 3D Talking Heads from Unregistered Scans

Nocentini, Federico; Besnier, Thomas; Ferrari, Claudio; Arguillere, Sylvain; Berretti, Stefano; Daoudi, Mohamed

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.10942 (cs)

[Submitted on 16 Mar 2024 (v1), last revised 25 Sep 2024 (this version, v3)]

Title:ScanTalk: 3D Talking Heads from Unregistered Scans

Authors:Federico Nocentini, Thomas Besnier, Claudio Ferrari, Sylvain Arguillere, Stefano Berretti, Mohamed Daoudi

View PDF HTML (experimental)

Abstract:Speech-driven 3D talking heads generation has emerged as a significant area of interest among researchers, presenting numerous challenges. Existing methods are constrained by animating faces with fixed topologies, wherein point-wise correspondence is established, and the number and order of points remains consistent across all identities the model can animate. In this work, we present \textbf{ScanTalk}, a novel framework capable of animating 3D faces in arbitrary topologies including scanned data. Our approach relies on the DiffusionNet architecture to overcome the fixed topology constraint, offering promising avenues for more flexible and realistic 3D animations. By leveraging the power of DiffusionNet, ScanTalk not only adapts to diverse facial structures but also maintains fidelity when dealing with scanned data, thereby enhancing the authenticity and versatility of generated 3D talking heads. Through comprehensive comparisons with state-of-the-art methods, we validate the efficacy of our approach, demonstrating its capacity to generate realistic talking heads comparable to existing techniques. While our primary objective is to develop a generic method free from topological constraints, all state-of-the-art methodologies are bound by such limitations. Code for reproducing our results, and the pre-trained model are available at this https URL .

Comments:	Published in the ECCV 2024 Proceedings
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.10942 [cs.CV]
	(or arXiv:2403.10942v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.10942

Submission history

From: Federico Nocentini [view email]
[v1] Sat, 16 Mar 2024 14:58:58 UTC (22,314 KB)
[v2] Tue, 19 Mar 2024 11:28:12 UTC (22,314 KB)
[v3] Wed, 25 Sep 2024 09:42:45 UTC (22,605 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ScanTalk: 3D Talking Heads from Unregistered Scans

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ScanTalk: 3D Talking Heads from Unregistered Scans

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators