Opendata, web and dolomites

TalkingHeads SIGNED

TalkingHeads: Audiovisual Speech Recognition in-the-wild

Project "TalkingHeads" data sheet

The following table provides information about the project.

Coordinator	THE UNIVERSITY OF NOTTINGHAM (address: University Park, NOTTINGHAM, NG7 2RD; website: www.nottingham.ac.uk)
Coordinator Country	United Kingdom [UK]
Project website	http://www.talking-heads.eu
Total cost	183,454 €
EC max contribution	183,454 € (100%)
Programme	1. H2020-EU.1.3.2. (Nurturing excellence by means of cross-border and cross-sector mobility)
Code Call	H2020-MSCA-IF-2015
Funding Scheme	MSCA-IF-EF-ST
Starting year	2016
Duration (year-month-day)	from 2016-06-01 to 2018-05-31

Partnership

Take a look at the project's partnership.

#	participants	country	role	EC contrib. [€]
1	THE UNIVERSITY OF NOTTINGHAM (address: University Park, NOTTINGHAM, NG7 2RD; website: www.nottingham.ac.uk)	UK (NOTTINGHAM)	coordinator	183,454.00

Project objective

Audio-visual speech recognition refers to the problem of recognizing speech using both audio and video information. Speech perception is not purely auditory: the listener also relies on recognizing the visual patterns associated with mouth movement. The correlation between audio and visual information has occasionally been explored in the literature in order to develop more robust automatic speech recognition systems for cases in which the auditory environment is noisy (e.g. background noise, multiple speakers). However, audio-visual speech recognition has mainly been studied in controlled laboratory conditions. TalkingHeads proposes, for the first time, to study audio-visual speech recognition in unconstrained (in-the-wild) videos collected from real-world multimedia databases, together with a set of methodologies designed to work well in that in-the-wild setting.

TalkingHeads brings together a talented Experienced Researcher (ER) with expertise in speech analysis (diarization and recognition) and a Supervisor with extensive research experience in Computer Vision for face analysis in-the-wild (recognition, detection, alignment and tracking, and facial expression analysis). TalkingHeads will establish the ER as an independent and internationally recognized researcher in the area of audio-visual fusion and speech recognition. Through TalkingHeads’ achievable work plan, the ER will attain a high level of research maturity by (a) complementing his expertise in speech analysis through extensive training in Computer Vision, (b) conducting research on a challenging problem (audio-visual speech recognition in-the-wild) with significant career opportunities in both academia and industry, (c) publishing at high-impact conferences and journals, (d) establishing a network of research collaborators, and (e) enhancing personal skills (e.g. supervisory experience, leadership and management skills).

Publications

List of publications.
year	authors and title	journal	last update
2018	Themos Stafylakis, Georgios Tzimiropoulos. Zero-shot keyword spotting for visual speech recognition in-the-wild	European Conference on Computer Vision (ECCV)	2019-06-13
2017	Themos Stafylakis, Georgios Tzimiropoulos. Combining Residual Networks with LSTMs for Lipreading	Interspeech	2019-06-13
2016	Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto. The I4U submission to the 2016 NIST speaker recognition evaluation	NIST SRE 2016 Workshop	2019-06-13
2017	Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto. The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016	Interspeech	2019-06-13
2018	Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic. End-to-end Audiovisual Speech Recognition	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	2019-06-13
2018	Themos Stafylakis, Georgios Tzimiropoulos. Deep word embeddings for visual speech recognition	IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)	2019-06-13
2018	Niko Brummer, Anna Silnova, Lukas Burget, Themos Stafylakis. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model	Proceedings of Odyssey	2019-06-13

The information about "TALKINGHEADS" is provided by the European Opendata Portal: CORDIS opendata.
