Opendata, web and dolomites

TalkingHeads SIGNED

TalkingHeads: Audiovisual Speech Recognition in-the-wild

Total Cost €

0

EC-Contrib. €

0

Partnership

0

Views

0

Project "TalkingHeads" data sheet

The following table provides information about the project.

Coordinator
THE UNIVERSITY OF NOTTINGHAM 

Organization address
address: University Park
city: NOTTINGHAM
postcode: NG7 2RD
website: www.nottingham.ac.uk

contact info
title: n.a.
name: n.a.
surname: n.a.
function: n.a.
email: n.a.
telephone: n.a.
fax: n.a.

 Coordinator Country United Kingdom [UK]
 Project website http://www.talking-heads.eu
 Total cost 183˙454 €
 EC max contribution 183˙454 € (100%)
 Programme 1. H2020-EU.1.3.2. (Nurturing excellence by means of cross-border and cross-sector mobility)
 Code Call H2020-MSCA-IF-2015
 Funding Scheme MSCA-IF-EF-ST
 Starting year 2016
 Duration (year-month-day) from 2016-06-01   to  2018-05-31

 Partnership

Take a look of project's partnership.

# participants  country  role  EC contrib. [€] 
1    THE UNIVERSITY OF NOTTINGHAM UK (NOTTINGHAM) coordinator 183˙454.00

Map

 Project objective

Audio-visual speech recognition refers to the problem of recognizing speech using both audio and video information. Speech is not a purely auditory process but the way that the listener perceives it is also through the recognition of the visual patterns associated with the mouth movement. This correlation of the audio-visual information has been occasionally explored in literature in order to develop more robust automatic speech recognition systems for cases in which the auditory environment is noisy (e.g. background noise, multiple speakers). However, the problem of audio-visual speech recognition has been mainly studied in controlled, laboratory conditions. TalkingHeads proposes, for the first time, the problem of audio-visual speech recognition in unconstrained (in-the-wild) videos collected from real-world multimedia databases and a set of methodologies that will work well under the assumed in-the-wild setting.

TalkingHeads brings together a talented but experienced researcher (ER) with expertise in speech analysis (diarization and recognition) and the Supervisor with large research experience in Computer Vision for face analysis in-the-wild (recognition, detection, alignment and tracking, and facial expression analysis). TalkingHeads will establish the ER as an independent and internationally recognized researcher in the area of audio-visual fusion and speech recognition. Through TalkingHeads’ achievable work plan, the ER will attain a high level of research maturity by (a) complementing his expertise on speech analysis through extensive training in Computer Vision, (b) conducting research on a challenging research problem (audio-visual speech recognition in-the-wild) with significant career opportunities in both the academia and the industry, (c) publishing at high impact factor conferences and journals, (d) establishing a network of research collaborators, and (e) enhancing personal skills (e.g. supervisory experience, leadership and management skills).

 Publications

year authors and title journal last update
List of publications.
2018 Themos Stafylakis, Georgios Tzimiropoulos
Zero-shot keyword spotting for visual speech recognition in-the-wild
published pages: , ISSN: , DOI:
European Conference on Computer Vision (ECCV) 2019-06-13
2017 Themos Stafylakis, Georgios Tzimiropoulos
Combining Residual Networks with LSTMs for Lipreading
published pages: , ISSN: , DOI:
Interspeech 2019-06-13
2016 Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto
The I4U submission to the 2016 NIST speaker recognition evaluation
published pages: , ISSN: , DOI:
NIST SRE 2016 Workshop 2019-06-13
2017 Kong-Aik Lee, Ville Hautamäki, Tomi Kinnunen, Anthony Larcher, C Zhang, A Nautsch, T Stafylakis, G Liu, M Rouvier, W Rao, F Alegre, J Ma, MW Mak, AK Sarkar, H Delgado, R Saeidi, H Aronowitz, A Sizov, H Sun, TH Nguyen, Md Sahidullah, V Vestman, M Halonen, A Kanervisto
The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016
published pages: , ISSN: , DOI:
Interspeech 2019-06-13
2018 Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic
End-to-end Audiovisual Speech Recognition
published pages: , ISSN: , DOI:
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019-06-13
2018 Stafylakis, Themos; Tzimiropoulos, Georgios
Deep word embeddings for visual speech recognition
published pages: , ISSN: , DOI:
IEEE International Conference on Acoustics, Speech, and Signal Processing 2019-06-13
2018 Brummer, Niko; Silnova, Anna; Burget, Lukas; Stafylakis, Themos
Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model
published pages: , ISSN: , DOI:
Proceedings of Odyssey 2019-06-13

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "TALKINGHEADS" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an  email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "TALKINGHEADS" are provided by the European Opendata Portal: CORDIS opendata.

More projects from the same programme (H2020-EU.1.3.2.)

GRAHAM (2018)

Concepts of Graph Theory Applied to the Human Microbiome

Read More  

ArcticRisk (2020)

Risk and Business continuity management in the Arctic

Read More  

HSQG (2020)

Higher Spin Quantum Gravity: Lagrangian Formulations for Higher Spin Gravity and Their Applications

Read More