Opendata, web and dolomites

EMBEDDIA SIGNED

Cross-Lingual Embeddings for Less-Represented Languages in European News Media

Total Cost €

0

EC-Contrib. €

0

Partnership

0

Views

0

 EMBEDDIA project word cloud

Explore the words cloud of the EMBEDDIA project. It provides you a very rough idea of what is the project "EMBEDDIA" about.

barriers    engagement    languages    solutions    planning    multiple    world    industry    multilingual    lingual    represented    communities    coupled    leveraging    mobility    quality    changing    exist    translations    embeddings    tools    six    professional    internet    lack    diversity    becomes    government    serves    fair    expanding    services    cross    realise    base    german    dominant    limited    websites    citizens    networks    appropriate    innovations    speed    urgently    access    cultures    english    longer    equitable    monolingual    truly    few    absolutely    language    form    news    online    nor    web    fast    civic    contexts    personal    smaller    transformations    proliferation    local    multicultural    life    usually    near    neural    luxury    multilingualism    basic    untenable    everyday    media    operation    content    time    natural    continues    streams    fundamental    embeddia    mostly    national    37    french    computational    deep    clear    academic   

Project "EMBEDDIA" data sheet

The following table provides information about the project.

Coordinator
INSTITUT JOZEF STEFAN 

Organization address
address: Jamova 39
city: LJUBLJANA
postcode: 1000
website: www.ijs.si

contact info
title: n.a.
name: n.a.
surname: n.a.
function: n.a.
email: n.a.
telephone: n.a.
fax: n.a.

 Coordinator Country Slovenia [SI]
 Total cost 2˙998˙850 €
 EC max contribution 2˙998˙850 € (100%)
 Programme 1. H2020-EU.2.1.1. (INDUSTRIAL LEADERSHIP - Leadership in enabling and industrial technologies - Information and Communication Technologies (ICT))
 Code Call H2020-ICT-2018-2
 Funding Scheme RIA
 Starting year 2019
 Duration (year-month-day) from 2019-01-01   to  2021-12-31

 Partnership

Take a look of project's partnership.

# participants  country  role  EC contrib. [€] 
1    INSTITUT JOZEF STEFAN SI (LJUBLJANA) coordinator 560˙059.00
2    QUEEN MARY UNIVERSITY OF LONDON UK (LONDON) participant 451˙800.00
3    HELSINGIN YLIOPISTO FI (HELSINGIN YLIOPISTO) participant 448˙125.00
4    UNIVERSITE DE LA ROCHELLE FR (LA ROCHELLE) participant 372˙500.00
5    UNIVERZA V LJUBLJANI SI (LJUBLJANA) participant 323˙750.00
6    TEXTA OU EE (TALLINN) participant 306˙250.00
7    THE UNIVERSITY OF EDINBURGH UK (EDINBURGH) participant 175˙000.00
8    TRIKODER DRUSTVO S OGRANICENOM ODGOVORNOSCU ZA RAZVOJ INTERNET SUSTAVAI OBLIKOVANJE HR (ZAGREB GRAD ZAGREB) participant 125˙176.00
9    AS EKSPRESS MEEDIA EE (TALLINN) participant 113˙437.00
10    OY SUOMEN TIETOTOIMISTO - FINSKA NOTISBYRAN AB FI (HELSINKI) participant 111˙737.00
11    STYRIA MEDIJSKI SERVISI DOO ZA TRGOVINU I USLUGE HR (ZAGREB) participant 11˙013.00

Map

 Project objective

Access to the internet is no longer a luxury---it is a basic component of everyday life and civic engagement, but one in which language continues to be a challenge for fair and equitable access. As Europe becomes more multicultural, and personal and professional mobility between cultures rapidly increases, access to fundamental resources such as local news and government services is limited by the great diversity of the EU's 37 languages. The internet mostly developed in English, and without clear planning for how language issues might form barriers to access and engagement, nor how multilingualism might be supported. In the EU, websites and online services for citizens have developed national local language resources, and often only provide a second language (usually English) when absolutely needed; but the great proliferation of web content, multiple and fast-changing content streams, and an expanding user interest base make this approach untenable. And while advanced natural language research and resources exist for a few dominant languages (English, French, German), many of Europe's smaller language communities---and the news media industry that serves them---lack appropriate tools for multilingual internet development. For the EU to realise a truly equitable, open, multilingual future internet, new tools allowing high quality transformations (not translations) between languages are urgently needed. The EMBEDDIA project seeks to address these challenges by leveraging innovations in the use of cross-lingual embeddings coupled with deep neural networks to allow existing monolingual resources to be used across languages, leveraging their high speed of operation for near real-time applications, without the need for large computational resources. Across three years, the project's six academic and four industry partners will develop novel solutions including for under-represented languages, and test them in real-world news and media production contexts.

 Publications

year authors and title journal last update
List of publications.
2019 Miok, Kristian; Nguyen-Doan, Dong; Zaharie, Daniela; Robnik-Å ikonja, Marko
Generating Data using Monte Carlo Dropout
published pages: , ISSN: , DOI: 10.5281/zenodo.3559060
1 2020-03-05
2019 Pivovarova, Lidia; Marjanen, Jani; Zosa, Elaine
Word Clustering for Historical Newspapers Analysis
published pages: 3-10, ISSN: , DOI: 10.5281/zenodo.3402940
Proceedings of the Workshop on Language Technology for Digital Historical Archives in conjuction with RANLP-2019 2020-03-05
2019 Shamila Nasreen; Matthew Purver; Julian Hough
A Corpus Study on Questions, Responses and Misunderstanding Signals in Conversations with Alzheimer\'s Patients
published pages: , ISSN: , DOI: 10.5281/zenodo.3689456
Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue 13 2020-03-05
2019 Andraž Repar, Matej Martinc, Senja Pollak
Reproduction, replication, analysis and adaptation of a term alignment approach
published pages: , ISSN: 1574-020X, DOI: 10.1007/s10579-019-09477-1
Language Resources and Evaluation 2020-03-05
2019 Jani Marjanen; Lidia Pivovarova; Elaine Zosa; Jussi Kurunmäki
Clustering Ideological Terms in Historical Newspaper Data with Diachronic Word Embeddings
published pages: , ISSN: , DOI: 10.5281/zenodo.3689467
HistoInformatics 2019: International Workshop on Computational History 2019 2020-03-05
2019 Kristian Miok, Dong Nguyen-Doan, Daniela Zaharie, and Marko Robnik-Å ikonja
Generating Data using Monte Carlo Dropout
published pages: , ISSN: , DOI:
IEEE 15th International Conference on Intelligent Computer Communication and Processing (ICCP 2019) 2020-02-11
2019 Matej Martinc, Senja Pollak
Combining n -grams and deep convolutional features for language variety classification
published pages: 607-632, ISSN: 1351-3249, DOI: 10.1017/S1351324919000299
Natural Language Engineering 25/5 2020-02-11
2019 Andraž Repar, Vid Podpečan, Anže Vavpetič, Nada Lavrač, Senja Pollak
TermEnsembler
published pages: 93-120, ISSN: 0929-9971, DOI: 10.1075/term.00029.rep
Terminology 25/1 2020-02-11
2019 Senja Pollak, Andraž Repar, Matej Martinc, and Vid Podpečan
Karst exploration: Extracting terms and definitions from karst
published pages: , ISSN: , DOI:
Proceedings of the 6th biennial conference on electronic lexicography, eLex 2019 2020-02-11
2019 Martinc, Matej; Škrlj, Blaž; Pollak, Senja
Fake or Not: Distinguishing Between Bots, Males and Females
published pages: , ISSN: , DOI:
Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum 2 2020-02-11
2019 Martinc, Matej; Škrlj, Blaž; Pollak, Senja
Who is hot and who is not? Profiling celebs on Twitter
published pages: , ISSN: , DOI:
Working Notes of CLEF 2019 - Conference and Labs of the Evaluation Forum 6 2020-02-11
2019 Jani Marjanen, Lidia Pivovarova, Elaine Zosa, and Jussi Kurunmäki
Clustering Ideological Terms in Historical Newspaper Data with Diachronic Word Embeddings
published pages: , ISSN: , DOI:
Proceedings of the 5th International Workshop on Computational History 2020-02-11
2019 Lidia Pivovarova, Elaine Zosa, and Jussi Kurunmäki
Word Clustering for Historical Newspapers Analysis
published pages: , ISSN: , DOI:
Proceedings of the Workshop on Language Technology for Digital Historical Archives 2020-02-11
2019 Tadej Å kvorc, Simon Krek, Senja Pollak, Å pela Arhar Holdt, Marko Robnik-Å ikonja
Predicting Slovene Text Complexity Using Readability Measures
published pages: , ISSN: 2463-7807, DOI:
In Contributions to Contemporary History 2020-02-11
2019 Andraž Pelicon, Matej Martinc, and Petra Kralj Novak
Embeddia at SemEval-2019 Task 6: Detecting hate with neural network and transfer learning approaches
published pages: , ISSN: , DOI:
Proceedings of The 13th International Workshop on Semantic Evaluation (SemEval) 2020-02-11
2019 Morteza Rohanian, Julian Hough, Matthew Purver
Detecting Depression with Word-Level Multimodal Fusion
published pages: 1443-1447, ISSN: , DOI: 10.21437/interspeech.2019-2283
Interspeech 2019 2020-02-11
2019 Jose G. Moreno, Elvys Linhares Pontes, Mickael Coustaty, Antoine Doucet
TLR at BSNLP2019: A Multilingual Named Entity Recognition System
published pages: 83-88, ISSN: , DOI: 10.18653/v1/w19-3711
Proceedings of the 7th Workshop on Balto-Slavic Natural Language Processing 2020-02-11
2019 Matej Martinc, Senja Pollak
Pooled LSTM for Dutch cross-genre gender classification
published pages: , ISSN: , DOI:
Proceedings of the Shared Task on Cross-Genre Gender Detection in Dutch at Computational Linguistic in Netherlands (CLIN 2019) conference 2020-02-11

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "EMBEDDIA" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an  email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "EMBEDDIA" are provided by the European Opendata Portal: CORDIS opendata.

More projects from the same programme (H2020-EU.2.1.1.)

MULTIPOINT (2019)

Multibeam Femtosecond Laser System for High Throughput Micro-drilling of HLFC Structures

Read More  

NGI FORWARD (2019)

NGI FORWARD

Read More  

EuConNeCts4 (2019)

European Conferences on Networks and Communications (EuCNC)

Read More