Opendata, web and dolomites

iEXTRACT SIGNED

Information Extraction for Everyone

Total Cost €

0

EC-Contrib. €

0

Partnership

0

Views

0

Project "iEXTRACT" data sheet

The following table provides information about the project.

Coordinator
BAR ILAN UNIVERSITY 

Organization address
address: BAR ILAN UNIVERSITY CAMPUS
city: RAMAT GAN
postcode: 52900
website: www.biu.ac.il

contact info
title: n.a.
name: n.a.
surname: n.a.
function: n.a.
email: n.a.
telephone: n.a.
fax: n.a.

 Coordinator Country Israel [IL]
 Total cost 1˙499˙354 €
 EC max contribution 1˙499˙354 € (100%)
 Programme 1. H2020-EU.1.1. (EXCELLENT SCIENCE - European Research Council (ERC))
 Code Call ERC-2018-STG
 Funding Scheme ERC-STG
 Starting year 2019
 Duration (year-month-day) from 2019-05-01   to  2024-04-30

 Partnership

Take a look of project's partnership.

# participants  country  role  EC contrib. [€] 
1    BAR ILAN UNIVERSITY IL (RAMAT GAN) coordinator 1˙499˙354.00

Map

 Project objective

Staggering amounts of information are stored in natural language documents, rendering them unavailable to data-science techniques. Information Extraction (IE), a subfield of Natural Language Processing (NLP), aims to automate the extraction of structured information from text, yielding datasets that can be queried, analyzed and combined to provide new insights and drive research forward.

Despite tremendous progress in NLP, IE systems remain mostly inaccessible to non-NLP-experts who can greatly benefit from them. This stems from the current methods for creating IE systems: the dominant machine-learning (ML) approach requires technical expertise and large amounts of annotated data, and does not provide the user control over the extraction process. The previously dominant rule-based approach unrealistically requires the user to anticipate and deal with the nuances of natural language.

I aim to remedy this situation by revisiting rule-based IE in light of advances in NLP and ML. The key idea is to cast IE as a collaborative human-computer effort, in which the user provides domain-specific knowledge, and the system is in charge of solving various domain-independent linguistic complexities, ultimately allowing the user to query unstructured texts via easily structured forms.

More specifically, I aim develop: (a) a novel structured representation that abstracts much of the complexity of natural language; (b) algorithms that derive these representations from texts; (c) an accessible rule language to query this representation; (d) AI components that infer the user extraction intents, and based on them promote relevant examples and highlight extraction cases that require special attention.

The ultimate goal of this project is to democratize NLP and bring advanced IE capabilities directly to the hands of domain-experts: doctors, lawyers, researchers and scientists, empowering them to process large volumes of data and advance their profession.

Are you the coordinator (or a participant) of this project? Plaese send me more information about the "IEXTRACT" project.

For instance: the website url (it has not provided by EU-opendata yet), the logo, a more detailed description of the project (in plain text as a rtf file or a word file), some pictures (as picture files, not embedded into any word file), twitter account, linkedin page, etc.

Send me an  email (fabio@fabiodisconzi.com) and I put them in your project's page as son as possible.

Thanks. And then put a link of this page into your project's website.

The information about "IEXTRACT" are provided by the European Opendata Portal: CORDIS opendata.

More projects from the same programme (H2020-EU.1.1.)

TransTempoFold (2019)

A need for speed: mechanisms to coordinate protein synthesis and folding in metazoans

Read More  

MITOvTOXO (2020)

Understanding how mitochondria compete with Toxoplasma for nutrients to defend the host cell

Read More  

Mu-MASS (2019)

Muonium Laser Spectroscopy

Read More