CALBC

Collaborative Annotation of a Large Biomedical Corpus

 Coordinatore EUROPEAN MOLECULAR BIOLOGY LABORATORY 

 Organization address address: Wellcome Trust Genome Campus -
city: Hinxton, Cambridge
postcode: CB10 1SD

contact info
Titolo: Ms.
Nome: Barbara
Cognome: Baron
Email: send email
Telefono: 441223000000
Fax: 441223000000

 Nazionalità Coordinatore Germany [DE]
 Totale costo 2˙197˙840 €
 EC contributo 1˙499˙687 €
 Programma FP7-ICT
Specific Programme "Cooperation": Information and communication technologies
 Code Call FP7-ICT-2007-3
 Funding Scheme CSA
 Anno di inizio 2009
 Periodo (anno-mese-giorno) 2009-01-01   -   2011-06-30

 Partecipanti

# participant  country  role  EC contrib. [€] 
1    EUROPEAN MOLECULAR BIOLOGY LABORATORY

 Organization address address: Wellcome Trust Genome Campus -
city: Hinxton, Cambridge
postcode: CB10 1SD

contact info
Titolo: Ms.
Nome: Barbara
Cognome: Baron
Email: send email
Telefono: 441223000000
Fax: 441223000000

DE (Hinxton, Cambridge) coordinator 0.00
2    ERASMUS UNIVERSITAIR MEDISCH CENTRUM ROTTERDAM

 Organization address address: 's Gravendijkwal
city: ROTTERDAM
postcode: 3015CE

contact info
Titolo: Dr.
Nome: Erik
Cognome: van Mulligen
Email: send email
Telefono: -7043026
Fax: -7044701

NL (ROTTERDAM) participant 0.00
3    FRIEDRICH-SCHILLER-UNIVERSITAET JENA

 Organization address address: FUERSTENGRABEN
city: JENA
postcode: 7743

contact info
Titolo: Prof.
Nome: Udo
Cognome: Hahn
Email: send email
Telefono: +49 3641 944 320
Fax: +49 3641 931052

DE (JENA) participant 0.00
4    LINGUAMATICS LIMITED

 Organization address address: St Johns Innovation Centre, Cowley Road
city: Cambridge
postcode: CB4 0WS

contact info
Titolo: Dr.
Nome: Roger
Cognome: Hale
Email: send email
Telefono: 441223000000
Fax: +44 1223 421361

UK (Cambridge) participant 0.00

Mappa


 Word cloud

Esplora la "nuvola delle parole (Word Cloud) per avere un'idea di massima del progetto.

corpus    biomedical    strategies    mining    format    corpora    entities    named    recognition    scopes    evaluation    small    semantic    annotations    annotated    annotation    entity    community    variety   

 Obiettivo del progetto (Objective)

This proposal defines a support action project that brings together the researchers from international biomedical text-mining groups to address the difficult issue of annotating large text corpora with a large set of semantic types. We propose a collaborative approach to this annotation task in the form of an open challenge to the biomedical text-mining community. The task is the annotation of named entities in a large biomedical corpus, for a variety of semantic categories. The project delivers as outcome a large, collaboratively annotated corpus, marked with the mentions of biomedical entities. The annotated corpus becomes a resource for the community, to be used as a reference for improving text-mining applications. The biomedical text-mining research community has a long tradition of organizing such challenges, as a way of evaluating techniques, sharing technical knowledge, and helping to improve the results from text-mining programs. However, such challenges have typically addressed relatively small corpora in a narrow sub-domain, in part because the evaluation of the results is extremely long and costly. As a result, the generated annotated corpora are too small and are only narrowly annotated to be useful in a variety of text-mining applications. In contrast, we propose to create a broadly-scoped and large annotated corpus by integrating the annotations from different named entity recognition systems. Metadata will also be added to the corpus. The participating systems have different application scopes and annotation strategies, and therefore complement each other. As a consequence, the annotated corpus reflects these different scopes and strategies. A secondary goal of this project is to define a standardized format for representing the annotations contributed by the participants and comparing them effectively. Currently the lack of such a format hinders progress in the evaluation of named entity recognition systems.

Altri progetti dello stesso programma (FP7-ICT)

CONFINE (2011)

Community Networks Testbed for the Future Internet

Read More  

QuILMI (2012)

Quantum Integrated Light Matter Interface

Read More  

PoSecCo (2010)

Policy and Security Configuration Management

Read More