#	Pagina
attuale pagina	/open-fp7/projects/89254/index.html
-1	/open-fp7/projects/99116/index.html
-2	/open-fp7/projects/87811/index.html
-3	/open-fp7/projects/90750/index.html
-4	/open-fp7/projects/99108/index.html
-5	/open-fp7/projects/89651/index.html
-6	/open-fp7/projects/99145/index.html

CALBC

Collaborative Annotation of a Large Biomedical Corpus

Coordinatore	EUROPEAN MOLECULAR BIOLOGY LABORATORY Organization address address: Wellcome Trust Genome Campus - city: Hinxton, Cambridge postcode: CB10 1SD contact info Titolo: Ms. Nome: Barbara Cognome: Baron Email: send email Telefono: 441223000000 Fax: 441223000000
Nazionalità Coordinatore	Germany [DE]
Totale costo	2˙197˙840 €
EC contributo	1˙499˙687 €
Programma	FP7-ICT Specific Programme "Cooperation": Information and communication technologies
Code Call	FP7-ICT-2007-3
Funding Scheme	CSA
Anno di inizio	2009
Periodo (anno-mese-giorno)	2009-01-01 - 2011-06-30

Partecipanti

#	participant	country	role
1	EUROPEAN MOLECULAR BIOLOGY LABORATORY Organization address address: Wellcome Trust Genome Campus - city: Hinxton, Cambridge postcode: CB10 1SD contact info Titolo: Ms. Nome: Barbara Cognome: Baron Email: send email Telefono: 441223000000 Fax: 441223000000	DE (Hinxton, Cambridge)	coordinator
2	ERASMUS UNIVERSITAIR MEDISCH CENTRUM ROTTERDAM Organization address address: 's Gravendijkwal city: ROTTERDAM postcode: 3015CE contact info Titolo: Dr. Nome: Erik Cognome: van Mulligen Email: send email Telefono: -7043026 Fax: -7044701	NL (ROTTERDAM)	participant
3	FRIEDRICH-SCHILLER-UNIVERSITAET JENA Organization address address: FUERSTENGRABEN city: JENA postcode: 7743 contact info Titolo: Prof. Nome: Udo Cognome: Hahn Email: send email Telefono: +49 3641 944 320 Fax: +49 3641 931052	DE (JENA)	participant
4	LINGUAMATICS LIMITED Organization address address: St Johns Innovation Centre, Cowley Road city: Cambridge postcode: CB4 0WS contact info Titolo: Dr. Nome: Roger Cognome: Hale Email: send email Telefono: 441223000000 Fax: +44 1223 421361	UK (Cambridge)	participant

Mappa

Word cloud

Esplora la "nuvola delle parole (Word Cloud) per avere un'idea di massima del progetto.

annotated semantic scopes named community evaluation mining entities small annotations recognition annotation strategies corpora entity format variety corpus biomedical

Obiettivo del progetto (Objective)

This proposal defines a support action project that brings together the researchers from international biomedical text-mining groups to address the difficult issue of annotating large text corpora with a large set of semantic types. We propose a collaborative approach to this annotation task in the form of an open challenge to the biomedical text-mining community. The task is the annotation of named entities in a large biomedical corpus, for a variety of semantic categories. The project delivers as outcome a large, collaboratively annotated corpus, marked with the mentions of biomedical entities. The annotated corpus becomes a resource for the community, to be used as a reference for improving text-mining applications. The biomedical text-mining research community has a long tradition of organizing such challenges, as a way of evaluating techniques, sharing technical knowledge, and helping to improve the results from text-mining programs. However, such challenges have typically addressed relatively small corpora in a narrow sub-domain, in part because the evaluation of the results is extremely long and costly. As a result, the generated annotated corpora are too small and are only narrowly annotated to be useful in a variety of text-mining applications. In contrast, we propose to create a broadly-scoped and large annotated corpus by integrating the annotations from different named entity recognition systems. Metadata will also be added to the corpus. The participating systems have different application scopes and annotation strategies, and therefore complement each other. As a consequence, the annotated corpus reflects these different scopes and strategies. A secondary goal of this project is to define a standardized format for representing the annotations contributed by the participants and comparing them effectively. Currently the lack of such a format hinders progress in the evaluation of named entity recognition systems.