CRAFT corpus, BioNLP-Corpora

warning: Creating default object from empty value in /home/medlingmap/medlingmap.org/Drupal/modules/taxonomy/taxonomy.pages.inc on line 33.

CRAFT: The Colorado Richly Annotated Full Text Corpus

The Colorado Richly Annotated Full Text Corpus (CRAFT) is a large annotated corpus consisting of full texts of biomedical journal articles. It includes both semantic and syntactic annotation layers (listed below) that have been carried out by experienced linguistic and domain-expert annotators. Various formats are available.

A sample of the corpus is available here:
craft-pre-release.tar.gz

BioNlp-Corpora

BioNLP-Corpora is a repository of biologically and linguistically annotated corpora and biological datasets. It is one of the projects of the BioNLP initiative by the Center for Computational Pharmacology at the University of Colorado Denver Health Sciences Center to create and distribute code, software, and data for applying natural language processing techniques to biomedical texts.

An overview of the CRAFT concept annotation guidelines

Bada, M., L. E. Hunter, M. Eckert, and M. Palmer, "An overview of the CRAFT concept annotation guidelines", Proceedings of the Fourth Linguistic Annotation Workshop: Association for Computational Linguistics, pp. 207–211, 2010.
Syndicate content