TU Darmstadt / ULB / TUbiblio

Text mining resources for the life sciences

Przybyła, Piotr and Shardlow, Matthew and Aubin, Sophie and Bossy, Robert and Eckart de Castilho, Richard and Piperidis, Stelios and McNaught, John and Ananiadou, Sophia (2016):
Text mining resources for the life sciences.
In: Database, Oxford Academic, pp. 1-30, 2016, DOI: 10.1093/database/baw145, [Online-Edition: http://database.oxfordjournals.org/content/2016/baw145.full....],
[Article]

Abstract

Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable—those that have the crucial ability to share information, enabling smooth integration and reusability.

Item Type: Article
Erschienen: 2016
Creators: Przybyła, Piotr and Shardlow, Matthew and Aubin, Sophie and Bossy, Robert and Eckart de Castilho, Richard and Piperidis, Stelios and McNaught, John and Ananiadou, Sophia
Title: Text mining resources for the life sciences
Language: English
Abstract:

Text mining is a powerful technology for quickly distilling key information from vast quantities of biomedical literature. However, to harness this power the researcher must be well versed in the availability, suitability, adaptability, interoperability and comparative accuracy of current text mining resources. In this survey, we give an overview of the text mining resources that exist in the life sciences to help researchers, especially those employed in biocuration, to engage with text mining in their own work. We categorize the various resources under three sections: Content Discovery looks at where and how to find biomedical publications for text mining; Knowledge Encoding describes the formats used to represent the different levels of information associated with content that enable text mining, including those formats used to carry such information between processes; Tools and Services gives an overview of workflow management systems that can be used to rapidly configure and compare domain- and task-specific processes, via access to a wide range of pre-built tools. We also provide links to relevant repositories in each section to enable the reader to find resources relevant to their own area of interest. Throughout this work we give a special focus to resources that are interoperable—those that have the crucial ability to share information, enabling smooth integration and reusability.

Journal or Publication Title: Database
Volume: 2016
Publisher: Oxford Academic
Uncontrolled Keywords: UKP_reviewed;UKP_p_OpenMinTeD;UKP_a_LangTech4eHum
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Ubiquitous Knowledge Processing
DFG-Graduiertenkollegs
DFG-Graduiertenkollegs > Research Training Group 1994 Adaptive Preparation of Information from Heterogeneous Sources
Date Deposited: 31 Dec 2016 14:29
DOI: 10.1093/database/baw145
Official URL: http://database.oxfordjournals.org/content/2016/baw145.full....
Identification Number: TUD-CS-2016-14665
Export:

Optionen (nur für Redakteure)

View Item View Item