chemical ontologies: what are they, what are they for and what are the challenges

1
ORAL PRESENTATION Open Access Chemical ontologies: what are they, what are they for and what are the challenges Janna Hastings * , Nico Adams, Marcus Ennis, Duncan Hull, Christoph Steinbeck From 6th German Conference on Chemoinformatics, GCC 2010 Goslar, Germany. 7-9 November 2010 Ontologies encode human knowledge in computationally accessible forms. They are designed to narrow the gap between the knowledge of human experts and the func- tionality available in computer systems, by expressing expert knowledge in a manner computers can manipulate and reason over. With the ever-growing deluge of data in modern scientific domains, researchers need intelligent tools able to filter out irrelevant and automatically orga- nise relevant information into meaningful categories. The Chemoinformatics and Metabolism team at the EBI is developing chemical ontologies for structure-based chemi- cal classification, role or bioactivity-based chemical classi- fication, and chemical information entities such as descriptors and algorithms. Our ontologies provide collec- tions of names and synonyms which are useful for text mining, stable identifiers which are essential to semantic integration of data, and a semantically rich encoding of many aspects of the chemical domain. But for such ontol- ogies to be maximally useful for diverse users and intero- perable with other ontologies in the scientific domain, similar-sounding things have to be disentangled in our language and our ontology. Recent work addresses the dis- tinguishing of structures from chemicals [1], and of bioac- tivity from drug uses [2]. Ontologies are backed by logical formalisms such as the Web Ontology Language, OWL. One of the challenges of chemical ontology is representing complex chemical structures in the underlying formalism. Cyclic structures prove particularly challenging for logic- based representation. Recent research in our group inves- tigated the inclusion of chemical graphs in OWL [3]. Inte- grating chemoinformatics tools with chemical ontology is the subject of ongoing research. Published: 19 April 2011 References 1. Hastings J, Batchelor C, Steinbeck C, Schulz S: What are Chemical Structures and their Relations? Front. Artif. Intell. Appl. 2010, 209:257-270. 2. Batchelor C, Hastings J, Steinbeck C: Ontological dependence, dispositions and institutional reality in chemistry. Front. Artif. Intell. Appl. 2010, 209:271-284. 3. Hastings J, Dumontier M, Hull D, Horridge M, Stevens R, Sattler U, Steinbeck C, Horne T, Britz K: Representing chemicals using OWL, Description Graphs and Rules. In CEUR Workshop Proceedings, OWLED 2010: OWL: Experiences and Directions, Seventh International Workshop. Volume 614. San Francisco, California, USA;Clark K, Sirin E 2010. doi:10.1186/1758-2946-3-S1-O4 Cite this article as: Hastings et al.: Chemical ontologies: what are they, what are they for and what are the challenges. Journal of Cheminformatics 2011 3(Suppl 1):O4. Open access provides opportunities to our colleagues in other parts of the globe, by allowing anyone to view the content free of charge. Publish with ChemistryCentral and every scientist can read your work free of charge W. Jeffery Hurst, The Hershey Company. available free of charge to the entire scientific community peer reviewed and published immediately upon acceptance cited in PubMed and archived on PubMed Central yours you keep the copyright Submit your manuscript here: http://www.chemistrycentral.com/manuscript/ * Correspondence: [email protected] Chemoinformatics and Metabolism team, European Bioinformatics Institute, Hinxton, UK Hastings et al. Journal of Cheminformatics 2011, 3(Suppl 1):O4 http://www.jcheminf.com/content/3/S1/O4 © 2011 Hastings et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Upload: janna-hastings

Post on 06-Jul-2016

212 views

Category:

Documents


0 download

TRANSCRIPT

ORAL PRESENTATION Open Access

Chemical ontologies: what are they, what arethey for and what are the challengesJanna Hastings*, Nico Adams, Marcus Ennis, Duncan Hull, Christoph Steinbeck

From 6th German Conference on Chemoinformatics, GCC 2010Goslar, Germany. 7-9 November 2010

Ontologies encode human knowledge in computationallyaccessible forms. They are designed to narrow the gapbetween the knowledge of human experts and the func-tionality available in computer systems, by expressingexpert knowledge in a manner computers can manipulateand reason over. With the ever-growing deluge of data inmodern scientific domains, researchers need intelligenttools able to filter out irrelevant and automatically orga-nise relevant information into meaningful categories. TheChemoinformatics and Metabolism team at the EBI isdeveloping chemical ontologies for structure-based chemi-cal classification, role or bioactivity-based chemical classi-fication, and chemical information entities such asdescriptors and algorithms. Our ontologies provide collec-tions of names and synonyms which are useful for textmining, stable identifiers which are essential to semanticintegration of data, and a semantically rich encoding ofmany aspects of the chemical domain. But for such ontol-ogies to be maximally useful for diverse users and intero-perable with other ontologies in the scientific domain,similar-sounding things have to be disentangled in ourlanguage and our ontology. Recent work addresses the dis-tinguishing of structures from chemicals [1], and of bioac-tivity from drug uses [2]. Ontologies are backed by logicalformalisms such as the Web Ontology Language, OWL.One of the challenges of chemical ontology is representingcomplex chemical structures in the underlying formalism.Cyclic structures prove particularly challenging for logic-based representation. Recent research in our group inves-tigated the inclusion of chemical graphs in OWL [3]. Inte-grating chemoinformatics tools with chemical ontology isthe subject of ongoing research.

Published: 19 April 2011

References1. Hastings J, Batchelor C, Steinbeck C, Schulz S: What are Chemical

Structures and their Relations? Front. Artif. Intell. Appl. 2010, 209:257-270.2. Batchelor C, Hastings J, Steinbeck C: Ontological dependence, dispositions

and institutional reality in chemistry. Front. Artif. Intell. Appl. 2010,209:271-284.

3. Hastings J, Dumontier M, Hull D, Horridge M, Stevens R, Sattler U,Steinbeck C, Horne T, Britz K: Representing chemicals using OWL,Description Graphs and Rules. In CEUR Workshop Proceedings, OWLED2010: OWL: Experiences and Directions, Seventh International Workshop.Volume 614. San Francisco, California, USA;Clark K, Sirin E 2010.

doi:10.1186/1758-2946-3-S1-O4Cite this article as: Hastings et al.: Chemical ontologies: what are they,what are they for and what are the challenges. Journal ofCheminformatics 2011 3(Suppl 1):O4.

Open access provides opportunities to our colleagues in other parts of the globe, by allowing

anyone to view the content free of charge.

Publish with ChemistryCentral and everyscientist can read your work free of charge

W. Jeffery Hurst, The Hershey Company.

available free of charge to the entire scientific communitypeer reviewed and published immediately upon acceptancecited in PubMed and archived on PubMed Centralyours you keep the copyright

Submit your manuscript here:http://www.chemistrycentral.com/manuscript/

* Correspondence: [email protected] and Metabolism team, European Bioinformatics Institute,Hinxton, UK

Hastings et al. Journal of Cheminformatics 2011, 3(Suppl 1):O4http://www.jcheminf.com/content/3/S1/O4

© 2011 Hastings et al; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative CommonsAttribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction inany medium, provided the original work is properly cited.