2 nd international conference on biomedical ontology (icbo’11)

47
2 nd International Conference on Biomedical Ontology (ICBO’11) Ontology-Based Analysis of Event-Related Potentials Gwen Frishkoff 12 , Robert Frank 2 , Paea LePendu 3 , & Snežana Nikolič 1 1 Psychology & Neuroscience, Georgia State University 2 NeuroInformatics Center (NCBO), University of Oregon 3 National Center for Biomedical Ontology (NCBO), Stanford University http://nemo.nic.uoregon.edu

Upload: nuru

Post on 23-Feb-2016

49 views

Category:

Documents


0 download

DESCRIPTION

2 nd International Conference on Biomedical Ontology (ICBO’11). Ontology-Based Analysis of Event-Related Potentials Gwen Frishkoff 12 , Robert Frank 2 , Paea LePendu 3 , & Snežana Nikoli č 1 1 Psychology & Neuroscience, Georgia State University - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

2nd International Conference on Biomedical Ontology (ICBO’11)

Ontology-Based Analysis of Event-Related Potentials

Gwen Frishkoff12, Robert Frank2, Paea LePendu3,

& Snežana Nikolič1

1 Psychology & Neuroscience, Georgia State University2 NeuroInformatics Center (NCBO), University of Oregon

3 National Center for Biomedical Ontology (NCBO), Stanford University

http://nemo.nic.uoregon.edu

Page 2: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Why ontology-based analysis? Linking Data to Knowledge in Human Neuroscience

Ontology-based analysis of ERP data Data Information

• Pipeline for automated (and therefore objective)separation of ERP patterns and extraction of summary metrics for each pattern

Information Knowledge• Ontology to represent metrics in semantically structured

way so as to automatically classify & label ERP patterns within and across experiments

Overview

Page 3: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Why ontology-based analysis? Linking Data to Knowledge in Human Neuroscience

Ontology-based analysis of ERP data Data Information

• Pipeline for automated (and therefore objective)separation of brainwave (ERP) patterns and automated extraction of summary metrics, which are output to RDF

Information Knowledge• Ontology to represent data (in RDF) and automatically

(and therefore objectively) classify & label ERP patterns within and across experiments

Overview

Page 4: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

“The plural of ‘anecdote’ is not ‘data’.”

— Roger Brinner (Economist)

Assertion #1: In a scientific domain, the priority should be to capture and track

assertions about data.

Corollary: To capture complex (and presently ill-defined) patterns in data, we need bottom-up (data-driven) analysis.

Page 5: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

The plural of ‘data’ is not ‘knowledge’.

Assertion #2: To draw meaningful inferences from data, they must be linked to

a well-structured knowledge base

(ontology).

Page 6: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

ONTOLOGY

DATA

INFORMATION

Data mining (i.e., analysis)

Knowledge engineering

Ontology mining?

Page 7: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

The plural of ‘data’ is not ‘knowledge’.

Assertion #3: Ontology Semantic Structure. It cannot be automatically

extracted from data (or patterns in data). Cf. Searle’s Chinese Room argument…

Corollary: To build a valid ontology, we need top-down (knowledge-driven) methods

(ala BFO/OBO).

Page 8: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Introduction to ERP Domain (I):The Data = Measurements of Scalp EEG

EEGs (“brainwaves” or flunctuations in brain electrical potentials) are recorded by placing two or more electrodes on the scalp surface.

256-channel Geodesic Sensor Net ~5,000 ms

Page 9: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Introduction to ERP Domain (II):From EEG to Event-Related Potentials (ERP)

ERPs (event-related potentials) are the result of averaging across multiple segments of EEG, time-locking to an event of interest.

AVERAGE OVER (LOTS OF)

EEG SEGMENTS

EEG

ERP

Page 10: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Introduction to ERP Domain (III): Entities of Interest = ERP Patterns (in Data!)

ERP patterns characterized by three types of attributes:

(1) TIME latency of peak positive or peak negative potential (left) (2) SPACE scalp topography of this potential (right); and(3) FUNCTION experimental context in which these patterns are characteristically observed (e.g., presentation of visual stimulus)

120 ms

Page 11: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

• Tried and true method for noninvasive brain functional mapping

• Direct measure neuronal activity• Whole-brain measurement (at scalp)• Millisecond temporal resolution• Portable and inexpensive• Important clinical applications (e.g., potential biomarkers for AD, presurgical planning)

• Recent innovations give new windows into rich, multi-dimensional patterns– Rich spatial info (high-density EEG)– Combined temporal & spectral info (JTF)– Multimodal (EEG/ fMRI/MEG) measures

1 sec

What’s great about ERPs …

Page 12: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

If ERPs are so great….

Why are there so few meaningful applications in biomedicine?

And why so few (arguably no) cross-lab meta-analyses?

Page 13: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Problem #1: Patterns superposed in space & timeLATENT (INFERRED) PATTERNS

(THIS IS WHAT WE WANT TO TALK ABOUT)MEASURED DATA

(THIS IS WHAT WE ACTUALLY MEASURE/OBSERVE!)

Superposition

Page 14: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Everyone has one, and nobody likes to use anyone else’s.

Problem #2a: Patterns (actually, pattern labels) are like toothbrushes…

Prosody-specific negativityPhonological mapping negativity

Medial frontal

negativity

MEANINGFULNESS

RECOGNITION

POTENTIAL

fN400old-new

effect

N400 Effect

N300

Page 15: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

410 ms

450 ms

330 ms

Consider a Hypothetical Database Query: Show me all the N400 patterns in the database.

Peak latency 410 ms

“CANONICAL N400”

Will the “real” N400 please step forward?

Problem #2b: Conversely, different scientists use the same label for incommensurable patterns.

Page 16: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Putative “N400”-labeled patterns

Parietal N400

≠≠

fN400

Parietal P600

Assertion #3: We cannot ground ERP meta-analysis in prior

literature (e.g., text mining). We need a reliable workflow for data

analysis & classification.

Page 17: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Summary: Motivation for NEMO• Lots of different — and equally valid! — methods for

pattern analysis• Inconsistent and subjective use of metrics and labels for

pattern summary and classification• No existing methods or tools to support ERP data

sharing and integration

Assertion #4: The best way to address these issues is to combine data-driven methods for pattern analysis with knowledge-driven methods for ontology development

and application (to interpret analysis results)

Page 18: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Neural ElectroMagnetic Ontologies

A set of formal (OWL) ontologies for representation of ERP domain concepts

A suite of tools for data-driven extraction and ontology-based annotation of ERP patterns

A database that includes publicly available, annotated data from our NEMO ERP consortium to demonstrate application of ontology for quantitative meta-analysis of results from studies of language and cognition

Page 19: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Why ontology-based analysis? Linking Data to Knowledge in Human Neuroscience

Ontology-based analysis of ERP data Data Information

• Pipeline for automated (and therefore objective)separation of ERP patterns and extraction of summary metrics for each pattern

Information Knowledge• Ontology to represent data (in RDF) and automatically

(and therefore objectively) classify & label ERP patterns within and across experiments

Overview

Page 20: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

FROM DATA TO INFORMATION….

Extraction of meaningful

patterns (i.e., data analylsis)

Page 21: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

ERP Pattern Analysis: Current Practice

N400 component

P3 component“Bumpology”

Bumpology^2?

Page 22: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

NEMO Ontology-based Analysis: Overview

1. ERP Pattern Extraction

2. ERP Metric Extraction

3. RDF Generation (Data Annotation)

4. (Metadata Entry)

5. ERP Pattern Classification

Page 23: 2 nd  International Conference on Biomedical Ontology (ICBO’11)
Page 24: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

1. NEMO Pattern Extraction

NEMO ERP Pattern Extraction Toolkithttp://nemoontologies.svn.sourceforge.net/viewvc/nemoontologies/toolkit/release

• NEMO_ERP_Pattern_Decomposition/• NEMO_ERP_Pattern_Segmentation/

Page 25: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Pattern Extraction I: DecompositionAdvantages:

• Data-driven• Automated/ Objective• Sensitive (able to separate

superposed patterns)

P100

N100

fP2

P1r/ N3

P1r/ MFN

100ms

170ms

200ms

280ms

400ms

Disdvantages:• Requires expertise (~vanilla

PCA)• Not used by majority of ERP

researchers

NEMO ERP Pattern Extraction Toolkithttp://nemoontologies.svn.sourceforge.net/viewvc/nemoontologies/toolkit/release

• NEMO_ERP_Pattern_Decomposition/

Page 26: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Pattern Extraction II: Segmentation

NEMO ERP Pattern Extraction Toolkithttp://nemoontologies.svn.sourceforge.net/viewvc/nemoontologies/toolkit/release

• NEMO_ERP_Pattern_Segmentation/

Page 27: 2 nd  International Conference on Biomedical Ontology (ICBO’11)
Page 28: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

2. Metric Extraction

NEMO ERP Metric Extraction Toolkithttp://nemoontologies.svn.sourceforge.net/viewvc/nemoontologies/toolkit/release

• NEMO_ERP_Metric_Extraction/

Page 29: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Typical semi-structured representation of ERP data

Peak latency measurement (in ms)

ERP pattern (extracted from “raw” ERP data using PCA/ICA etc.)

Page 30: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Why ontology-based analysis? Linking Data to Knowledge in Human Neuroscience

Ontology-based analysis of ERP data Data Information

• Pipeline for automated (and therefore objective)separation of brainwave (ERP) patterns and automated extraction of summary metrics, which are output to RDF

Information Knowledge• Ontology to represent metrics in semantically structured

way so as to automatically classify & label ERP patterns within and across experiments

Overview

Page 31: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

ONTOLOGY

FROM INFORMATION TO KNOWLEDGE….

Page 32: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

NEMO Ontology-based Analysis: Overview

1. ERP Pattern Extraction

2. ERP Metric Extraction

3. RDF Generation (Data Annotation)

4. (Metadata Entry)

5. ERP Pattern Classification

Page 33: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Recall: Entities of interest (at Stage 1) = Patterns in Data

1 sec

TIME SPACE

FUNCTION Modulation of pattern features (time,

space, amplitude) in different experiment conditions

Page 34: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

NEMO Ontology (in a nutshell)

L1: Brain

Physiological processes(BFO/OPB)

L3: Brain

Physiological data

(OBI/IAO)

Page 35: 2 nd  International Conference on Biomedical Ontology (ICBO’11)
Page 36: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

3. RDF Generation

NEMO ERP Metric Extraction Toolkithttp://nemoontologies.svn.sourceforge.net/viewvc/nemoontologies/toolkit/release

• NEMO_ERP_Metric_Extraction/

# OWL Ontology Declaration / Import: GAF-LP1_NN_ERP_data<http://purl.bioontology.org/NEMO/data/GAF-LP1_NN_ERP_data> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/2002/07/owl#Ontology>.<http://purl.bioontology.org/NEMO/data/GAF-LP1_NN_ERP_data> <http://www.w3.org/2002/07/owl#imports> <http://purl.bioontology.org/NEMO/ontology/NEMO.owl>.

# Instance Declaration 000: GAF-LP1_NN_ERP_data<http://purl.bioontology.org/NEMO/data/GAF-LP1_NN_ERP_data> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.bioontology.org/NEMO/ontology/NEMO.owl#NEMO_0000495>.

Page 37: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Data annotation using RDF “Triples”

In natural language =

The data represented in cell Z (row A, column 1) is an instance of (“is a”) a peak latency temporal measurement (i.e., the time at which the pattern is of maximal amplitude)

Note that the predicate links an instance to a class within NEMO ontology.

In RDF form: <002> <type> <NEMO_0745000>Subject – Predicate –

Object

Page 38: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

GOAL: Represent extracted information with rich, formal semantics that allow us to reason over data (both within and across datasets)

RDF Graph (“triples”)

Page 39: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

ERP PATTERN CLASSIFICATION

Page 40: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

5. Pattern Classification (I)(1) Temporal Criterion

(3) Functional Criterion

(2) Spatial Criterion

Page 41: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

5. Pattern Classification (II)

RDF Data loads NEMO ontology

RDF Data is opened in Protégé ontology editing software

Page 42: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

5. Pattern Classification (III)

HermiT Reasoner is used to generate inferences

Page 43: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

5. Pattern Classification (IV)

Instance-level information (i.e., ERP pattern instances) are successfully classified!

Page 44: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Take-home messages1. For some biomedical applications it may important to capture L3

(DATA) as well as L1 (REALITY) explicitly, i.e., within the ontology

2. In linking the data to the ontology (e.g., for classification/labeling of patterns), it may be important consider data-driven methods for pattern analysis and metric extraction

3. An advantage of this approach is that we can generate relatively stable (non-controversial) representation of data (RDF artifacts), which we will archive and maintain — separate from, but linked to, the ontology — even as the ontology is uncertain & changing.

4. Further, robust representation of data across studies provides basis for valid quantitative meta-analysis, which provide high-quality evidence to inform pattern rules in the ontology

Page 45: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Ongoing Work & Open Issues• Evolving pattern rules to represent more

complex functional criteria (i.e., expt metadata)• Temporal reasoning (can we squeeze this into

DL/OWL?)• Representing uncertainty in pattern rules &

classification of pattern instances (beyond Evidence Codes?)

• Clinical applications: Pilot cross-lab work with aphasics (stroke & TBI patients with language disorders)

Page 46: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Funding from the National Institutes of Health (NIBIB), R01-MH084812 (Dou, Frishkoff, Malony)

NEMO Ontology Task ForceRobert M. Frank (NIC)Dejing Dou (CIS)Paea LePendu (CIS)Haishan Liu (CIS)Allen Malony (NIC, CIS)Jason Sydes (CIS)*Snezana Nikolic (PSY, GSU)

*emeritus

Acknowledgments

www.nemo.nic.uoregon.edu

NEMO EEG/MEG Data ConsortiumTim Curran (U. Colorado)Dennis Molfese (U. Louisville)John Connolly (McMaster U.)Kerry Kilborn (Glasgow U.)Charles Perfetti (U. Pittsburgh)

Special thanks to:Maryann Martone & associates (NIF)Jessica Turner (cogPO)Angela Laird (BrainMap)Sivaram Arabandi (OGMS)

YOU (BIO-ONTOLOGY

COMMUNITY)

Page 47: 2 nd  International Conference on Biomedical Ontology (ICBO’11)

Recent References• Frishkoff, G., Frank, R., LePendu, P., & Nikolic, S. (2011, in press). Ontology-

based Analysis of Event-Related Potentials. Proceedings of the International Conference on Biomedical Ontology (ICBO'11).

• Frishkoff, G., Frank, R., Sydes, J., Mueller, K., & Malony, A. (2011, subm). Minimal Information for Neural Electromagnetic Ontologies (MI-NEMO): A standards-compliant workflow for analysis and integration of human EEG. Standards in Genomic Sciences (SIGS).

• Liu, H., Frishkoff, G., Frank, R. M. F., & Dou, D. (2011, subm). Integration of Human Brain Data: Metric and Pattern Matching across Heterogeneous ERP Datasets. Journal of Neurocomputing.

• Frank, D. & Frishkoff, G. A. (2011, in prep.). The NEMO ERP Analysis Toolkit: Combining data-driven and knowledge-driven methods for ERP pattern analysis. Neuroinformatics.

• Frishkoff, G.A., Dou, D., Frank, R., LePendu, P., and Liu, H. (2009). Development of Neural Electromagnetic Ontologies (NEMO): Representation and integration of event-related brain potentials. Proceedings of the International Conference on Biomedical Ontologies (ICBO09). July 24-26, 2009. Buffalo, NY.