the university of alabama at birmingham | uab - databases · 2016-08-09 · outline •...
TRANSCRIPT
Metabolomics Databases
Xiuxia Du, Stephen Barnes
Outline• Comprehensive metabolomics databases
• Compound databases
• Spectral databases
• Metabolic pathway databases
• Drug databases
• Disease & physiology databases
• Raw data databases
2
Outline• Comprehensive metabolomics databases
• Compound databases
• Spectral databases
• Metabolic pathway databases
• Drug databases
• Disease & physiology databases
• Raw data databases
3
Comprehensive databases• HMDB
4
HMDB• Overview
5
• Breakdown by mass
HMDB
6
HMDB• One metabocard
7
HMDB• One metabocard
8
HMDB• Searches
9
HMDB• Downloads
10
HMDB• Download all metabolites
11
Outline• Comprehensive metabolomics databases
• Compound databases
• Spectral databases
• Metabolic pathway databases
• Drug databases
• Disease & physiology databases
• Raw data databases
12
Compound databases• PubChem
• ChemSpider
• ChEBI
• KEGG Glycan
• IIMDB
13
Compound databases• PubChem
• ChemSpider
• ChEBI
• KEGG Glycan
• IIMDB
14
PubChem• Website
15
PubChem• Statistics (July 12, 2016)
16
PubChem• Breakdown by mass
17
PubChem• Information on one compound
18
PubChem• Information on one compound
19
PubChem• Search and FTP
20
PubChem• FTP
21
PubChem• Compound XML files
22
PubChem• Download compound XML files
• Total number of XML files: 4,849
• Number of compounds in each XML file: 25,00023
ChemSpider• Website
24
ChemSpider• Information
25
ChemSpider• One compound
26
27
28
ChemSpider• Compounds with the same molecular formula
29
ChEBI• Chemical Entities of Biological Interest
30
ChEBI• About
31
ChEBI• One compound
32
• Substructure search
33
ChEBI• One compound
34
Outline• Comprehensive metabolomics databases
• Compound databases
• Spectral databases
• Metabolic pathway databases
• Drug databases
• Disease & physiology databases
• Raw data databases
35
Spectral databases• NIST 14
• METLIN
• MassBank
• MoNA
• Gold Metabolome Database
• Feign GC-MS database
• HMDB
• BMRB
• Madison Metabolomics Consortium Database
• BML-NMR
• mzCloud
36
Spectral databases• NIST 14
• METLIN
• MassBank
• MoNA
• Gold Metabolome Database
• Feign GC-MS database
• HMDB
• BMRB
• Madison Metabolomics Consortium Database
• BML-NMR
• mzCloud
37
NIST 14• Electron ionization mass spectral library
- 276,259 spectra of 242,477 unique compounds
• MS/MS library: 234,284 spectra
- 51,216 ion trap spectra for 42,126 different ions of 8,171 compounds
- 183,068 collision cell spectra (QTOF and tandem quad) spectra for 14,835 different ions of 7,692 compounds
38
NIST 14 EI library
39
NIST 14 EI library
40
• Focuses on
- Drugs, metabolites, and poisons
- Pesticides and fungicides
- Organics present in soil, water, and air
- Amino acids, di- and tai-peptides
- Common sample contaminants
- Common analytical derivatives of the above
NIST 14 EI library
41
• Breakdown by mass
42
NIST 14 MS/MS library
METLIN
43
METLIN
44
METLIN
45
METLIN
46
METLIN
47
METLIN
48
MassBank
49
• About
MassBank
50
• Database services
MassBank
51
MassBank
52
MassBank
53
MassBank
54
MassBank
55
MoNA
56
Metabolic pathway databases• KEGG
• MetaCyc
• HumanCyc
• BioCyc
• Reactome
• WikiPathways
57
Drug databases• DrugBank
• Therapeutic target databases
• PharmGKB
• STITCH
• SuperTarget
58
Disease & physiology databases• OMIM
• METAGENE
• OMMBID
59
Raw data databases• Metabolomics Workbench
- Funded by the NIH Common Fund Metabolomics Program
- Serve as a national and international repository for metabolomics data and metadata
- Provide access to raw data, metabolite standards, protocols ……
60
Raw data databases• Metabolomics Workbench
• MetaboLights
61
Metabolomics orkbench• Website
62
Raw data databases• Metabolomics Workbench: summary of all studies
63
Raw data databases• Metabolomics Workbench: species
64
Raw data databases• Metabolomics Workbench: sample sources
65
Raw data databases• Metabolomics Workbench: diseases
66
Raw data databases• MetaboLights
67
PubChem, NIST, and HMDB, again
68
3,412,370 2,000
471
59,621
1
PubChem NIST
HMDB
5,0834,647
3,481,721 66,269
10,202
• In terms of unique molecular formula
PubChem, NIST, and HMDB, again
69
PubChem HMDB
91,101,371 26,65315,234
91,116,605 41,887
• In terms of unique InChi Key
Acknowledgement• Aleksandr Smirnov
70
71
Thank you!