research summary beta
TRANSCRIPT
Projects
Wenbo MuM.S, Bioinformatics
M.S, StatisticsB.S, Compute Sciences
•Analysis of Gene Expression
•Analysis of Genetic Variants
•Analysis of DNA Methylation
•Development of Data Mining Methods
Analysis Of Gene Expression
• Identified uniquely dys-regulated miRNAs in two types of colorectal cancer (MSI/MSS).
• Discovered fenfluramine-induced gene dys-regulation pattern in human pulmonary artery smooth muscle and endothelial cells.
• Explored distinct temporal involvement of microRNAs and pathways in lipopolysaccharide-induced acute lung injury in mice
Analysis Of Gene Expression---Quality Control
Box plot Clustering PCA
S2 S5 U3 U5 U6 C3 C6 T10 T2 T5 T8 U9 U12
24
68
1012
Boxplot of Intensity
C3
C5 C4
C1
C6
C2
C7 U8
U10 U3
U6 T6 S5 U1 T11
S6 T2 T4U11
T12
T1 T8 U5
S4 U2
S2 S1 S3 S7T3 T7
T10 T9 U9
U12 U4 T5
T13
U13
0.00
0.05
0.10
0.15
0.20
0.25
0.30
0.35
Sample Clustering (ward)Height
●●
●
●●
●
●●
●●
●
●
●
●
●
●
●
●
●●
●
●
●
●
●
●
●
●
● ●
●●
●
●
●
●
●
●
●
−60 −40 −20 0 20 40 60
−150
−100
−50
0
Dimension 1
Dim
ensi
on 2
MSSMSIHealthy
Analysis Of Gene Expression---Analysis of mRNA Expression
A
B
0
500
1000
1500
2000
2500
3000
0
500
1000
1500
2000
PASMC PAEC
Total changed All upregulated Upregulated (>2) All downregulated Downregulated (>2)
Num
ber o
f Gen
es
Num
ber o
f Gen
es
>2 fold
1.5-2 fold <1.5 fold
PASMC↑ PAEC↑
PASMC↑ PAEC↓
PASMC↓ PAEC↑
PASMC↓ PAEC↓ PA
EC
PAS
MC
2
PAS
MC
2
PAS
M32
C
0
2
4
6
8
10
12
Fold
Cha
nge
1.5
Cyt
okin
e-cy
toki
ne re
cept
or in
tera
ctio
n P
athw
ays
in c
ance
r Fo
cal a
dhes
ion
MA
KP
sign
alin
g tra
nsdu
ctio
n
Cal
cium
sig
nalin
g pa
thw
ay
Axo
n gu
idan
ce
Reg
ulat
ion
of a
ctin
cyt
oske
leto
n C
ell a
dhes
ion
mol
ecul
es (C
AM
s)
EC
M-r
ecep
tor i
nter
actio
n
TGF-β
sign
alin
g pa
thw
ay
Vasc
ular
sm
ooth
mus
cle
cont
ract
ion
Wnt
sig
nalin
g pa
thw
ay
Neu
roac
tive
ligan
d-re
cept
or in
tera
ctio
n
Cel
l sur
face
rece
ptor
link
ed s
igna
l tra
nsdu
ctio
n
Che
mok
ine
sign
alin
g pa
thw
ay
PASMC enriched GO Terms and Pathways
Yao W, Mu W*, Zeifman A, Lofti M, Remillard CV, Makino A, Garcia JGN, Yuan JX, Zhang W. Fenfluramine-induced gene dysregulation in normal human pulmonary artery smooth muscle and endothelial cells. Pulmonary Circulation. 2011; 1(3): 405-418. PMID: 22140631
GO term and Pathway AnalysisOverview of Differential Genes
Analysis Of Gene Expression---Functional Analysis
Network Analysis
Yao W, Mu W*, Zeifman A, Lofti M, Remillard CV, Makino A, Garcia JGN, Yuan JX, Zhang W. Fenfluramine-induced gene dysregulation in normal human pulmonary artery smooth muscle and endothelial cells. Pulmonary Circulation. 2011; 1(3): 405-418. PMID: 22140631
STK35
BMPR1B
ZFYVE16
TLL2
AMHR2
GDF10
Cytoscape
Analysis Of Gene Expression---Integrative Analysis of mRNA and microRNA
Integrated analysis to identify biologically-relevant miRNA/mRNA
Differential expression in CRC vs. normal
20#MSS#CRCs#
12#MSI#CRCs#
7#normal#mucosa#
miRNA/mRNA expression profiling
GeneChip® Human Gene 1.0 ST
(29k transcripts)
TaqMan® Human MicroRNA ‘A’
(377 microRNAs)
Samples
Identification of miRNAs and mRNAs differentially expressed between CRC and
normal mucosa
Prediction of mRNA targets according to miRNA expression
Identification of MSI-specific, MSS-specific, and pairs common to both
phenotypes (Table'1)
Selection of 10 highest ranked pairs (Figure'1): • miRNA absolute fold change > 2.0
• target mRNA greatest absolute fold change
Identification of inversely correlated mRNA/miRNA pairs within each CRC phenotype (MSI and MSS) 41#
0#
1#
2#
3#
4#
Expression
*Tumor# Normal#
miRNA#*#
*#
42#
41#
0#
1#
2#Expression
**#
*#mRNA#*#
Xicola R, Mu W, Rawson J.B, Huang L, Sapoznik V.R, Doyle B.J, Jover R, Carracedo A, Andreu M, Bessa X, Castells A, Boland C.R, Goel A, Investigators E, Dai Y, Llor X. Identification of miRNAs and their gene targets differentially expressed in microsatellite stable and unstable colorectal cancers through an integrated analysis. Digestive Disease Week. 2011
Analysis Of Genetic Variants
• Preprocess Raw Data and Quality Control
• GWAS
• eQTL Identification
• Meta-analysis
• Admixture Mapping
• Genotype Imputation
Analysis Of Genetic Variants---Quality Control
Analysis Of Genetic Variants---GWAS
Analysis Of Genetic Variants---Imputation
Analysis Of Genetic Variants---Admixture Mapping
−50
510
15
1 2 3 4 5 6 7 8 9 11 12 14 16 18 21
− log10(P)Score
Analysis Of DNA Methylation
• DNA methylation differentiation
• mQTL Identification
Analysis Of DNA Methylation---Quality Control
60#CEU#and#73#YRI#samples#
485,578&CpG#sites#
Illumina#450K#methyla=on#array#
1.#Remove#probes#whose#call#rate#<#0.95#2.#Remove#probes#that#map#ambiguously#to#the#genome#(~140,000)#3.#Remove#probes#that#contain#common#SNPs#[MAF>0.01]#(~55,000)#4.#Remove#probes#on#sex#chromosomes#(~7,000)#5.#Control#for#batch#effect#using#COMBAT#
283,540&highly#informa=ve##CpG#sites#
Extract#CpG#sites#by#gene#symbol#annota=on#Iden=fy#differen=al#methylated#probes#between#YRI#and#CEU#samples#using#Wilcoxon#Rank_sum#test#Evaluate#significance#of#methyla=on#enrichment#for#VIPs#using#fisher#exact#test##
135(24)&differen=ally#methylated#CpG#sites#
402&CpG#sites#within#43#VIPs#
Analysis Of DNA Methylation---mQTL Identification
Analysis Of DNA Methylation---mQTL Identification
Sample Codes---eQTL Identification
Development Of Data Mining Method
Development Of Data Mining MethodmRNA
expression miRNA
expression miRNA target
prediction
Correlation coefficient matrices
TF binding similarity score matrix
miRNA binding score matrix
Score matrix Module
Identification algorithm
Transcription factor binding profile
Module miRNA set
TF set Gene set
Mu W, Roqueiro D, Yang D. A Local Genetic Algorithm for the Identification of Condition-Specific microRNA-Gene Modules. Special Issue on Computational Systems Biology
Development Of Data Mining Method
Histogram of Permutated Modules
p value: 0.0034
Freq
uenc
y
0.55 0.60 0.65 0.70 0.75 0.80 0.85
020
040
060
080
010
00
●
Development Of Data Mining Method
SKILLS