integration ii prediction. kernel-based data integration svms and the kernel “trick”...

23
Integration II Prediction

Upload: barry-mcbride

Post on 12-Jan-2016

225 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Integration II

Prediction

Page 2: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data integration

• SVMs and the kernel “trick”• Multiple-kernel learning• Applications– Protein function prediction– Clinical prognosis

Page 3: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

These are expression measurements from two genes for two populations(cancer types)

The goal is to define a cancer type classifier...

[Noble, Nat. Biotechnology, 2006]

Page 4: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

These are expression measurements from two genes for two populations(cancer types)

The goal is to define a cancer type classifier...

One type of classifier is a “hyper-plane”that separates measurements fromtwo cancer types

[Noble, Nat. Biotechnology, 2006]

Page 5: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

These are expression measurements from two genes for two populations(cancer types)

The goal is to define a cancer type classifier...

One type of classifier is a “hyper-plane”that separates measurements fromtwo cancer types

E.g.: a one-dimensional hyper-plane

[Noble, Nat. Biotechnology, 2006]

Page 6: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

These are expression measurements from two genes for two populations(cancer types)

The goal is to define a cancer type classifier...

One type of classifier is a “hyper-plane”that separates measurements fromtwo cancer types

E.g.: a two-dimensional hyper-plane

[Noble, Nat. Biotechnology, 2006]

Page 7: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

Suppose that measurements are separable:there exists a hyperplane thatseparates two types

Then there are an infinite number ofseparating hyperplanes

Which to use?

[Noble, Nat. Biotechnology, 2006]

Page 8: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

Suppose that measurements are separable:there exists a hyperplane thatseparates two types

Then there are an infinite number ofseparating hyperplanes

Which to use?

The maximum-margin hyperplane

Equivalently: minimizer of

[Noble, Nat. Biotechnology, 2006]

Page 9: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

Which hyper-plane to use?

In reality: minimizer of trade-off between1. classification error, and2. margin size

loss penalty

Page 10: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

This is the primal problem

This is the dual problem

Page 11: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

What is K?

The kernel matrix:each entry is sample inner productone interpretation: sample similaritymeasurements completely described by K

Page 12: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

Implication:Non-linearity is obtained byappropriately defining kernelmatrix K

E.g. quadratic kernel:

Page 13: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

SVMs

Another implication:No need for measurement vectorsall that is required is similarity between samples

E.g. string kernels

Page 14: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Protein Structure PredictionProtein structure

Protein sequence

Sequence similarity

Page 15: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Protein Structure Prediction

Page 16: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

Core idea: use different kernels for different genomic data sources a linear combination of kernel matrices is a kernel (under certain conditions)

Page 17: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

Kernel to use in prediction:

Page 18: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

In general, the task is to estimateSVM function along withcoefficients of the kernelmatrix combination

This is a type of well-studiedoptimization problem(semi-definite program)

Page 19: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

Page 20: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

Page 21: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

Same idea applied to cancer classification from expression and proteomic data

Page 22: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion

• Prostate cancer dataset– 55 samples– Expression from microarray– Copy number variants

• Outcomes predicted:– Grade, stage, metastasis, recurrence

Page 23: Integration II Prediction. Kernel-based data integration SVMs and the kernel “trick” Multiple-kernel learning Applications – Protein function prediction

Kernel-based data fusion