MURI Meeting, July 2002

Gert Lanckriet ([email protected])

L. El Ghaoui, M. Jordan, C. Bhattacharyya, N. Cristianini, P. Bartlett

U.C. Berkeley

Convex Optimization in Machine Learning


Convex Optimization in Machine Learning

LP, QP, QCQP, SOCP, SDP

Advanced Convex Optimization in Machine Learning

Linear Programming (LP)

Second Order Cone Programming (SOCP)

Semi-Definite Programming (SDP)
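For orientation, the three problem classes listed above can be written in their usual textbook forms. This is a sketch with generic notation that is assumed here, not taken from the slides: $x$ is the optimization variable, the $F_i$ are symmetric matrices, and $\succeq 0$ means positive semidefinite. The symbols are local to this block.

```latex
\begin{aligned}
\text{LP:}\quad & \min_x \; c^T x \quad \text{s.t.}\quad A x \le b \\
\text{SOCP:}\quad & \min_x \; c^T x \quad \text{s.t.}\quad \|A_i x + b_i\|_2 \le c_i^T x + d_i,\quad i = 1,\dots,m \\
\text{SDP:}\quad & \min_x \; c^T x \quad \text{s.t.}\quad F_0 + \sum_{i=1}^{n} x_i F_i \succeq 0
\end{aligned}
```

Each class contains the previous one: an LP is an SOCP whose cone constraints have $A_i = 0$, and every second-order cone constraint can be rewritten as a linear matrix inequality.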

MPM: Problem Sketch (1)

$a^T z = b$: decision hyperplane


Probability of misclassification…

… for worst-case class-conditional density…

… should be minimized!
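The three-step statement above can be collected into one optimization problem. The notation below is an assumption, following the published minimax probability machine formulation rather than anything shown on this slide: $x$ and $y$ are random vectors from the two classes, known only through their means $\bar{x}, \bar{y}$ and covariances $\Sigma_x, \Sigma_y$, and $\alpha$ is the worst-case probability of correct classification.

```latex
\max_{\alpha,\; a \neq 0,\; b} \ \alpha
\quad \text{s.t.} \quad
\inf_{x \sim (\bar{x}, \Sigma_x)} \Pr\{a^T x \ge b\} \ge \alpha,
\qquad
\inf_{y \sim (\bar{y}, \Sigma_y)} \Pr\{a^T y \le b\} \ge \alpha
```

The infimum runs over all distributions with the given mean and covariance, so $1 - \alpha$ bounds the misclassification probability for the worst-case class-conditional density.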

MPM: Main Result (5)

MPM: Geometric Interpretation

Robustness to Estimation Errors: Robust MPM (R-MPM)

Robust MPM (R-MPM)

MPM: Convex Optimization to solve the problem

Linear Classifier

Nonlinear Classifier (Kernelizing)

Convex Optimization: Second Order Cone Program (SOCP)

Competitive with Quadratic Program (QP) SVMs

Lemma
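A minimal sketch of the linear case, assuming the SOCP form from the published MPM formulation: minimize sqrt(a^T Σx a) + sqrt(a^T Σy a) subject to a^T (x̄ − ȳ) = 1. To stay dependency-free it handles only 2-D data, where the constraint leaves a one-parameter family of directions and the convex objective can be minimized by ternary search. This is a stand-in for a real SOCP solver, and `fit_mpm_2d`, `mean_and_cov` and `spread` are hypothetical helper names.

```python
import math

def mean_and_cov(points):
    """Sample mean and (biased) covariance of a list of 2-D points."""
    n = len(points)
    m0 = sum(p[0] for p in points) / n
    m1 = sum(p[1] for p in points) / n
    c00 = sum((p[0] - m0) ** 2 for p in points) / n
    c11 = sum((p[1] - m1) ** 2 for p in points) / n
    c01 = sum((p[0] - m0) * (p[1] - m1) for p in points) / n
    return (m0, m1), ((c00, c01), (c01, c11))

def spread(a, cov):
    """sqrt(a^T cov a) for a 2-D vector a."""
    q = cov[0][0] * a[0] ** 2 + 2.0 * cov[0][1] * a[0] * a[1] + cov[1][1] * a[1] ** 2
    return math.sqrt(max(q, 0.0))

def fit_mpm_2d(x_pts, y_pts, iters=200):
    """Return (a, b, alpha): hyperplane a^T z = b and worst-case accuracy bound."""
    x_bar, cov_x = mean_and_cov(x_pts)
    y_bar, cov_y = mean_and_cov(y_pts)
    d = (x_bar[0] - y_bar[0], x_bar[1] - y_bar[1])
    dd = d[0] ** 2 + d[1] ** 2
    if dd == 0.0:
        raise ValueError("class means coincide; no separating direction")
    a0 = (d[0] / dd, d[1] / dd)   # particular solution of a^T d = 1
    f = (-d[1], d[0])             # direction orthogonal to d

    def direction(u):
        return (a0[0] + u * f[0], a0[1] + u * f[1])

    def objective(u):
        a = direction(u)
        return spread(a, cov_x) + spread(a, cov_y)

    # Grow a bracket until both ends are no better than u = 0 (convexity
    # then guarantees the minimizer lies inside the bracket).
    lo, hi = -1.0, 1.0
    for _ in range(60):
        if objective(lo) >= objective(0.0) and objective(hi) >= objective(0.0):
            break
        lo, hi = 2.0 * lo, 2.0 * hi

    # Ternary search on the convex one-dimensional objective.
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if objective(m1) < objective(m2):
            hi = m2
        else:
            lo = m1

    u = 0.5 * (lo + hi)
    a = direction(u)
    value = objective(u)
    if value == 0.0:
        raise ValueError("zero spread along the optimal direction")
    kappa = 1.0 / value
    alpha = kappa ** 2 / (1.0 + kappa ** 2)
    b = a[0] * x_bar[0] + a[1] * x_bar[1] - kappa * spread(a, cov_x)
    return a, b, alpha

# Toy data (hypothetical): two symmetric clusters around (2, 0) and (-2, 0).
x_pts = [(1.0, 0.0), (3.0, 0.0), (2.0, 1.0), (2.0, -1.0)]
y_pts = [(-1.0, 0.0), (-3.0, 0.0), (-2.0, 1.0), (-2.0, -1.0)]
a, b, alpha = fit_mpm_2d(x_pts, y_pts)
```

For this symmetric toy data the hyperplane is the vertical axis (a ≈ (0.25, 0), b ≈ 0) and the bound is alpha ≈ 8/9. In higher dimensions the same objective would be handed to an SOCP solver instead of a line search.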

MPM: Empirical results

Worst-case lower bound on correct classification (1 minus the worst-case misclassification probability) and TSA (test-set accuracy) of the MPM, compared to BPB (best performance in Breiman's report, "Arcing classifiers", 1996) and SVMs. Averages over 50 random partitions into 90% training and 10% test sets.

• Comparable with existing literature, SVMs

• The lower bound is indeed smaller than the test-set accuracy in all cases (consistent with its complement being a worst-case bound on the probability of misclassification)

• Kernelizing leads to more powerful decision boundaries (linear decision boundary < nonlinear decision boundary (Gaussian kernel))

The idea (1)

Machine learning

Kernel-based machine learning

The idea (2)

training set (labelled)

test set (unlabelled)

The idea (4)

Hard margin SVM classifiers (3)
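As a reminder of the classifier named in this slide title, the standard hard margin SVM is the quadratic program below. The notation is assumed here rather than taken from the slide: $x_i$ are the training points, $y_i \in \{-1, +1\}$ their labels, and $(w, b)$ the separating hyperplane.

```latex
\min_{w,\; b} \ \tfrac{1}{2}\,\|w\|_2^2
\quad \text{s.t.} \quad
y_i \,(w^T x_i + b) \ge 1, \qquad i = 1, \dots, n
```

The margin between the two classes is $2 / \|w\|_2$, so minimizing $\|w\|_2$ maximizes it. In the kernel setting the data enter only through inner products, which is what makes the kernel matrix itself a candidate for optimization.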

SDP!


training set (labelled)

test set (unlabelled)

Learning the kernel matrix!


?



Learning Kernel Matrix with SDP!
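A minimal sketch of the object being learned: a single kernel matrix over the labelled training points and the unlabelled test points together, split into training, mixed and test blocks. It assumes the matrix is a weighted combination of Gaussian kernel matrices of different widths. The weights here are fixed by hand and purely hypothetical, the SDP that would choose them is not solved, and all function names are illustrative.

```python
import math

def gaussian_kernel(u, v, sigma):
    """Gaussian (RBF) kernel value between two points."""
    sq_dist = sum((ui - vi) ** 2 for ui, vi in zip(u, v))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))

def gram_matrix(points, sigma):
    """Kernel matrix of all pairwise kernel values."""
    return [[gaussian_kernel(p, q, sigma) for q in points] for p in points]

def combine(kernels, weights):
    """Weighted sum of kernel matrices of equal size."""
    n = len(kernels[0])
    return [[sum(w * k[i][j] for w, k in zip(weights, kernels))
             for j in range(n)] for i in range(n)]

def split_blocks(K, n_train):
    """Split a joint kernel matrix into train, train/test and test blocks."""
    K_tr = [row[:n_train] for row in K[:n_train]]
    K_tr_t = [row[n_train:] for row in K[:n_train]]
    K_t = [row[n_train:] for row in K[n_train:]]
    return K_tr, K_tr_t, K_t

# Toy data (hypothetical): two training points and one test point.
train = [(0.0, 0.0), (1.0, 0.0)]
test = [(0.0, 1.0)]
points = train + test

# Candidate kernels differ only in the Gaussian width sigma.
candidates = [gram_matrix(points, s) for s in (0.5, 1.0, 2.0)]
weights = [0.2, 0.3, 0.5]  # hypothetical; an SDP would choose these
K = combine(candidates, weights)
K_tr, K_tr_t, K_t = split_blocks(K, len(train))
```

Only the training block `K_tr` enters the SVM training objective, while the mixed block `K_tr_t` is what the trained classifier uses to label the test points. Learning the whole matrix at once is what ties the two together.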

Empirical results: hard margin SVMs
