
Combining Multiple Kernels for Efficient Image Classification
Behjat Siddiquie, Shiv N. Vitaladevuni and Larry S. Davis

Method

- Image Classification:
  - Kernel-based methods (SVMs) have proved very successful
  - Hard problems require multiple heterogeneous features
  - Compute a kernel for each feature channel
  - MKL has been proposed to systematically combine multiple kernels
- Classification Efficiency:
  - Need to compute the kernel distance to every support vector, for all feature channels
  - Virtually every training sample ends up being a support vector
- Multiple Kernel Learning (EMKL):
  - Learns a weighted linear combination of the kernels
  - Iterative optimization framework that learns the kernel weights and the SVM classification parameters simultaneously [Rakotomamonjy et al., ICML 2007]
  - The composite kernel is a Mercer kernel
  - Effective for combining information from different feature channels (a sketch of the combination step follows this list)
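As a minimal illustration of the combination step (an assumed implementation, not the authors' code), the composite kernel is simply a non-negative weighted sum of the per-channel Gram matrices, which can then be fed to a standard SVM:

    import numpy as np
    from sklearn.svm import SVC

    def combine_kernels(kernel_matrices, weights):
        """Weighted linear combination of precomputed Gram matrices.

        A non-negative combination of Mercer kernels is itself a
        Mercer kernel, which is what makes the composite usable
        inside a standard SVM.
        """
        weights = np.asarray(weights, dtype=float)
        assert np.all(weights >= 0), "weights must be non-negative"
        return sum(w * K for w, K in zip(weights, kernel_matrices))

    # Usage sketch: K_train is a list of per-channel Gram matrices on
    # the training set. EMKL alternates between fitting the SVM below
    # and updating `weights` by gradient steps on the SVM objective.
    # svm = SVC(kernel='precomputed').fit(combine_kernels(K_train, weights), y)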

Experiments

- UCI datasets:
  - Standard machine learning benchmark
  - Gaussian and polynomial kernels used as the base kernels (see the sketch after this list)
  - Two orders of magnitude reduction in complexity
- Painting dataset:
  - 498 paintings, 6 classes of painting style
  - The abstract nature of the styles and the high variability among paintings make this a challenging problem
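A quick sketch of the two base-kernel families used on the UCI data (the hyperparameters sigma, degree and c below are illustrative placeholders, not the values used in the experiments):

    import numpy as np

    def gaussian_kernel(X, Y, sigma=1.0):
        # Squared Euclidean distances via ||x||^2 + ||y||^2 - 2 x.y
        sq = (X ** 2).sum(1)[:, None] + (Y ** 2).sum(1)[None, :] - 2 * X @ Y.T
        return np.exp(-sq / (2 * sigma ** 2))

    def polynomial_kernel(X, Y, degree=2, c=1.0):
        return (X @ Y.T + c) ** degree

    # Computing one Gram matrix per (kernel family, hyperparameter)
    # setting yields the bank of base kernels that EMKL/BKSVM combines.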

- Features:
  - Texture
    - Captures characteristics of the brushwork
    - MR8 filter bank [Varma & Zisserman, ECCV 2002]
  - HOG (histograms of oriented gradients)
    - Captures local shape
    - Computed densely on a grid as well as sparsely on edges
  - Color
    - Color histograms (RGB) from local patches (a sketch follows this list)
  - Saliency
    - Edge continuity used to identify salient curves
    - HOG features extracted from patches centered on salient curves
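As one concrete example of a feature channel, here is a minimal sketch of the color channel: RGB histograms over a dense grid of local patches (the 16-pixel patch size and 8 bins per channel are assumptions for illustration, not the poster's settings):

    import numpy as np

    def patch_color_histograms(image, patch=16, bins=8):
        """One normalized RGB histogram per local patch.

        image: H x W x 3 uint8 array. Returns an (n_patches, bins**3)
        array, i.e. a *set* of feature vectors, which is the form the
        pyramid match kernel operates on.
        """
        feats = []
        H, W, _ = image.shape
        for y in range(0, H - patch + 1, patch):
            for x in range(0, W - patch + 1, patch):
                pixels = image[y:y + patch, x:x + patch].reshape(-1, 3)
                h, _ = np.histogramdd(pixels, bins=(bins,) * 3,
                                      range=((0, 256),) * 3)
                feats.append(h.ravel() / h.sum())
        return np.array(feats)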

- Pyramid Match Kernel:
  - A set of feature vectors is extracted from each feature channel
  - Pyramid Match Kernel [Grauman & Darrell, ICCV 2005]:
    - Feature space is subdivided into bins in a pyramidal manner
    - The correspondence between two feature sets is approximated by a weighted intersection kernel computed over these pyramidal bins (a sketch follows this list)
  - Kernels corresponding to each feature channel are obtained
  - BKSVM and EMKL learn a combination of these kernels for classification
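A coarse sketch of the pyramid match (assuming non-negative features; the published kernel additionally normalizes by the self-match scores K(X,X) and K(Y,Y) to avoid bias toward large feature sets):

    import numpy as np
    from collections import Counter

    def pyramid_match(X, Y, levels=4):
        """Weighted intersection over pyramidal bins [Grauman & Darrell].

        X, Y: (n, d) arrays of feature vectors. Bin side length doubles
        at each level; matches first formed at finer resolutions
        receive larger weights.
        """
        score, prev = 0.0, 0.0
        for level in range(levels):
            side = 2 ** level
            hx = Counter(map(tuple, np.floor(X / side).astype(int)))
            hy = Counter(map(tuple, np.floor(Y / side).astype(int)))
            inter = sum(min(hx[b], hy[b]) for b in hx)  # histogram intersection
            score += (inter - prev) / (2 ** level)      # new matches, weighted 1/2^level
            prev = inter
        return score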

- Efficiency:
  - Efficiency/accuracy tradeoff:
    - 10-fold increase in efficiency with a 2% drop in performance
    - 100-fold increase in efficiency with a 7% drop in performance
  - Similar results with individual kernels

[Figure: kernel weights assigned to each feature channel by BKSVM and EMKL]

- Feature Selection:
  - EMKL: intuitive weights assigned to the kernels; sparsity constraint
  - BKSVM: unconstrained

- Feature Combination:
  - Individual features help in distinguishing certain classes
  - They complement each other when combined

- Classification Results:
  - Number of feature-sample pairs selected is 1/5th of the total

Boosted Kernel Learning (BKSVM):
- An efficient alternative to MKL
- Uses AdaBoost to select discriminative feature-sample pairs
- A reduced-dimensional feature vector is obtained; its elements represent the kernel distances to the selected training samples in the corresponding feature spaces
- A kernel is then learned from this feature vector
- Efficiency during the test phase: compute kernel distances from the test sample to only the selected feature-sample pairs
- Efficiency is a function of the number of pairs selected, and can be tuned by varying the number of AdaBoost rounds (a sketch follows this list)
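The following is a minimal sketch of the boosting-based selection, an illustration of the idea under simplifying assumptions (median-threshold stumps as weak learners and exhaustive search over candidates), not the authors' exact algorithm:

    import numpy as np

    def select_pairs(base_kernels, y, rounds=50):
        """AdaBoost-style selection of (channel, sample) pairs.

        base_kernels: list of (n, n) Gram matrices, one per channel.
        y: labels in {-1, +1}. Each candidate weak learner thresholds
        the kernel values to one training sample j in one channel m,
        i.e. the column K_m[:, j], at its median (a crude stump).
        """
        y = np.asarray(y)
        n = len(y)
        w = np.full(n, 1.0 / n)              # AdaBoost sample weights
        selected = []
        for _ in range(rounds):
            best = None                      # (error, m, j, prediction)
            for m, K in enumerate(base_kernels):
                for j in range(n):
                    stump = np.where(K[:, j] > np.median(K[:, j]), 1, -1)
                    for pred in (stump, -stump):   # try both polarities
                        err = w[pred != y].sum()
                        if best is None or err < best[0]:
                            best = (err, m, j, pred)
            err, m, j, pred = best
            err = np.clip(err, 1e-10, 1 - 1e-10)
            alpha = 0.5 * np.log((1 - err) / err)
            w *= np.exp(-alpha * y * pred)   # re-weight misclassified samples up
            w /= w.sum()
            selected.append((m, j))
        return selected

    def reduced_features(base_kernels, selected):
        # phi(x_i): kernel values of sample i to each selected pair; at
        # test time only these few kernel evaluations are needed.
        return np.stack([base_kernels[m][:, j] for m, j in selected], axis=1)

A standard (e.g. Gaussian) kernel SVM is then trained on the reduced vectors, so test-time cost scales with the number of selected pairs rather than with the number of support vectors times the number of channels.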