Zohreh Karimi, Semi-Supervised Classification (2)

TRANSCRIPT

  • Slide 1
  • Semi-Supervised Classification (2). Reference: Xiaojin Zhu and Andrew B. Goldberg, Introduction to Semi-Supervised Learning, University of Wisconsin, Madison, 2009.
  • Slide 2
  • Outline: mixture models and EM, co-training, graph-based methods, semi-supervised SVMs.
  • Slide 3
  • Co-Training: motivating example, named-entity classification; decide whether a named entity belongs to the Location class.
  • Slide 4
  • Co-Training: named-entity classification example, continued (the entity's own text as one source of evidence for the Location class).
  • Slide 5
  • Co-Training: named-entity classification example, continued (the surrounding context as another source of evidence).
  • Slide 6
  • Co-Training: train one classifier per view on the labeled data; each classifier picks the unlabeled instances it is most confident about, labels them, and adds them to the other classifier's training set; repeat.
  • Slide 7
  • Co-Training assumes each instance has two views, each of which is sufficient on its own to train a good classifier, and the views are conditionally independent given the class.
  • Slide 8
  • Examples of natural two-view problems: web-page classification (the words on the page vs. the hyperlink anchor text pointing to it) and speech phoneme classification (the audio vs. the video of the speaker). A runnable sketch of the co-training loop follows below.
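To make the loop on slides 6-8 concrete, here is a minimal co-training sketch in Python. It is not from the slides: scikit-learn's LogisticRegression stands in for the two base learners, and X1, X2, rounds, and k are illustrative names and defaults of this sketch.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def co_training(X1, X2, y, labeled, unlabeled, rounds=10, k=5):
    """Minimal co-training over two feature views X1 and X2.

    Only indices in `labeled` have trusted labels in y; indices in
    `unlabeled` are pseudo-labeled as the two classifiers teach
    each other their most confident predictions.
    """
    y = y.copy()
    pool = list(unlabeled)
    L1, L2 = list(labeled), list(labeled)   # per-view labeled index sets
    clf1 = LogisticRegression(max_iter=1000)
    clf2 = LogisticRegression(max_iter=1000)
    for _ in range(rounds):
        if not pool:
            break
        clf1.fit(X1[L1], y[L1])
        clf2.fit(X2[L2], y[L2])
        # Each view labels its k most confident pool items for the other.
        for clf, X, other in ((clf1, X1, L2), (clf2, X2, L1)):
            if not pool:
                break
            proba = clf.predict_proba(X[pool])
            top = np.argsort(proba.max(axis=1))[-k:]
            idx = [pool[t] for t in top]
            y[idx] = clf.classes_[proba[top].argmax(axis=1)]
            other.extend(idx)
            pool = [p for p in pool if p not in idx]
    return clf1, clf2
```

Each round the two classifiers teach each other through their confident predictions, which is exactly the exchange sketched on slides 3-6.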
  • Slide 9
  • Multiview Learning (1): a loss function c(x, y, f(x)) measures the cost of predicting f(x) when the true label is y. The squared loss is c(x, y, f(x)) = (y - f(x))^2; the 0/1 loss is c(x, y, f(x)) = 0 if y = f(x), and 1 otherwise. Losses may be asymmetric, e.g. c(x, y = healthy, f(x) = diseased) = 1 while c(x, y = diseased, f(x) = healthy) = 100.
  • Slide 10
  • Multiview Learning (2): learn one predictor per view, each trained on the labeled data, while encouraging the predictors to agree with one another on the unlabeled data.
  • Slide 11
  • Multiview Learning (3): with k views, each predictor f_v carries its own individual regularized risk, and a semi-supervised regularizer penalizes pairwise disagreement among the k predictors on the unlabeled instances; the full objective is written out below, after the next slide.
  • Slide 12
  • Multiview Learning (4): the labeled part of each view's term is that view's empirical risk; the agreement term is what couples the views through the unlabeled data.
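The objective referenced on slide 11 appears to have been lost in extraction; reconstructed as a sketch from the Zhu and Goldberg book this deck follows, with k views, l labeled and u unlabeled instances, and view indices s, t:

```latex
\min_{f_1,\dots,f_k}\;
\sum_{v=1}^{k}\Bigl[\,\sum_{i=1}^{l} c\bigl(x_i, y_i, f_v(x_i)\bigr)
  + \lambda_1\,\Omega(f_v)\Bigr]
\;+\; \lambda_2 \sum_{s<t}\;\sum_{i=l+1}^{l+u}
  \bigl(f_s(x_i) - f_t(x_i)\bigr)^2
```

The first bracket is the individual regularized risk of view v; the last sum is the semi-supervised (agreement) regularizer, with λ1 and λ2 trading off per-view smoothness against cross-view agreement.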
  • Slide 13
  • Outline (revisited): mixture models and EM, co-training; next, graph-based methods, then semi-supervised SVMs.
  • Slide 14
  • Graph-Based Methods (1): build a similarity graph over all labeled and unlabeled instances, e.g. a kNN graph (connect each instance to its k nearest neighbors) or an εNN graph (connect instances closer than ε).
  • Slide 15
  • Graph-Based Methods (2): weight each edge by the similarity of its endpoints, commonly with Gaussian weights w_ij = exp(-‖x_i - x_j‖^2 / (2σ^2)); a construction sketch follows below.
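A minimal numpy sketch of that construction; the fully connected variant and the name gaussian_weights are this sketch's choices, not the slides':

```python
import numpy as np

def gaussian_weights(X, sigma=1.0):
    """Fully connected similarity graph with Gaussian edge weights."""
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)  # pairwise sq. dists
    W = np.exp(-sq / (2 * sigma ** 2))
    np.fill_diagonal(W, 0.0)  # no self-edges
    return W
```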
  • Slide 16
  • Regularization: graph-based methods estimate a label function f that (1) has small loss on the labeled vertices, and (2) is smooth over the whole graph (the regularization framework); individual methods are special cases of this graph-based regularization, differing in the loss function and the regularizer.
  • Slide 17
  • Mincut (1): treat the positive labeled instances as sources and the negative labeled instances as sinks; find the minimum edge cut that separates sources from sinks, and label every node according to its side of the cut.
  • Slide 18
  • Mincut (2): worked example on a small five-node graph (figure).
  • Slide 19
  • Mincut (3): mincut is a regularized risk problem whose cost function puts infinite loss on violating a given label and whose regularizer is the size of the cut: min over f(x) ∈ {0, 1} of ∞ · Σ_{i=1..l} (y_i - f(x_i))^2 + Σ_{i,j} w_ij (f(x_i) - f(x_j))^2. A runnable sketch follows below.
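A small sketch of mincut labeling via max-flow, assuming the networkx library (whose minimum_cut routine is applied here to an undirected graph); the helper name mincut_ssl and the toy graph, loosely echoing the five-node figure on slide 18, are illustrative:

```python
import networkx as nx

def mincut_ssl(weights, pos, neg):
    """Label graph nodes by an s-t minimum cut.

    weights: dict mapping node pairs (i, j) to edge weights w_ij.
    pos, neg: node ids labeled positive / negative.
    """
    G = nx.Graph()
    for (i, j), w in weights.items():
        G.add_edge(i, j, capacity=w)
    # Infinite-capacity ties to auxiliary source/sink enforce the labels.
    for i in pos:
        G.add_edge('s', i, capacity=float('inf'))
    for j in neg:
        G.add_edge(j, 't', capacity=float('inf'))
    _, (src_side, sink_side) = nx.minimum_cut(G, 's', 't')
    labels = {v: 1 for v in src_side if v != 's'}
    labels.update({v: 0 for v in sink_side if v != 't'})
    return labels

# Two clusters joined by one weak edge; the cut severs the 3-4 edge,
# so nodes 1-3 come out positive and nodes 4-5 negative.
w = {(1, 2): 1.0, (1, 3): 1.0, (2, 3): 1.0,
     (3, 4): 0.1, (4, 5): 1.0}
print(mincut_ssl(w, pos=[1], neg=[5]))
```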
  • Slide 20
  • Harmonic Function (1): relax the labels to real values; a harmonic function f matches the given labels on the labeled vertices and, at every unlabeled vertex, equals the weighted average of its neighbors' values.
  • Slide 21
  • Harmonic Function (2): equivalently, f minimizes the energy Σ_{i,j} w_ij (f(x_i) - f(x_j))^2 subject to f(x_i) = y_i on the labeled vertices; f(x) can also be interpreted through a random walk on the graph that stops when it first hits a labeled vertex.
  • Slide 22
  • Harmonic Function (3): the unnormalized graph Laplacian matrix is L = D - W, where W is an (l + u) × (l + u) weight matrix whose (i, j)-th element is the edge weight w_ij, and D is the diagonal degree matrix with D_ii = Σ_j w_ij.
  • Slide 23
  • Harmonic Function (4): the energy equals f⊤Lf, so partitioning the unnormalized graph Laplacian matrix into labeled and unlabeled blocks gives the closed-form solution f_u = -L_uu^{-1} L_ul y_l (see the sketch below).
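A minimal numpy sketch of this closed form. The convention that labeled nodes occupy the first l rows and columns of W is this sketch's assumption, not something stated on the slides:

```python
import numpy as np

def harmonic_solution(W, y_l):
    """Closed-form harmonic function values for the unlabeled nodes.

    W:   (n, n) symmetric weight matrix, labeled nodes first.
    y_l: (l,) labels (e.g. 0/1) of the first l nodes.
    Assumes every unlabeled node connects, directly or through other
    unlabeled nodes, to a labeled one, so L_uu is invertible.
    """
    l = len(y_l)
    D = np.diag(W.sum(axis=1))
    L = D - W                                # unnormalized graph Laplacian
    L_uu, L_ul = L[l:, l:], L[l:, :l]
    return -np.linalg.solve(L_uu, L_ul @ y_l)  # f_u = -L_uu^{-1} L_ul y_l
```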
  • Slide 24
  • Manifold Regularization (1): the harmonic function is transductive, defined only on the given l + u vertices, and it clamps f(x_i) = y_i exactly, so it can neither absorb label noise nor predict on unseen test instances.
  • Slide 25
  • Manifold Regularization (2): manifold regularization is inductive: it learns a function f defined on the whole input space, trading off the loss on labeled data against the smoothness of f over the graph.
  • Slide 26
  • Manifold Regularization (3): the smoothness term may use the normalized graph Laplacian D^(-1/2) L D^(-1/2) instead of the unnormalized Laplacian L; the objective is sketched below.
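The formula on slide 26 appears to have been lost in extraction. As a sketch based on the Zhu and Goldberg book this deck follows, the manifold-regularization objective has the form

```latex
\min_{f}\; \sum_{i=1}^{l} c\bigl(x_i, y_i, f(x_i)\bigr)
  \;+\; \lambda_1 \lVert f \rVert^2
  \;+\; \lambda_2\, \mathbf{f}^{\top} L\, \mathbf{f},
\qquad
\mathbf{f} = \bigl(f(x_1), \dots, f(x_{l+u})\bigr)^{\top}
```

where ‖f‖ is the function-space norm that makes the solution inductive, and L may be replaced by the normalized Laplacian mentioned above.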
  • Slide 27
  • Spectral Graph Theory (1): the behavior of graph-based regularizers is best understood through the spectrum (eigenvalues and eigenvectors) of the graph Laplacian.
  • Slide 28
  • Spectral Graph Theory (2): the Laplacian decomposes as L = Σ_i λ_i φ_i φ_i⊤, with sorted eigenvalues 0 = λ_1 ≤ λ_2 ≤ … ≤ λ_{l+u} and orthonormal eigenvectors φ_i.
  • Slide 29
  • Spectral Graph Theory (3): a smaller eigenvalue corresponds to a smoother eigenvector over the graph. The graph has k connected components if and only if λ_1 = … = λ_k = 0; the corresponding eigenvectors are constant on individual connected components, and zero elsewhere. (A numeric check follows below.)
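A quick numpy check of the connected-components fact, on a made-up graph of two disjoint triangles (so exactly k = 2 zero eigenvalues are expected):

```python
import numpy as np

# Two disconnected triangles: the Laplacian should have exactly
# two zero eigenvalues, one per connected component.
W = np.zeros((6, 6))
for i, j in [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)]:
    W[i, j] = W[j, i] = 1.0
L = np.diag(W.sum(axis=1)) - W        # unnormalized Laplacian
eigvals = np.linalg.eigvalsh(L)       # ascending order
print(np.round(eigvals, 6))           # [0. 0. 3. 3. 3. 3.]
```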
  • Slide 30
  • Graph Spectrum: figure showing the eigenvalues and eigenvectors of an example graph.
  • Slide 31
  • Spectral Graph Theory (4): expand f in the eigenbasis as f = Σ_i a_i φ_i; the regularization term becomes f⊤Lf = Σ_i a_i^2 λ_i, so it penalizes weight a_i placed on eigenvectors φ_i with large λ_i, i.e. the non-smooth ones.
  • Slide 32
  • Spectral Graph Theory (5): on a graph with k connected components the first k eigenvalues vanish, so the regularization term is zero for any f that is constant on each component, whatever those constants are.
  • Slide 33
  • Spectral Graph Theory (6): in the ideal case each class sits in its own component; the regularizer then does most of the work, and a few labels suffice to fix the constant on each component.
  • Slide 34
  • Outline (revisited): mixture models and EM, co-training, graph-based methods; next, semi-supervised SVMs.
  • Slide 35
  • Support vector machines are built around the notion of margin; the geometric margin of an instance is its distance to the decision boundary.
  • Slide 36
  • Support Vector Machines: find the linear decision boundary f(x) = w⊤x + b that separates the two classes with the largest margin.
  • Slide 37
  • Support Vector Machines: the signed geometric margin of (x_i, y_i) is y_i (w⊤x_i + b) / ‖w‖; the classifier's margin is the distance from the decision boundary to the closest labeled instance, and the maximum-margin hyperplane is unique.
  • Slide 38
  • Non-Separable Case (1): when the classes overlap, no hyperplane separates them perfectly; introduce slack variables ξ_i ≥ 0 that let individual instances violate the margin.
  • Slide 39
  • Non-Separable Case (2): instances with ξ_i = 0 are correctly classified and outside the margin; instances with 0 < ξ_i ≤ 1 lie inside the margin, but on the correct side of the decision boundary; instances with ξ_i > 1 lie on the wrong side of the decision boundary and are misclassified.
  • Slide 40
  • Non-Separable Case (3): the soft-margin SVM solves min over w, b, ξ of Σ_i ξ_i + λ‖w‖^2, subject to y_i (w⊤x_i + b) ≥ 1 - ξ_i and ξ_i ≥ 0.
  • Slide 41
  • Non-Separable Case (4): equivalently, an unconstrained regularized risk with the hinge loss, min over w, b of Σ_i max(1 - y_i (w⊤x_i + b), 0) + λ‖w‖^2 (see the snippet below).
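A small numpy illustration of the hinge loss as a function of the margin value y·f(x); the sample values are made up:

```python
import numpy as np

def hinge(yf):
    """Hinge loss as a function of the margin value y * f(x)."""
    return np.maximum(1 - yf, 0)

# Zero once an instance clears the margin (y f(x) >= 1); grows
# linearly as the instance moves toward and past the boundary.
print(hinge(np.array([2.0, 1.0, 0.5, 0.0, -1.0])))  # [0. 0. 0.5 1. 2.]
```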
  • Slide 42
  • S3VM (1): semi-supervised SVMs (S3VMs) extend the margin idea to unlabeled data, preferring decision boundaries that pass through low-density regions, far from both labeled and unlabeled instances.
  • Slide 43
  • S3VM (2): unlabeled instances incur the hat loss max(1 - |w⊤x + b|, 0), which pushes them outside the margin; without a balancing constraint there is a degenerate solution in which the majority (or even all) of the unlabeled instances are predicted in only one of the classes, so S3VMs constrain the predicted class proportions to match those of the labeled data.
  • Slide 44
  • S3VM (3): the hinge loss is a convex function, but the hat loss is not, so the S3VM objective function (written out below) is non-convex; research on S3VMs has focused on how to efficiently find a near-optimal solution.
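For reference, a sketch of the S3VM objective from the Zhu and Goldberg book this deck follows: the first term is the hinge loss on the l labeled instances, the last term is the hat loss on the u unlabeled instances, and the hat loss is what makes the problem non-convex.

```latex
\min_{w,\,b}\;
\sum_{i=1}^{l} \max\bigl(1 - y_i (w^{\top} x_i + b),\; 0\bigr)
+ \lambda_1 \lVert w \rVert^2
+ \lambda_2 \sum_{j=l+1}^{l+u} \max\bigl(1 - \lvert w^{\top} x_j + b \rvert,\; 0\bigr)
```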
  • Slide 45
  • Logistic Regression: SVM and S3VM are non-probabilistic models; logistic regression is a probabilistic model, trained by maximizing the conditional log likelihood Σ_i log p(y_i | x_i, w), with a Gaussian distribution as the prior on w.
  • Slide 46
  • Logistic Regression: the resulting MAP objective is the logistic loss Σ_i log(1 + exp(-y_i (w⊤x_i + b))) plus the regularizer λ‖w‖^2 contributed by the Gaussian prior.
  • Slide 47
  • Logistic regression
  • Slide 48
  • Entropy Regularizer: logistic regression plus an entropy regularizer for semi-supervised learning. Intuition: if the two classes are well separated, then the classification of any unlabeled instance should be confident: it either clearly belongs to the positive class or to the negative class. Equivalently, the posterior probability p(y|x) should be either close to 1 or close to 0, i.e. its entropy should be low.
  • Slide 49
  • Semi-Supervised Logistic Regression: add the entropy regularizer Σ_{j=l+1..l+u} H(p(y | x_j, w)) to the logistic-regression objective, where H(p) = -p log p - (1 - p) log(1 - p); minimizing it drives the unlabeled posteriors away from 1/2.
  • Slide 50
  • Entropy Regularizer
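A gradient-descent sketch of entropy-regularized logistic regression in plain numpy. Nothing here is prescribed by the slides: the function names, the omission of a bias term, and the hyperparameter defaults are all choices of this sketch.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def semi_supervised_logreg(X_l, y_l, X_u, lam=0.1, gamma=0.5,
                           lr=0.1, steps=2000):
    """Logistic regression with an entropy regularizer on unlabeled data.

    X_l, y_l: labeled data with y in {0, 1}.  X_u: unlabeled data.
    lam weights the L2 term, gamma the entropy term. No bias term,
    for brevity; append a constant feature to X to add one.
    """
    n, d = X_l.shape
    w = np.zeros(d)
    for _ in range(steps):
        p_l = sigmoid(X_l @ w)
        grad = X_l.T @ (p_l - y_l) / n + lam * w       # logistic loss + L2
        p_u = np.clip(sigmoid(X_u @ w), 1e-6, 1 - 1e-6)
        # Gradient of H(p) = -p log p - (1-p) log(1-p) w.r.t. w:
        # dH/dp = log((1-p)/p) and dp/dw = p (1-p) x.
        grad += gamma * X_u.T @ (np.log((1 - p_u) / p_u)
                                 * p_u * (1 - p_u)) / len(X_u)
        w -= lr * grad
    return w  # classify a new x via sigmoid(x @ w) > 0.5
```

Minimizing the added term pushes p(y|x) toward 0 or 1 on the unlabeled pool, which is exactly the low-density intuition from slide 48.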
  • Slide 51
  • S3VM vs. Entropy Regularization: both implement the same low-density intuition, S3VM through the hat loss on unlabeled instances and entropy regularization through low-entropy posteriors p(y|x).