crv 2015
DESCRIPTION
My presentation of CRV 2015 paperTRANSCRIPT
![Page 1: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/1.jpg)
Zero-Shot Object Recognition Using Semantic Label Vectors
![Page 2: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/2.jpg)
Traditional Object Recognition
Lion classifier
LionNot Lion
![Page 3: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/3.jpg)
3
Traditional Object Detection
Learns a weight vector to distinguish between “Lion” and others
![Page 4: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/4.jpg)
Limitation of Traditional Object Recognition
• Too many objects in the nature• Humans can recognize between 5,000 and
30,000 object categories• Collecting training images for all these object
categories is tedious and expensive
![Page 5: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/5.jpg)
Transfer Learning
• Transfer the knowledge of known objects to recognize previously unseen objects
• Use shared properties such as physical attributes
• Use contextual information from textual knowledge base
![Page 6: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/6.jpg)
Motivation: Finding images based on textual description
The Bengal Tiger The Bengal tiger's has zebra like
stripes ranging from dark brown to black
In comparison, a weight range of 150 to 189 kg (331 to 417 lb) is considered fairly average for a male African lion
The biggest and perhaps most fearsome of the world's big cats, the tiger shares 95.6 percent of its DNA with humans' cute and furry companions, domestic cats.
![Page 7: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/7.jpg)
Can you find the “Tiger” based on the existing knowledge?
The Bengal Tiger The Bengal tiger's coat is yellow
to light orange, with stripes ranging from dark brown to black
In comparison, a weight range of 150 to 189 kg (331 to 417 lb) is considered fairly average for a male African lion
The biggest and perhaps most fearsome of the world's big cats, the tiger shares 95.6 percent of its DNA with humans' cute and furry companions, domestic cats.
![Page 8: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/8.jpg)
8
Word Vector Representation
![Page 9: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/9.jpg)
Attributes
![Page 10: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/10.jpg)
10
Knowledge Transfer Using Wordvector
![Page 11: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/11.jpg)
11
Transferring knowledge
Unknown classes Known classes
![Page 12: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/12.jpg)
12
Baselines
WordVector DistanceWordNet Distance
Word vector for “Lion”
Word vector for “Tiger”
Predicted known class
Closest unknown class
![Page 13: Crv 2015](https://reader036.vdocuments.net/reader036/viewer/2022062518/563dbb8d550346aa9aae2b39/html5/thumbnails/13.jpg)
13
Zero-Shot Object Recognition
Unknown classes Known classes