[NN Reading Group] YouTube-8M: A Large-Scale Video Classification Benchmark


Upload: tomomi-moriyama

Post on 21-Jan-2018


TRANSCRIPT

Page 1: [NN Reading Group] YouTube-8M: A Large-Scale Video Classification Benchmark

2017/7/12

YouTube-8M: A Large-Scale Video Classification Benchmark

[Google Research, 2016/9/27, arXiv:1609.08675v1] TFUG NN #3


Page 2

✤ Kaggle

Page 3

YouTube-8M

ImageNet…


Page 4

YouTube-8M


2TB

1GPU 1

1) 1 1

2) Inception

3) PCA

4) TensorFlow

→ URL

Page 5

✤ YouTube


Page 6

✤ Knowledge Graph entity


Page 7

✤ 3 1-2.5

Page 8

✤ …

✤ 78.8% 14.5%

✤ → 80%

✤ →


Page 9

DBoF

✤ k N

✤ ReLU, M

✤ → (MxN)

✤ Max pooling
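The bullets above (k frames of N dimensions, a ReLU projection to M dimensions, then max pooling) can be sketched in plain Python. This is only an illustrative sketch: the function name `dbof_pool`, the weight layout (M rows of N values) and the bias vector are assumptions made for this example, not taken from the slides or from the official YouTube-8M starter code.

```python
def dbof_pool(frames, weights, biases):
    """Deep Bag-of-Frames style pooling (illustrative sketch).

    frames  : k frame features, each a list of N floats
    weights : M rows of N floats, shared across all frames (assumed layout)
    biases  : M floats
    returns : one video-level vector of M floats
    """
    projected = []
    for frame in frames:
        row = []
        for w_row, b in zip(weights, biases):
            activation = sum(w * x for w, x in zip(w_row, frame)) + b
            row.append(max(0.0, activation))  # ReLU
        projected.append(row)
    # Max pooling over the k frames, one maximum per projected dimension.
    return [max(column) for column in zip(*projected)]


if __name__ == "__main__":
    frames = [[1.0, 0.0], [0.0, 2.0]]            # k = 2 frames, N = 2
    weights = [[1, 0], [0, 1], [-1, -1]]         # M = 3
    print(dbof_pool(frames, weights, [0, 0, 0]))
```

Because the projection weights are shared across frames, the layer has on the order of M x N parameters regardless of how many frames are sampled.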


Page 10

✤ φ

✤ PCA

✤ L2

Page 11

✤ mAP:

✤ Hit@k: k

1

✤ PERR(Precision at equal recall rate):

✤ GAP: Kaggle
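Three of the metrics listed above (Hit@k, PERR and GAP) can be sketched as follows, using their usual definitions: Hit@k is the fraction of videos with at least one correct label among the top k predictions, PERR is the precision of the top G predictions for a video with G ground-truth labels, and GAP pools the top predictions of every video into one ranked list and computes a single average precision. The data layout (one dict of class scores and one set of true labels per video) and the function names are assumptions for illustration; `top_n=20` follows the Kaggle competition setting.

```python
def _ranked(scores):
    """Class names sorted by descending score."""
    return sorted(scores, key=scores.get, reverse=True)


def hit_at_k(all_scores, all_labels, k):
    """Fraction of videos with at least one true label in the top k."""
    hits = sum(any(c in truth for c in _ranked(s)[:k])
               for s, truth in zip(all_scores, all_labels))
    return hits / len(all_scores)


def perr(all_scores, all_labels):
    """Precision at equal recall rate, averaged over labelled videos."""
    total, counted = 0.0, 0
    for s, truth in zip(all_scores, all_labels):
        if not truth:
            continue  # videos without labels are skipped
        top = _ranked(s)[:len(truth)]
        total += sum(c in truth for c in top) / len(truth)
        counted += 1
    return total / counted


def gap(all_scores, all_labels, top_n=20):
    """Global average precision over the pooled top_n predictions per video."""
    pooled = []
    for s, truth in zip(all_scores, all_labels):
        pooled.extend((s[c], c in truth) for c in _ranked(s)[:top_n])
    pooled.sort(key=lambda p: p[0], reverse=True)
    num_positives = sum(len(truth) for truth in all_labels)
    hits, ap = 0, 0.0
    for rank, (_, correct) in enumerate(pooled, start=1):
        if correct:
            hits += 1
            ap += hits / rank  # precision at this rank
    return ap / num_positives


if __name__ == "__main__":
    scores = [{"a": 0.9, "b": 0.2, "c": 0.1}, {"a": 0.3, "b": 0.8, "c": 0.6}]
    labels = [{"a"}, {"a", "c"}]
    print(hit_at_k(scores, labels, 1), perr(scores, labels), gap(scores, labels, 2))
```

Note that this GAP sketch divides by the total number of ground-truth labels, so true labels that fall outside the pooled top predictions lower the score.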


Page 12

✤ 1

✤ 2

DBoF, LSTM


Page 13

✤ PERR

15%


Page 14

✤ ActivityNet

✤ Sports-1M


Page 15

Kaggle

✤ 6

✤ Google Cloud …

Page 16

Kaggle 1

✤ https://github.com/antoine77340/LOUPE

✤ Learnable pooling with Context Gating for video classification

✤ [Antoine Miech et al., arXiv:1706.06905v1, 2017/6/21]

✤ 25

✤ 7 models, GAP 84.698%: Gated NetVLAD (256 clusters), Gated NetFV (128 clusters), Gated Soft-DBoW (4096 clusters), Soft-DBoW (8000 clusters), Gated NetRVLAD (256 clusters), GRU (2 layers, hidden size: 1200), LSTM (2 layers, hidden size: 1024)
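The "Gated" variants in the list above refer to the Context Gating layer from the cited paper, which rescales each feature by a learned sigmoid gate: Y = σ(WX + b) ∘ X. The snippet below is a minimal sketch of that formula in plain Python; the function name and the list-of-rows weight layout are assumptions for this example, and it is not the LOUPE implementation.

```python
import math


def context_gating(x, weights, biases):
    """Apply Y = sigmoid(W x + b) * x element-wise (illustrative sketch).

    x       : input feature vector of n floats
    weights : n rows of n floats (assumed layout for W)
    biases  : n floats
    """
    gated = []
    for x_i, w_row, b in zip(x, weights, biases):
        pre_activation = sum(w * v for w, v in zip(w_row, x)) + b
        gate = 1.0 / (1.0 + math.exp(-pre_activation))  # value in (0, 1)
        gated.append(gate * x_i)
    return gated


if __name__ == "__main__":
    x = [2.0, -4.0]
    zero_w = [[0.0, 0.0], [0.0, 0.0]]
    # With zero weights and biases every gate is 0.5, so the input is halved.
    print(context_gating(x, zero_w, [0.0, 0.0]))
```

The gate keeps the input dimensionality unchanged, so it can be placed after a pooling layer or after the classifier output without altering the shapes around it.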


Page 17

✤ Q.p8

✤ A.

…)

80% 80%

p8
