Vango Project

Van Go: Your personal art curator (Zuzanna Klyszejko)

Upload: zuzanna-klyszejko

Posted on 12-Apr-2017


Category: Data & Analytics



TRANSCRIPT

Page 1: Vango Project

Van Go: Your personal art curator

Zuzanna Klyszejko

Page 2: Vango Project

Problem: When you’re visiting a new place, your time is limited

There is too much to see

You need to make a choice

Page 3: Vango Project

If a visitor wants to see only landscapes and plants, there should be a way to choose a subset of museum objects that makes the most of their time

Page 4: Vango Project

“This painting was said by him to have been inspired by the work of Li Cheng, a tenth-century landscape artist. Its spare, rather dry brushwork again repeats the deliberately simple, austere quality that is the feature of many so-called literati paintings. Hanging scroll. Landscape. Bare trees in winter, with reference to Li Cheng (919-67) and a river. Painted in a very dry style. Inscriptions and seals. Ink on paper.”

Scraped 100,000 curatorial descriptions of paintings and drawings from the British Museum’s database

Page 5: Vango Project

“this painting be say by him to have be inspire by the work of li cheng , a tenth - century landscape artist . its spare , rather dry brushwork again repeat the deliberately simple , austere quality that be the feature of many so - call literati painting . hang scroll . landscape . bare tree in winter , with reference to li cheng ( 919 - 67 ) and a river . paint in a very dry style . inscription and seal . ink on paper.”

Tokenizing and Lemmatizing
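A minimal sketch of what this step does to a description, using only the Python standard library. The `TOY_LEMMAS` table and both function names are hypothetical and exist only for illustration; the slides do not say which NLP library was used, and a real pipeline would rely on a proper lemmatizer rather than a hand-written lookup.

```python
import re

# Hypothetical, hand-written lemma table (illustration only).
# A real pipeline would use a lemmatizer from an NLP library instead.
TOY_LEMMAS = {
    "was": "be", "been": "be", "is": "be",
    "said": "say", "inspired": "inspire",
    "trees": "tree", "painted": "paint",
    "inscriptions": "inscription", "seals": "seal",
}

def tokenize(text):
    """Lowercase the text and split it into word and punctuation tokens."""
    return re.findall(r"\w+|[^\w\s]", text.lower())

def lemmatize(tokens):
    """Map each token to its base form when the toy table knows it."""
    return [TOY_LEMMAS.get(token, token) for token in tokens]

sample = "Bare trees in winter. Painted in a very dry style."
lemmas = lemmatize(tokenize(sample))
print(" ".join(lemmas))  # bare tree in winter . paint in a very dry style .
```

Punctuation is kept as separate tokens here, which matches the spacing visible in the processed description above.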


Page 6: Vango Project

Count Vectorizer + TF-IDF
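The idea behind this step can be sketched in plain Python: count how often each term occurs in each description, then down-weight terms that occur in many descriptions. The function name `tfidf`, the toy documents, and the smoothed IDF formula are assumptions made for this sketch; the slides do not state which weighting variant was used.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return (vocabulary, matrix) with one TF-IDF row per tokenized document."""
    counts = [Counter(doc) for doc in docs]      # the count-vectorizer step
    vocab = sorted(set().union(*docs))
    n_docs = len(docs)
    # Document frequency: in how many documents does each term appear?
    df = {t: sum(1 for c in counts if t in c) for t in vocab}
    # Smoothed inverse document frequency (one common variant, assumed here).
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1 for t in vocab}
    matrix = [[c[t] * idf[t] for t in vocab] for c in counts]
    return vocab, matrix

# Toy, already-lemmatized documents (illustration only).
docs = [
    ["bare", "tree", "in", "winter"],
    ["tree", "and", "river"],
    ["ink", "on", "paper"],
]
vocab, matrix = tfidf(docs)
```

In the first toy document, "winter" ends up with a higher weight than "tree" because "tree" also appears in the second document.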


Page 7: Vango Project

Latent Semantic Analysis (PCA)
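The slide labels this step as PCA; Latent Semantic Analysis is commonly computed as a truncated singular value decomposition of the document-term matrix, which is what the sketch below does. It assumes NumPy is available. The function name `lsa_embed` and the toy matrix are hypothetical, and the number of dimensions kept in the actual project is not given in the slides.

```python
import numpy as np

def lsa_embed(doc_term, k):
    """Project a (documents x terms) matrix onto its top-k singular directions."""
    X = np.asarray(doc_term, dtype=float)
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Each row is a document expressed in k latent "topic" coordinates.
    return U[:, :k] * s[:k]

# Toy weights, columns = (tree, river, portrait, king); illustration only.
X = [
    [1, 1, 0, 0],   # landscape-like description
    [2, 1, 0, 0],   # landscape-like description
    [0, 0, 3, 1],   # portrait-like description
    [0, 0, 1, 3],   # portrait-like description
]
Z = lsa_embed(X, k=2)
```

In this toy case the two landscape-like rows land on one latent axis and the two portrait-like rows on the other, so descriptions with related vocabulary end up close together even when they do not share every word.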


Page 8: Vango Project

Cosine similarity
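Cosine similarity compares the direction of two vectors and ignores their length, so a long and a short description about the same subject can still score as similar. Below is a small standard-library sketch; the helper `most_similar` and the object names in the usage example are hypothetical.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def most_similar(query, vectors, top_n=3):
    """Return the names of the top_n vectors closest in direction to the query."""
    scored = sorted(
        ((cosine_similarity(query, vec), name) for name, vec in vectors.items()),
        reverse=True,
    )
    return [name for _, name in scored[:top_n]]

# Hypothetical museum objects in a 2-dimensional latent space.
objects = {"scroll": [2.0, 0.0], "portrait": [0.0, 1.0], "etching": [1.0, 1.0]}
print(most_similar([1.0, 0.0], objects, top_n=2))  # ['scroll', 'etching']
```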


Page 9: Vango Project

How does it work in practice? Demo

Page 10: Vango Project

Problem: Validation

Page 11: Vango Project

Problem: Validation

[Figure: "Landscape" with the words hut, lake, cliff, mountain, amid, bridge, cattle, distance]

Page 12: Vango Project

[Figure: "Landscape" with the words hut, tree, lake, river, cliff, mountain, amid, bridge, cattle, bank, distance]

Page 13: Vango Project

[Figure: two "Landscape" groups with the words stream, boat, fish, wind, windmill, hut, lake, river, cliff, mountain, amid, bridge, cattle, distance]

Page 14: Vango Project

[Figure: two "Landscape" groups with the words hut, tree, stream, lake, river, cliff, mountain, amid, bridge, cattle, bank, distance, boat, fish, wind, windmill]

Page 15: Vango Project

[Figure: two "Landscape" groups with the words hut, tree, stream, lake, river, cliff, mountain, amid, bridge, cattle, bank, distance, boat, fish, wind, windmill]

Page 16: Vango Project

[Figure: tree, river, cliff, mountain, cattle, bank]

JACCARD INDEX
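The Jaccard index of two sets is the size of their intersection divided by the size of their union: 1.0 for identical sets and 0.0 for sets with nothing in common. A short sketch follows; the two word sets are illustrative groupings built from words on the slides, not the actual sets from the talk.

```python
def jaccard(a, b):
    """Jaccard index |A ∩ B| / |A ∪ B| of two collections of words."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 1.0  # convention chosen here: two empty sets count as identical
    return len(a & b) / len(union)

# Illustrative word sets (assumed groupings, not taken from the talk).
words_a = {"hut", "lake", "tree", "river", "cliff"}
words_b = {"boat", "fish", "tree", "river", "cliff"}
score = jaccard(words_a, words_b)  # 3 shared words out of 7 distinct words
```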

Page 17: Vango Project

Dataset 1: Train a model

Dataset 2: Train a model

Dataset 3: Train a model

K-fold cross-validation to verify the model using the Jaccard index

“landscape”
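One way to read this slide: train a model on several subsets of the data, ask each one for the words it associates with "landscape", and use the Jaccard index to check how much those word sets agree. The sketch below follows that reading under stated assumptions. `related_words` is a toy stand-in for a trained model (it only counts co-occurring words), each run trains on all folds except one, and every function name and document here is hypothetical.

```python
from collections import Counter
from itertools import combinations

def jaccard(a, b):
    """Jaccard index of two sets (1.0 when both are empty)."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 1.0

def k_folds(items, k):
    """Split items into k non-overlapping folds of roughly equal size."""
    return [items[i::k] for i in range(k)]

def related_words(docs, query, top_n=3):
    """Toy 'model': the words that most often co-occur with the query."""
    counts = Counter()
    for doc in docs:
        words = set(doc.split())
        if query in words:
            counts.update(words - {query})
    # Sort by count (descending), then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return {word for word, _ in ranked[:top_n]}

def stability(docs, query, k=3, top_n=3):
    """Mean pairwise Jaccard index of the query's related words across k runs."""
    folds = k_folds(docs, k)
    results = []
    for held_out in range(k):
        # Train on every fold except the held-out one.
        train = [d for i, fold in enumerate(folds) if i != held_out for d in fold]
        results.append(related_words(train, query, top_n))
    scores = [jaccard(a, b) for a, b in combinations(results, 2)]
    return sum(scores) / len(scores)

# Hypothetical lemmatized descriptions.
docs = [
    "landscape tree river", "landscape tree cliff", "landscape river bank",
    "landscape tree river", "portrait king", "landscape cattle river",
]
score = stability(docs, "landscape", k=3)
```

A score near 1.0 would suggest the model returns much the same related words no matter which subset it was trained on; a low score would suggest the associations depend heavily on the particular sample.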

Page 18: Vango Project

About me

PhD in Cognitive Neuroscience (NYU)

VanGo web app: vango.hopto.org

Zuzanna Klyszejko
