spaCy lightning talk for KyivPy #21

Upload: anton-kasyanov

Post on 24-Jan-2018

TRANSCRIPT

Page 1: spaCy lightning talk for KyivPy #21

spaCy to the rescue, or why NLTK is not cool anymore

Anton Kasyanov | DataRobot

Page 2: spaCy lightning talk for KyivPy #21

What is spaCy

• Natural language processing library

• Industrial strength - based on latest research

• Fast - written using Cython

Page 3: spaCy lightning talk for KyivPy #21

Usage

import spacy

# 'en' is a shortcut link; spaCy 3+ needs the full package name, e.g. 'en_core_web_sm'
nlp = spacy.load('en')
doc = nlp(
    'Hello, world. '
    'Here are two sentences.'
)

Page 4: spaCy lightning talk for KyivPy #21

Tokeniser

token = doc[0]
sentence = next(doc.sents)
assert token is sentence[0]
assert sentence.text == 'Hello, world.'

Page 5: spaCy lightning talk for KyivPy #21

Word Vectors

doc = nlp(
    "Apples and oranges are similar. "
    "Boots and hippos aren't."
)

apples = doc[0]
oranges = doc[2]
boots = doc[6]
hippos = doc[8]

assert apples.similarity(oranges) > boots.similarity(hippos)
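The similarity score compared above is, by default, a cosine similarity between word vectors. As a minimal sketch of that idea (not spaCy's implementation), the snippet below computes cosine similarity in plain Python. The `cosine_similarity` helper and the three-dimensional `toy_vectors` are made up for illustration; real word vectors have hundreds of dimensions and come from a trained model.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Convention chosen for this sketch: a zero vector is similar to nothing.
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical 3-dimensional "word vectors", invented for this example only.
toy_vectors = {
    "apples": [0.9, 0.8, 0.1],
    "oranges": [0.8, 0.9, 0.2],
    "boots": [0.1, 0.2, 0.9],
    "hippos": [0.7, 0.1, 0.3],
}

fruit = cosine_similarity(toy_vectors["apples"], toy_vectors["oranges"])
other = cosine_similarity(toy_vectors["boots"], toy_vectors["hippos"])

# Vectors pointing in nearly the same direction score close to 1.
assert fruit > other
```

Because the score depends only on the angle between vectors, not on their length, two words can be similar even when their vectors differ in magnitude.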

Page 6: spaCy lightning talk for KyivPy #21

Syntactic Parser

Page 7: spaCy lightning talk for KyivPy #21

Speed

Page 8: spaCy lightning talk for KyivPy #21

Other features

• Part-of-Speech tagger

• Named entity recognition

• Integer IDs for words

• Multi-threading support

• Deep learning

• German, English, French (so far)
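To make the "Integer IDs for words" item concrete, here is a minimal sketch of string interning: each distinct word is stored once and referred to by an integer, so comparisons and lookups work on numbers rather than text. The `ToyStringStore` class is hypothetical and written only for illustration; spaCy's own string store differs in its details.

```python
class ToyStringStore:
    """Hypothetical interning table: each distinct string gets one integer ID."""

    def __init__(self):
        self._id_by_string = {}
        self._string_by_id = []

    def add(self, string):
        """Return the ID for `string`, assigning the next free ID if it is new."""
        if string not in self._id_by_string:
            self._id_by_string[string] = len(self._string_by_id)
            self._string_by_id.append(string)
        return self._id_by_string[string]

    def lookup(self, string_id):
        """Return the string that was assigned `string_id`."""
        return self._string_by_id[string_id]


store = ToyStringStore()
words = "the cat saw the other cat".split()
ids = [store.add(word) for word in words]

# Repeated words share an ID, and every ID maps back to its word.
assert ids[0] == ids[3]
assert [store.lookup(i) for i in ids] == words
```

In spaCy itself the same idea is visible on tokens: an attribute such as `token.orth` holds the integer, while `token.orth_` holds the corresponding text.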

Page 9: spaCy lightning talk for KyivPy #21

Thanks!

https://spacy.io

antonkasyanov.com