
Introduction

Computational Linguistics

Shuly Wintner, [email protected]

Department of Computer Science

University of Haifa

Haifa, Israel

Winter 2008

Shuly Wintner (University of Haifa) Computational Linguistics c©Copyrighted material 1 / 689


What is this course about?

Natural language processing:

A subfield of computer science, and in particular artificial intelligence

Concerned with computational processing of natural languages

Emulates cognitive capabilities without being committed to a true simulation of cognitive processes

Provides such novel products as computers that can understand everyday human speech, translate between different human languages, and otherwise interact linguistically with people in ways that suit people rather than computers.


Natural language processing applications

Machine translation

Natural language interfaces to computer systems

Speech recognition

Text to speech generation

Automatic summarization

E-mail filtering

Intelligent search engines


What is this course about?

Computational linguistics:

An approach to linguistics that employs methods and techniques of computer science

A formal, rigorous, computationally based investigation of questions that are traditionally addressed by linguistics

What do people know when they know a natural language?

What do they do when they use this knowledge?

How do they acquire this knowledge in the first place?


Example of an application: machine translation

The spirit is willing but the flesh is weak

The vodka is excellent but the meat is lousy


From http://babelfish.altavista.com/, using technology developed by SYSTRAN; and from Google translation tools.


Language is one of the fundamental aspects of human behavior and is a crucial component of our lives. In written form it serves as a long-term record of knowledge from one generation to the next. In spoken form it serves as our primary means of coordinating our day-to-day behavior with others. This book describes research about how language comprehension and production work.


Il linguaggio e una delle funzioni fondamentali di comportamento umano ed e un componente cruciale delle nostre vite. Nella forma scritta serve da record di lunga durata di conoscenza da una generazione al seguente. Nella forma parlata serve da nostri mezzi primari di coordinazione del nostro comportamento giornaliero con altri. Questo libro descrive la ricerca circa come la comprensione di una lingua e la produzione funzionano.


Altavista:

The language is one of the fundamental functions of human behavior and is a crucial member of our screw. In the written shape servants from record of long duration of acquaintance from one generation to following. In the shape speech she serves from our primary means of coordination of our every day behavior with others. This book describes the search approximately as the understanding of a language and the production work.


Google:

Language is a fundamental aspect of human existence behavior and is a key component of our lives. In writing that serves as a Long term record knowledge from one generation to another neighbor. In spoken form it serves as our principal means of coordinate our day to day behavior with others. This book describes Search as language comprehension and production work.


Comparison

Language is one of the fundamental aspects of human behavior and is a crucial component of our lives

The language is one of the fundamental functions of human behavior and is a crucial member of our screw


In written form it serves as a long-term record of knowledge from one generation to the next

In the written shape servants from record of long duration of acquaintance from one generation to following


This book describes research about how language comprehension and production work

This book describes the search approximately as the understanding of a language and the production work


Question answering (From http://www.ask.com/)


Why are the results so poor?

Language understanding is complicated

The necessary knowledge is enormous

Most stages of the process involve ambiguity

Many of the algorithms are computationally intractable


What kind of knowledge is required?

Phonetic and phonological knowledge

Morphological knowledge

Syntactic knowledge

Semantic knowledge

Pragmatic knowledge

Discourse knowledge

World knowledge


Phonetics and phonology

Phonetics studies the sounds produced by the vocal tract and used in language, including the physical properties of speech sounds, their perception and their production.

Phonology studies the module of the linguistic capability that relates to sound, abstracting away from the physical properties of sounds. It defines an inventory of basic units (phonemes), constraints on their combination and rules of pronunciation.


Problems in phonological processing

Homophones (homonyms): words that are pronounced alike but are different in meaning or derivation or spelling:

weak / week

to / too / two

הקלה / ה+קלה / ה+כלה

Free variation: alternation of sounds with no change in meaning: the different pronunciations of the guttural sounds in Hebrew


Allophones: variants of phonemes that are in complementary distribution: little

Phonotactic constraints: restrictions on the distribution (occurrence) of phonemes with respect to one another: ŊŇ{ŽĎ / ŊŇĹŰĎ


Morphology

Morphology studies the structure of words.

Morpheme: a minimal sound-meaning unit. Can either be bound (not a word) or free (word).

Free morphemes: book, ספר

Bound morphemes: the plural suffixes in books and ספרים

Affix: a morpheme which is added to other morphemes, especially roots or stems.

suffixes follow the root/stem

prefixes precede the root/stem

infixes are inserted into the root/stem


Derivational morphology: words are constructed from roots (or stems) and derivational affixes:

inter+national → international

international+ize → internationalize

internationalize+ation → internationalization

שלם → שלמות

Inflectional morphology: inflected forms are constructed from base forms and inflectional affixes: ספר+ים → ספרים
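The English derivations above can be replayed with a few lines of Python. This is only a minimal sketch: the helper name `attach` and its single spelling rule (drop a stem-final "e" before a vowel-initial suffix) are assumptions made for illustration, not a general model of English orthography.

```python
VOWELS = "aeiou"

def attach(stem: str, suffix: str) -> str:
    """Concatenate a stem and a suffix (hypothetical helper).

    Assumed spelling rule: a stem-final 'e' is dropped when the
    suffix starts with a vowel (internationalize + ation).
    """
    if stem.endswith("e") and suffix[0] in VOWELS:
        stem = stem[:-1]
    return stem + suffix

# Replay the derivation chain from the slide, one affix at a time.
word = "inter" + "national"      # prefixation
steps = [word]
for suffix in ("ize", "ation"):  # two derivational suffixes
    word = attach(word, suffix)
    steps.append(word)

print(" -> ".join(steps))
```

Each step takes the output of the previous one as its stem, which is what makes derivation recursive in a way that a flat word list cannot capture.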


Problems in morphological processing

Ambiguity: The various analyses of the word שבתה:

שבתה: [+verb][+base]שבה[+root]שבה[+binyan]+Pa’al[+person/gender/number]+3p/F/Sg[+tense]+past

שבתה: [+verb][+base]שבת[+root]שבת[+binyan]+Pa’al[+person/gender/number]+3p/F/Sg[+tense]+past

שבת+ה: [+noun][+base]שבת[+gender]+fem[+number]+sing[+possessiveSuffix]+3p/F/Sg

ש+בתה: [+subord]ש[+noun][+base]בתה[+gender]+fem[+number]+sing

ש+ב+תה: [+subord]ש[+preposition]ב[+noun][+base]תה[+gender]+masc[+number]+sing

ש+ב+(ה)+תה: [+subord]ש[+preposition]ב[+def][+noun][+base]תה[+gender]+masc[+number]+sing

ש+בת+ה: [+subord]ש[+noun][+base]בת[+gender]+fem[+number]+sing[+possessiveSuffix]+3p/F/Sg
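To see where this many analyses come from, the following sketch enumerates segmentations of the ambiguous word, written here in Unicode as שבתה. Everything in it is an assumption for illustration: the tiny lexicon, the two prefix particles, the single possessive suffix and the function name `analyses` are hypothetical and far smaller than a real Hebrew analyzer. It also ignores the implicit definite article, so it finds six analyses rather than the seven listed above.

```python
# Hypothetical toy resources; a real analyzer needs a full lexicon.
PREFIXES = {"ש": "subord", "ב": "preposition"}
WORDS = {
    "שבתה": ["verb, base שבה", "verb, base שבת"],
    "בתה": ["noun"],
    "תה": ["noun"],
}
POSSESSED_BASES = {"שבת", "בת"}  # nouns that may take the suffix below
SUFFIX = "ה"                      # possessive, 3p/F/Sg

def analyses(word, prefix=()):
    """Return every segmentation of `word` licensed by the toy resources."""
    results = []
    # 1. The whole remainder is a lexicon entry.
    for tag in WORDS.get(word, []):
        results.append(prefix + (f"{word} [{tag}]",))
    # 2. The remainder is a noun base followed by the possessive suffix.
    if word.endswith(SUFFIX) and word[:-1] in POSSESSED_BASES:
        results.append(prefix + (f"{word[:-1]} [noun]", f"{SUFFIX} [possessive]"))
    # 3. The first letter is a prefix particle; analyse the rest recursively.
    if word and word[0] in PREFIXES:
        particle = f"{word[0]} [{PREFIXES[word[0]]}]"
        results.extend(analyses(word[1:], prefix + (particle,)))
    return results

found = analyses("שבתה")
for segmentation in found:
    print(" + ".join(segmentation))
```

The point of the sketch is that the ambiguity is combinatorial: every letter that could be a particle opens another branch, and each branch is checked against the lexicon independently.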


Syntax

Natural language sentences have structure.

Young green frogs sleep quietly

Colorless green ideas sleep furiously

Furiously sleep ideas green colorless


colorless green ideas sleep furiously

ADJ ADJ N V ADV

[S [NP [ADJ colorless] [NP [ADJ green] [N ideas]]] [VP [V sleep] [ADV furiously]]]


Problems of syntactic processing

Expressiveness: what formalism is required for describing natural languages?

Parsing: assigning structure to grammatical strings, rejecting ungrammatical ones.

top-down vs. bottom-up

right to left vs. left to right

chart based vs. backtracking
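As an illustration of chart-based, bottom-up parsing, here is a minimal CYK-style recognizer. The grammar and lexicon are assumptions chosen to cover only the example sentences from the syntax slide; the names `LEXICON`, `RULES` and `recognize` are hypothetical.

```python
from itertools import product

# Toy lexicon: word -> set of parts of speech (assumed for illustration).
LEXICON = {
    "young": {"ADJ"}, "green": {"ADJ"}, "colorless": {"ADJ"},
    "frogs": {"N"}, "ideas": {"N"},
    "sleep": {"V"},
    "quietly": {"ADV"}, "furiously": {"ADV"},
}

# Binary rules, indexed by their right-hand side: (B, C) -> {A} for A -> B C.
RULES = {
    ("NP", "VP"): {"S"},
    ("ADJ", "NP"): {"NP"},
    ("ADJ", "N"): {"NP"},
    ("V", "ADV"): {"VP"},
}

def recognize(sentence: str) -> bool:
    """Return True if the toy grammar derives the sentence from S."""
    words = sentence.lower().split()
    n = len(words)
    if n == 0:
        return False
    # chart[i][j] holds every category that spans words[i:j].
    chart = [[set() for _ in range(n + 1)] for _ in range(n + 1)]
    for i, word in enumerate(words):
        chart[i][i + 1] = set(LEXICON.get(word, ()))
    # Fill longer spans from shorter ones (bottom-up).
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):  # split point
                for left, right in product(chart[i][k], chart[k][j]):
                    chart[i][j] |= RULES.get((left, right), set())
    return "S" in chart[0][n]

print(recognize("Colorless green ideas sleep furiously"))
print(recognize("Furiously sleep ideas green colorless"))
```

The recognizer accepts the semantically odd but grammatical sentence and rejects its reversal, since grammaticality here depends only on the categories and their order.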


Ambiguity:

I saw the spy with the brown hat

I saw the bird with the telescope

I saw the spy with the telescope

Control:

Kim asked Sandy to call the plumber

Kim promised Sandy to call the plumber

Coordination:

This book describes research about how language comprehension and production work
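The attachment ambiguity in the "I saw the spy with the telescope" examples can be made concrete by counting parse trees. The sketch below is a CYK-style counter over a small grammar that is assumed for illustration (the rule set, lexicon and the name `count_parses` are hypothetical); it lets a prepositional phrase attach either to the verb phrase or to the noun phrase.

```python
from collections import defaultdict

# Toy lexicon and binary rules (parent, left child, right child); assumptions.
LEXICON = {
    "i": ["NP"], "saw": ["V"], "the": ["Det"],
    "spy": ["N"], "telescope": ["N"], "with": ["P"],
}
RULES = [
    ("S", "NP", "VP"),
    ("VP", "V", "NP"),
    ("VP", "VP", "PP"),   # PP modifies the seeing event
    ("NP", "Det", "N"),
    ("NP", "NP", "PP"),   # PP modifies the noun phrase
    ("PP", "P", "NP"),
]

def count_parses(sentence: str, goal: str = "S") -> int:
    """Count the distinct trees rooted in `goal` that cover the sentence."""
    words = sentence.lower().split()
    n = len(words)
    counts = defaultdict(int)  # (start, end, category) -> number of trees
    for i, word in enumerate(words):
        for category in LEXICON.get(word, []):
            counts[i, i + 1, category] += 1
    for width in range(2, n + 1):
        for i in range(n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for parent, left, right in RULES:
                    counts[i, j, parent] += counts[i, k, left] * counts[k, j, right]
    return counts[0, n, goal]

print(count_parses("I saw the spy"))
print(count_parses("I saw the spy with the telescope"))
```

The grammar alone assigns two structures to the longer sentence and has no way to prefer one; choosing between them for "bird" versus "spy" requires semantic or world knowledge.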


Semantics

Semantics assigns meanings to natural language utterances

A semantic representation must be precise and unambiguous

A good semantics is compositional: the meaning of a phrase is obtained from the meanings of its subphrases
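Compositionality can be sketched with set-valued meanings. In this toy model, which is entirely an assumption (the entity names, the `MODEL` dictionary and the two combination rules are hypothetical), a word denotes a set of entities, an adjective-noun phrase denotes the intersection of its parts, and a sentence is true when every entity in the subject's denotation is also in the verb's.

```python
# Hypothetical model: each word denotes a set of entities.
MODEL = {
    "frogs": {"kermit", "freddo"},
    "green": {"kermit", "leaf"},
    "young": {"kermit", "freddo", "leaf"},
    "sleep": {"kermit", "freddo"},
}

def meaning(tree):
    """Compute the meaning of a tree from the meanings of its subtrees.

    A tree is either a word (str) or a triple (label, left, right).
    """
    if isinstance(tree, str):
        return MODEL[tree]
    label, left, right = tree
    left_meaning, right_meaning = meaning(left), meaning(right)
    if label == "NP":   # intersective modification: green frogs
        return left_meaning & right_meaning
    if label == "S":    # assumed universal reading of the bare plural
        return left_meaning <= right_meaning
    raise ValueError(f"no composition rule for {label}")

green_frogs = ("NP", "green", "frogs")
sentence = ("S", ("NP", "young", green_frogs), "sleep")
print(meaning(green_frogs))
print(meaning(sentence))
```

Nothing in `meaning` looks at whole sentences: each node consults only its immediate children, which is exactly the compositional property.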


Problems of semantic processing

Word sense ambiguity: book; round; about; פגישה

Scope ambiguity:

every student hates at least two courses

every student doesn’t like math

all doors will not open
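The two readings of the first two examples can be written out in first-order notation. This is a sketch under assumed predicate names (student, course, hates, likes) and uses the counting quantifier $\exists^{\geq 2}$ as shorthand for "at least two".

```latex
% every student hates at least two courses
% Reading 1: the courses may differ from student to student
\forall s\,\bigl(\mathit{student}(s) \rightarrow
  \exists^{\geq 2} c\,(\mathit{course}(c) \wedge \mathit{hates}(s,c))\bigr)
% Reading 2: the same two (or more) courses are hated by everyone
\exists^{\geq 2} c\,\bigl(\mathit{course}(c) \wedge
  \forall s\,(\mathit{student}(s) \rightarrow \mathit{hates}(s,c))\bigr)

% every student doesn't like math
% Reading 1: negation inside the quantifier (no student likes math)
\forall s\,\bigl(\mathit{student}(s) \rightarrow \neg\,\mathit{likes}(s,\mathit{math})\bigr)
% Reading 2: negation outside the quantifier (not all students like math)
\neg\,\forall s\,\bigl(\mathit{student}(s) \rightarrow \mathit{likes}(s,\mathit{math})\bigr)
```

In each pair the words are identical and only the relative order of the operators changes, which is why this ambiguity cannot be resolved by syntax alone.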


Co-reference and anaphora:

Kim went home after she robbed the bank

After she robbed the bank, Kim went home

In the next few paragraphs, some preliminary constraints are suggested and problems with them are discussed.

Language is one of the fundamental aspects of human behavior. In written form it serves as a long-term record of knowledge.

VP anaphora: Kim loves his wife and so does Sandy.


Pragmatics

Pragmatics is the study of how more gets communicated than is said.


Presupposition: the presuppositions of a sentence determine the class of contexts in which the sentence can be felicitously uttered:

The current king of France is bald

Kim regrets that he voted for Obama

Sandy’s sister is a ballet dancer


Implicature: what is conveyed by an utterance that was not explicitly uttered:

– How old are you? – Closer to 30 than to 20.

I have two children.

Could you pass the salt?


Speech acts: the illocutionary force, the communicative force of utterances, resulting from the function associated with them:

I’ll see you later

prediction: I predict that I’ll see you later

promise: I promise that I’ll see you later

warning: I warn you that I’ll see you later

I sentence you to six months in prison

I swear that I didn’t do it

I’m really sorry!

Non-literal use of language: metaphor, irony etc.


Discourse

A discourse is a sequence of sentences

Discourse has structure much like sentences do

Understanding discourse structure is extremely important for dialog systems

An example dialog:

When does the train to Haifa leave?

There is one at 2:00 and one at 2:30.

Give me two tickets for the earlier one, please.


Problems of discourse processing

Non-sentential utterances: aha; to Haifa; the last one

Cross-sentential anaphora

Reference to non-NPs:

Kim visited the University of Haifa.

It changed her life.

She does it every year.

It really surprised Sandy.

It was summer then.


World knowledge

– Is the train to Haifa late? – It left Tel Aviv at 8:30.

George W. Bush left for Vietnam today. This is the last foreign visit of the American president.


Processing Hebrew

The script

Writing direction

Deficiencies of the Hebrew writing system

Richness of the morphology

Root-and-pattern word formation

Paucity of linguistic resources


Infrastructure for processing language

Lexicons

Dictionaries

Morphological analyzers and generators

Part-of-speech taggers

Shallow parsers

Syntactic analyzers

Computational grammars


Hebrew processing: the state of the art

Lexicons

Dictionaries

Morphological analyzers and generators

Part-of-speech taggers

Shallow parsers

Syntactic analyzers

Computational grammars


Conclusions

Natural languages are complex

Applications which require deep linguistic knowledge still do not perform well

Applications which can rely on shallow knowledge or on statistical approaches perform better

Hebrew poses additional problems for language processing

To build Hebrew language applications, essential linguistic resources must be developed


Structure of the course

Morphology

introduction to morphology: word structure

inflections and derivations

finite-state automata

finite-state transducers


Syntax

introduction to syntax: the structure of natural languages

context-free grammars: grammars, forms, derivations, trees, languages

parsing: top-down, CYK algorithm, Earley algorithm, bottom-up chart parsing

the complexity of natural languages

the limitations of CFGs

unification grammars: feature structures and unification


Empirical methods in NLP

POS tagging

Shallow parsing

Machine learning and classification

Applications

Other topics

As time permits


Practicalities

Textbook: Nothing mandatory or even recommended. Some of the material can be found in Daniel Jurafsky and James H. Martin, Speech and Language Processing, Prentice-Hall, 2000

Grading: Take-home exam

Attendance: Optional but highly recommended.
