dr. douglas b. lenat , 3721 executive center drive, suite 100, austin, tx 78731

51
March 15, 2006 1 Dr. Douglas B. Lenat , 3721 Executive Center Drive, Suite 100, Austin, TX 78731 Email: [email protected] Phone: (512) 342-4001 2 July 2005 Applications of the Cyc Cyc Formal Ontology Upper Ontology Symposium

Upload: sona

Post on 17-Mar-2016

49 views

Category:

Documents


1 download

DESCRIPTION

Upper Ontology Symposium. Applications of the Cyc Formal Ontology. Dr. Douglas B. Lenat , 3721 Executive Center Drive, Suite 100, Austin, TX 78731 Email : [email protected] Phone: (512) 342-4001. 2 July 2005. A formal ontology has two parts: - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

1

Dr. Douglas B. Lenat , 3721 Executive Center Drive, Suite 100, Austin, TX 78731

Email: [email protected] Phone: (512) 342-4001

2 July 2005

Applications of the CycCyc Formal Ontology

Upper Ontology Symposium

Page 2: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

22 July 2005

The sentences are written in logic, not English, so computers can deeply understand them, not just store them. It can deduce the same sorts of things from them that you or I could.

A formal ontology has two parts:(1) A set of terms (sort of like words)(2) A set of axioms involving those terms

(sort of like sentences built out of them)

Page 3: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

32 July 2005

A formal ontology: terms + axioms (in logic)

CYC: 300k terms, 3.2 million handcrafted axioms.Very general ones (“Upper Ontology”) all the way

down to some domain-specific terms and axioms.

This afternoon’s talk: Formal Ontologies in general

This talk: Examples of how CYC is applied today

Page 4: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

Application#1: Smarter searching

• Query: “Someone smiling”

• Caption: “A man helping his daughter take her first step”

find information

find information

by inference (+KB)

by inference (+KB)

When you become happy, you smile.

You become happy when someone you love accomplishes a milestone.

Taking one’s first step is a milestone.

Parents love their children.

(implies        (and            (isa ?PARENT Person)            (children ?PARENT ?CHILD))

       (loves ?PARENT ?CHILD))

.

Page 5: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

vets

Do you mean:• vets (military veteran) • vets (veterinary surgeon)

Web Results 1 -25

New Search Revise

vets: 25,947 matches

1. Photographs of Cyclo-Vets @ work

2. Veterans National Archives

3. Recommended Vets for Hamster Owners

4. Sponsors on Vets On Line

5. Pops Place BBS Index Page

as fa

st as

usu

al

a sec

ond

later

Page 6: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

Do you mean:• vets (military veteran) • vets (veterinary surgeon)

Web Results 1 -25

New Search Revise

vets: 25,947 matches

1. Photographs of Cyclo-Vets @ work

2. Veterans National Archives

3. Recommended Vets for Hamster Owners

4. Sponsors on Vets On Line

5. Pops Place BBS Index Page

(ex-serviceman OR "military veteran") OR vet OR veteran

AND NOT (veterinarian OR "veterinary surgeon" OR animal)

(ex-serviceman OR ”mili

Page 7: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

Do you mean:• vets (military veteran) • vets (veterinary surgeon)

(ex-serviceman OR ”mili

Web Results 1 -25

New Search Revise

2. Surf Point - Society & Issues: Military/Armed Forces: War Veterans3. A Vet Remembers4. Retail and Wholesale Merchants of Military/ Veteran Goods and Services

1. Veterans News and Information Service - Military, Army, Navy, Marine Corps, Air Force, Coast Guard

(ex-serviceman OR "military veteran") OR vet OR veteran

AND NOT (veterinarian OR "veterinary surgeon" OR animal)

vets: 25,947 matchesvets: 388,109 matches

as fa

st as

usu

al

Page 8: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

vets

Do you mean:• vets (military veteran) • vets (veterinary surgeon)

Web Results 1 -25

New Search Revise

vets: 25,947 matches

1. Photographs of Cyclo-Vets @ work

2. Veterans National Archives

3. Recommended Vets for Hamster Owners

4. Sponsors on Vets On Line

5. Pops Place BBS Index Page

Page 9: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

Do you mean:• vets (military veteran) • vets (veterinary surgeon)

Web Results 1 -25

New Search Revise

vets: 25,947 matches

1. Photographs of Cyclo-Vets @ work

2. Veterans National Archives

3. Recommended Vets for Hamster Owners

4. Sponsors on Vets On Line

5. Pops Place BBS Index Page

veterinarian OR "veterinary surgeon" OR animal OR vet

AND NOT (ex-serviceman OR "military veteran" OR veteran)

veterinarian OR “veteri

Page 10: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

Do you mean:• vets (military veteran) • vets (veterinary surgeon)

Web Results 1 -25

New Search Revise

vets: 25,947 matches

1. Veterinary Book List

2. Advice from The White Cross Veterinary Group

3. Welcome to the World of Eco-Vet

4. Animal Wellness International

5. The economy or management of animals

veterinarian OR "veterinary surgeon" OR veterinary OR vet

AND NOT (ex-serviceman OR "military veteran" OR veteran)

veterinarian OR “veteri

vets: 153,060 matches

Page 11: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

11

Three Improvements to Search

• Deep semantic search involving n axioms (slow)• Add in OR and AND-NOT terms, to reduce the

number of false negatives and false positives• Suggest plausible appropriate follow-on queries

– For veterinarians: how to train to be a vet– For veterans: benefits of reenlisting

Page 12: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

12

Application#2: Deep Question-Answering

• Even 2-3 step reasoning is relatively deep

• Draw on knowledge from all levels of the Cyc ontology (upper, middle, and domain-specific)

• The following examples come from current DTO and AFRL programs transitioned to RDEC

Page 13: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

13

What factors argue <for/against> the conclusion that <ETA> <performed> <the March 2004 Madrid attacks>?

For:- ETA often executes attacks near national election- ETA has performed multi-target coordinated attacks- Over the past 30 years, ETA performed 75% of all terrorist attacks in Spain- Over the past 30 years, 98% of all terrorist attacks in Spain were performed by Spain-based groups, and ETA is a Spain-based group.

Against:-ETA warns (a few minutes ahead of time) of attacks that would result in a high number civilian casualties, to prevent them. There was no such warning prior to this attack.-ETA generally takes responsibility for its attacks, and it did not do so this time.-ETA has never been known to falsely deny responsibility for an attack, and it did deny responsibility for this attack.

Page 14: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

14

Page 15: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

15

Page 16: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

16

Page 17: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

17

murder of rafik hariri

Page 18: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

18

murder of rafik hariri

Page 19: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

19

Page 20: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

20

Page 21: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

21

Page 22: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

22

Page 23: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

23

Page 24: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

24

Page 25: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

25

Page 26: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

26

Page 27: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

27

Page 28: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

28

Page 29: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

29

Page 30: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

30

Cyc ReasoningModules

Interface to External Data Sources

Cyc

API

Know

ledg

e En

try T

oolsUser Interface

(with Natural Language Dialog)

DataBases

WebPages

Text Sources

Other KBs

OtherApplications

KnowledgeAuthors

KnowledgeUsers

ExternalData

Sources

Cyc Ontology & Knowledge Base

A Typical Architecture: Formal Ontology + Inference Engines + Interfaces/API’s

Page 31: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

31

Application#3: Semantic Data Base Integration (Virtual Joins)

• Similar to that last “deep question answering” application, but some of the information is outside the KB: in data bases, on websites, in other ontologies / knowledge bases, etc.

• Map the schema of each of information source to Cyc, and have it call on those external sources as needed, to solve sub-sub-…-problems of the query

Page 32: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

OFAC DB8 USGS NARCL

FBI Most

WantedCATS CDE DB4

DB4

Qusay HusseinUday Hussein

SuspN

DB8PrenomQusai Hussein 30Odai Hussein

Surnom annDec. 31, 1996

Sept. 9, 2003YOB

1964

Data Warehousing: a Quadratic Solution

Page 33: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

you! HAL CYC

QusayHusseinAl-TakritiUdaiHusseinAl-Takriti

(age ?PERSON (YearsDuration ?AGE)) (birthDate ?PERSON ?BIRTH-DATE)

RULES

CONCEPTS

DB4YOB

Qusay HusseinUday Hussein 1964

DB8Prenom annQusai Hussein 30Odai Hussein

OFAC DB8 USGS NARCL

FBI Most

WantedCATS CDE DB4

Dec. 31, 1996

Sept. 9, 2003SuspN

Surnom

1966

32

A Solution that Scales Linearly

(…and, by the way, enables DB population/enrichment)

Page 34: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

DB4YOB

Qusay HusseinUday Hussein 1964

DB8Prenom annQusai Hussein 30Odai Hussein

OFAC DB8 USGS NARCL

FBI Most

WantedCATS CDE DB4

Dec. 31, 1996

Sept. 9, 2003SuspN

Surnom

1966

32

(…and, by the way, enables DB population/enrichment)

A Solution that Scales Linearly

Page 35: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

35

A very recent Cyc SKSI example

“What major US cities are particularly vulnerable to an anthrax attack?”

The answer is logically implied by data dispersed through several sources:

USGSGNISDB

AMVAKB RAND

R

UNFAODB

DTRACATS

DB

Page 36: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

36

“major US city” ?C is a U.S. City with >1M population

“particularly vulnerable to an anthrax attack” – the current ambient temperature at ?C is above freezing,

and– ?C has more than 100 people for each hospital bed,

and– the number of anthrax host animals near ?C exceeds 100k

“What major US cities are particularly vulnerable to an anthrax attack?”

Page 37: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

37

“What major US cities are particularly vulnerable to an anthrax attack?”

U.S. cities with population > 1 million

USGSGNISDB

AMVAKB RAND

R

UNFAODB

DTRACATS

DB

1-2 conjuncts in a

CycL “Ask” expression

Page 38: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

38

1-2 conjuncts in a

CycL “Ask” expression

“What major US cities are particularly vulnerable to an anthrax attack?”

U.S. cities with population > 1 million

USGSGNISDB

AMVAKB RAND

R

UNFAODB

DTRACATS

DB

  (and    (isa ?C USCity)    (> (NumberOfInhabitantsFn ?C) 106)    (vulnerableToScriptedEventTypeUsing      ?C      DeployingABioAgentByInfectingAZoonoticHost

Anthrax-Bacterium))

Page 39: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

39

The Geographic Names Information System (GNIS)DB maintained by the US Geological Survey (USGS).

USGSGNISDB

 state |         name          | type  |     county     | state_fips |  -------+-----------------------+-------+----------------+------------+ TX    | Dallas                | ppl   | Dallas         |         48 | MN    | Hennepin County       | civil | Hennepin       |         27 |    CA    | Sacramento County     | civil | Sacramento     |          6 |    AZ    | Phoenix               | ppl   | Maricopa       |          4 |  

primary_lat | primary_long| elevation | population |     status      | ------------+-------------+-----------+------------+------------------+  32.78333 |       -96.8 |       463 |    1022830 | BGN 1978 1959  45.01667 |      -93.45 |         0 |    1032431 |  38.46667 |  -121.31667 |         0 |    1041219 |  33.44833 |  -112.07333 |      1072 |    1048949 | BGN 1931 1900 1897

Page 40: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

40

The Geographic Names Information System (GNIS)DB maintained by the US Geological Survey (USGS).

USGSGNISDB

So how do we explain to our system that: • row 1 of that table is “about” the city of Dallas, TX

• the population field of that table contains the numberof inhabitants of the city that that row is “about”

• here is exactly how to access tuples of that database

• that access will be fast, accurate, recent, complete

Page 41: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

41

The Geographic Names Information System (GNIS)DB maintained by the US Geological Survey (USGS).

USGSGNISDB

• the population field of that table contains the numberof inhabitants of the city that that row is “about”

We provide the field encodings and decodings, some of which correspond to explicit fields like population, two-letter state codes, etc:

(fieldDecoding Usgs-Gnis-LS ?x       (TheFieldCalled “population”) (numberOfInhabitants

(TheReferentOfTheRow Usgs-Gnis) ?x))

Page 42: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

42

The Geographic Names Information System (GNIS)DB maintained by the US Geological Survey (USGS).

USGSGNISDB

• how to access tuples of that database We provide all the information needed for a JDBC connection script:

We assert, in the context (MappingMtFn Usgs-KS), all of these:

(passwordForSKS Usgs-KS "geografy")(portNumberForSKS Usgs-KS 4032)(serverOfSKS Usgs-KS "sksi.cyc.com")(sqlProgramForSKS Usgs-KS PostgreSQL)(structuredKnowledgeSourceName Usgs-KS "usgs")(subProtocolForSKS Usgs-KS "postgresql")(userNameForSKS "sksi")

Page 43: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

43

The Geographic Names Information System (GNIS)DB maintained by the US Geological Survey (USGS).

USGSGNISDB

• that access will be fast, accurate, recent, complete We provide meta-level assertions about the database, about each table of the database, about the completeness etc. of various kinds of data in the DB, etc.

We assert, in the context (MappingMtFn Usgs-KS):

(schemaCompleteExtentKnownForValueTypeInArg Usgs-Gnis-LSUSCitynumberOfInhabitants 1)

Page 44: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

44

The Geographic Names Information System (GNIS)DB maintained by the US Geological Survey (USGS).

USGSGNISDB

• that access will be fast, accurate, recent, complete We provide meta-level assertions about the database, about each table of the database, about the completeness etc. of various kinds of data in the DB, etc.

We assert, in the context (MappingMtFn Usgs-KS):

(resultSetCardinality Usgs-Gnis-PS        (TheSet (PhysicalFieldFn Usgs-Gnis-PS "state")) TheEmptySet 60.0)

(resultSetCardinality Usgs-Gnis-PS        (TheSet            (PhysicalFieldFn Usgs-Gnis-PS "primary_long")            (PhysicalFieldFn Usgs-Gnis-PS "primary_lat")            (PhysicalFieldFn Usgs-Gnis-PS "name"))        (TheSet            (PhysicalFieldFn Usgs-Gnis-PS "county")            (PhysicalFieldFn Usgs-Gnis-PS "state")) 530.36)

Page 45: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

45

“major US city” U.S. City with >1M population

“particularly vulnerable to an anthrax attack” – the current ambient temperature at ?C is above freezing,

and– ?C has more than 100 people for each hospital bed,

and– the number of anthrax host animals near ?C exceeds 100k

“What major US cities are particularly vulnerable to an anthrax attack?”

Cyc knows that pullets are chickens, so don’t add those two numbers together!

Page 46: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

46

Page 47: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

47

Page 48: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

48

Page 49: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

49

Page 50: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

50

Page 51: Dr. Douglas B. Lenat                 ,  3721 Executive Center Drive, Suite 100, Austin, TX 78731

March 15, 2006

512 July 2005

A formal ontology: terms + axioms (in logic)

CYC: 300k terms, 3.2 million handcrafted axioms.Very general ones (“Upper Ontology”) all the way

down to some domain-specific terms and axioms.

Three of the CurrentApplications of Cyc:• Smarter searching (augment queries with OR and AND-NOT

terms; suggest meaningful follow-up queries)• Relatively deep Question-answering for analysts• Semantic Knowledge Source Integration (SKSI): map

external DBs, websites, ontologies,… to Cyc for it to call on• 100’s more (that we know of) OpenCyc/ResearchCyc apps.Research: Characterize our systems as agents, to interoperate