t a quick tour in digital humanities methods and techniques · digital humanities a reasonable...

38
T A Quick Tour in Digital Humanities Methods and Techniques Antonio Lieto Università di Torino and ICAR-CNR (Italy) Home: http://www.di.unito.it/~lieto/ pPart of these slides based on the paper: Lieto A. (2015) “Computational Models (of Narrative) for Literary Studies”, in Semicerchio, Rivista di Poesia Comparata, Pacini Editore. Available online at http://www.academia.edu/24085161/ Computational_Models_of_Narrative_for_Literary_Studies

Upload: dokhue

Post on 10-Nov-2018

219 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

T A Quick Tour in Digital Humanities Methods and Techniques

Antonio Lieto Università di Torino and ICAR-CNR (Italy)

Home: http://www.di.unito.it/~lieto/

pPart of these slides based on the paper: Lieto A. (2015) “Computational Models (of Narrative) for Literary Studies”, in

Semicerchio, Rivista di Poesia Comparata, Pacini Editore. Available online at http://www.academia.edu/24085161/

Computational_Models_of_Narrative_for_Literary_Studies

Page 2: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

2

There is not a clear definition…

Digital Humanitiesis born of the

encounter betweentraditional humanities

and computationalmethods.

From the printed “text” to the digital world

What are Digital Humanities?

Page 3: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

3

Involves a plethora of disciplines in both the humanities and in the computing sides.

DH applications in:

- Computer Vision and Visualisation Techniques- Natural Language Processing- Information Retrieval- Semantic Technologies- Social Network Analysis- etc.

Digital Humanities (DH)

Page 4: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

4

Involves different processes:

- Digitalization (e.g. Google Books)- Classification of the Material- Metadata Organization- Intelligent and Semantic Document Navigation and

Retrieval- Design of Intelligent User Interface (e.g. for creative

thinking and Serendipity) etc.- Computational Analysis (e.g. for author comparison,

plagiarism detection, stylometric and genre analysis etc.)

Digital Humanities (DH)

Page 5: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Digital Humanities

A reasonable definition:

“Humanities computing is a practice of representation, a form of modeling or, mimicry. It is also a way of reasoning and a set of ontological commitments, and its representational practice is shaped by the need for efficient computation on the one hand, and for human communication on the other” (J. Unsworth, 2014)

In humanities computing, the computer is used as a tool for modeling humanities data and our understanding of it, and that activity is entirely distinct from using the computer when it models other processes.

In DH (or Humanities Computing) the obtained representation/model has to be testable. It must be possibile to test the digital model created of some phenomena to model its validity (it must be possibile to change parameters, to generate new models etc.)

Humanities computing must enable deep and automated analysis. 5

Page 6: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Tension

6

A tension exists in the contemporary era ofthe Digital Humanities, with one wing of thehumanities embracing quantitative methods,the other continuing to insist upon itsroots in qualitative analysis.

The quantitative wing becomes integratedinto the social sciences. The other fightsto defend its autonomy and critical stance.Integration between the two approaches is needed to discover new knowledge.

Distant reading vs Closed Reading

Page 7: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Goal: Narrative Understanding and Reasoning

Ex.

Page 8: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Goal: Narrative Understanding Ex.

Page 9: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

DH and literary studies

I will focus here on some aspects where tools coming to AI have been applied to the analysis of literary texts.

Main Areas and Methodologies Involved: - Text Mining - Social Network Analysis for the visualization

of elements emerging from the text - Ontologies and Knowledge Representation

Page 10: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Text Mining

Area of Computational Linguistics

- Textual documents are transformed and formally treated.

- Usually Vector representations of words (BOW etc.)

- Documents Similarity (different metrics), Pattern Detection, Named Entity Recognition, Extraction and Linking, Authorship Attribution, Plagiarism Detection etc.

Page 11: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

E.g. Text Mining Application

Vector 1 Document 1 NEW Vector

Document (x)

Vector 2 (Document 2)

Cluster 1

Cluster 2

Page 12: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Network Analysis (from TM)

From Bolioli et al. CMN 2013

Page 13: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Ontologies

- Ontological Models can be used to represent and structure the information available in a textual resource (semantic information access, complex information retrieval)

- An ontology is a formal (logic-based) knowledge representation system of a given domain

- They allow describe the semantic content of given element.

- Allows form of automated reasoning (e.g. differently from XML etc.)

Page 14: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Ontological Elements

Page 15: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Ontologie Applicate

• «Applied Ontology make explicit the hidden assumption of the reality: the structure of the world as we see it» (Guarino).

• Le ontologie applicate hanno un goal chiaro: aiutare le persone (e le macchine) a capirsi recipricomente (understanding e communication).

• Differenza con l’ontologia filosofica

• Natura interdisciplinare degli studi sulle Applied Ontology (Philosophy, Cognitive Science, Logic, Linguistics, Computer Science). 15

Page 16: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Ontologie Applicate

• Si arriva alla realizzazione di una ontologia applicata mediante una analisi ontologica: studio del contenuto delle cose, delle nostre assunzioni (concetti) sul mondo fatte indipendentemente dal modo in cui tali assunzioni sono rappresentate (ad es. nel nostro cervello).

• Focus sulla definizione di tali assunzioni/concetti e non sul modo in cui sono rappresentati.

• Prima analisi ontologica poi rappresentazione (Guarino’s motto) 16

Page 17: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

17

Page 18: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Le ontologie supportano forme di ragionamento automatico.

Il ragionamento basato su ontologie è “limitato” a forme di ragionamento logico-deduttivo.

Le ontologie e i ragionatori lanciati su esse non supportano forme di ragionamento non monotòno (induzione, abduzione, default reasoning ecc.)

Page 19: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Logica e Ragionamento/2

Nelle inferenze deduttive la conclusione (C) di una inferenza è una conseguenza logica della premesse (P1...Pn).

Ciò vuol dire che, in questa tipologia di inferenze, non può mai verificarsi il caso che, contemporaneamente, le Premesse siano Vere (V) e la conclusione Falsa (F).

Page 20: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Es. Inferenza Deduttiva

(P1) Giuseppe Garibaldi è un Gremlin. (P2) Tutti i Gremlin sono idraulici.

Quindi: Giuseppe Garibaldi è un idraulico (C)

Page 21: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Es. Inferenza Deduttiva/2

(P1) Giuseppe Garibaldi è un Gremlin. (P2) Tutti i Gremlin sono idraulici.

Quindi: Giuseppe Garibaldi è un idraulico (C)

N.B. Affinché una inferenza sia deduttivamente valida non è necessario che le premesse e conclusioni siano V.

Page 22: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Inferenze non Deduttive

Nelle inferenze non deduttive la conclusione (C) di una inferenza NON è una conseguenza logica della premesse (P1...Pn).

Esistono diverse tipologie di inferenze non deduttive (ad es. l’induzione, l’abduzione, ragionamento per default...).

Page 23: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

(P1) Tutti i cigni osservati sino ad ora in Europa sono bianchi. (P2) Tutti i cigni osservati sino ad ora in Nord America sono bianchi. (Pn) Non sono stati mai osservati cigni che non fossero bianchi.

(C): Tutti i cigni sono bianchi Es. Induzione

Page 24: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

(P1) L’assassino ha sporcato di fango il tappeto. (P2) Chiunque fosse entrato dal giardino avrebbe sporcato di fango il tappeto.

(C) Quindi: L’assassino è entrato dal giardino.

Es. Abduzione

Page 25: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

(P1) Gli uccelli, salvo alcune eccezioni, sono in grado di volare. (P2) Titti è un uccello.

(C) Titti è in grado di volare Es. Default

Page 26: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

(P1) Gli uccelli, salvo alcune eccezioni, sono in grado di volare. (P2) Titti è un uccello.

(C) Titti è in grado di volare Es. Default

Supponiamo di aggiungere due nuove premesse:

(P3)Titti è un pinguino (P4) I pinguini sono uccelli che non sono in grado di volare (C) Titti è in grado di volare

Page 27: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

27

Page 28: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Ontology Reasoning/1

Esempio tratto da Enrico Franconi, Tutorial «Ontologies and Databases», ESWC 2012.

Page 29: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Ontology Reasoning/2

Page 30: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Slide from Nicola Guarino

Page 31: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling
Page 32: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Simple reasoning

What’s the idea…

Page 33: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Upshots

• I presented a brief outline of the main methods and techniques of possible interests for scholars involved in literary studies and analysis.

Page 34: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

34

Page 35: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

35

Page 36: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

36• http://selection.datavisualization.ch/

Page 37: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Alcuni link e progetti..• http://perunaenciclopediadantescadigitale.eu/dantesources/

index.html Dante Sources, visualizzare la lista e la distribuzione dei testi utilizzati da Dante durante la composizione delle sue opere.

• http://digitaldante.columbia.edu/Digital Dante, uno dei primi progetti di DH

• Manoscritti medioevali: https://medium.com/fonti-storiche-online/manoscritti-medievali-dove-cercarli-online-63fe5c3a4ca0#.qbsia3j2o

• https://litlab.stanford.edu/pamphlets/ Literary lab Stanford

• http://app.labyrinth-project.it:8080/LabyrinthTest/# Labyrinth (Demo Version). Esempio di Tecnologia Semantica utilizzata per la navigazione di archivi digitali. Link: http://drops.dagstuhl.de/opus/volltexte/2013/4149/pdf/p076-damiano.pdf

• ne esistono molti altri…

37

Page 38: T A Quick Tour in Digital Humanities Methods and Techniques · Digital Humanities A reasonable definition: “Humanities computing is a practice of representation, a form of modeling

Domande/Curiosità/

Riflessioni?

[email protected]