new discovery tools for digital humanities and spatial data (summary of the july, brown bag talk by...

New Discovery Tools for Digital Humanities and Spatial Data MIT July 2014 Merrick Lex Berman Center for Geographic Analysis, Harvard

Upload: micah-altman

Post on 16-May-2015


DESCRIPTION

My colleague (Merrick) Lex Berman, who is Web Service Manager & GIS Specialist at the Center for Geographic Analysis at Harvard, presented this as part of the Program on Information Science Brown Bag Series. Lex is an expert in applications related to digital humanities, GIS, and Chinese history, and has developed many interesting tools in this area. In his talk, Lex notes how the library catalog has evolved from the description of items in physical collections into a wide-reaching net of services and tools for managing both physical collections and networked resources: the line between descriptive metadata and actual content is becoming blurred. Librarians and catalogers are now in the position of being not only docents of collections but also innovators in digital research, and this opens up a number of opportunities for retooling library discovery tools. His presentation surveyed methods and projects that have extended traditional catalogs of libraries and museums into online collections of digital objects in the humanities, focusing on projects that use historical place names and geographic identifiers for linked open data.

TRANSCRIPT

Page 1: New Discovery Tools for Digital Humanities and Spatial Data (Summary of the July, Brown Bag Talk by Lex Berman)

New Discovery Tools

for Digital Humanities and Spatial Data

MIT July 2014

Merrick Lex Berman

Center for Geographic Analysis, Harvard

Page 2:

Physical discovery finding {scrolls}

John Willis Clark, The Care of Books (Cambridge, 1901). Paul Pelliot at the Mogao Caves, Dunhuang (1908).

Page 3:

Transmission reading {texts}

John Willis Clark, The Care of Books (Cambridge, 1901). The Emperor and the Assassin (1999).

Page 4:

Bound and chained preserving information {locks}

St. Walburga Library, Zutphen, Netherlands (1564). William Andrews, Curiosities of the Church: Studies of Curious Customs, Services and Records (1891).

Page 5:

Abyssinian manuscript (ms 134, Goldschmidt no 21), Frankfurt University Library. Bibliotheque Nationale.

Binding pages: forming collections systematizing {knowledge}

Page 6:

indexing {resources}

Denver Public Library – Horses, biography. John Allen, “Farewell Cards,” On Wisconsin Magazine, Summer 2012.

Page 7:

Internal catalogs → federated catalogs → digital indexing {URIs}

Page 8:

accessing {digital objects}

Page 9:

traversing {internal structures}

Denver Public Library – Horses, biography. John Allen, “Farewell Cards,” On Wisconsin Magazine, Summer 2012.

Page 10:

integrating {digital surrogates}

Page 11:

Are footnotes connections or stops? citing {references}

“A Squib, from Dr. Jortin’s Tabby-Cat,” The Gentleman’s Magazine (April 1792).

Page 12:

CHARON http://en.wikipedia.org/wiki/Charon_(mythology)

http://en.wikipedia.org/wiki/Greek_mythology

From references to graphs traversing {links}
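
The move from references to a link graph can be illustrated with a small traversal. This is only a sketch in Python: the first two page names come from the URLs on this slide, while `Hypothetical_page` and the edges between the pages are invented for illustration.

```python
from collections import deque

# Link graph: page -> pages it links to. The edges are illustrative only;
# "Hypothetical_page" is an invented placeholder.
links = {
    "Charon_(mythology)": ["Greek_mythology"],
    "Greek_mythology": ["Charon_(mythology)", "Hypothetical_page"],
    "Hypothetical_page": [],
}


def reachable(graph, start):
    """Return the pages reachable from start, in breadth-first order."""
    seen = [start]
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in graph.get(page, []):
            if target not in seen:  # skip pages already visited
                seen.append(target)
                queue.append(target)
    return seen


print(reachable(links, "Charon_(mythology)"))
```

Unlike a footnote, which stops at the cited item, each visited page here contributes further links to follow.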

Page 13:

William Cunningham, The Cosmographical Glasse

(John Day, 1559).

Rational structure

Page 14:

a tiny slice of the Internet (showing 5 million edges)

The center will not hold!

To a noisy interconnected universe of knowledge {graph}

Page 15:

So what happens to the library?

Philology Library at the Free University of Berlin

Page 16:

On the road from DH 1.0 to DH 2.0

monolithic database projects - custom data models - custom user interfaces

Patrik Svensson, The Landscape of Digital Humanities

http://digitalhumanities.org/dhq/vol/4/1/000080/000080.html

Page 17:

monolithic database projects - custom data models - custom user interfaces

open APIs - machine readable - metadata and content

Page 18:

monolithic database projects - custom data models - custom user interfaces

open APIs - machine readable - metadata and content

linked open data - machine readable - disaggregated

interoperability - ontologies

Page 19:

monolithic database projects - custom data models - custom user interfaces

open APIs - machine readable - metadata and content

linked open data - machine readable - disaggregated

interoperability - ontologies

open scholarship - digital publication - annotation - interpretation - curation + narrative - collaborative - open-ended

Page 20:

Geographic data and the library

Geospatial datasets - GIS

Page 21:

Geographic data and the library

Geospatial datasets - GIS

Maps and scanned maps

Page 22:

Geographic data and the library

Geospatial datasets - GIS

Maps and scanned maps

The catalog as geodata

Page 23:

Geospatial datasets

Page 24:

Maps and scanned maps

Federated search system

Digital Repository

Page 25:

The catalog as geodata geocoding {toponyms}

651 - Subject Added Entry-Geographic Name

https://maps.googleapis.com/maps/api/geocode/json?address=Georgia
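
The request shown above can be assembled from a 651 heading roughly as follows. This is a minimal Python sketch, not the implementation used in the talk: the function name `geocode_url` is hypothetical, no request is actually sent, and a real call to the geocoding service would likely also need an API key parameter, which is omitted here.

```python
from urllib.parse import urlencode

# Endpoint taken from the slide; everything else here is illustrative.
GEOCODE_ENDPOINT = "https://maps.googleapis.com/maps/api/geocode/json"


def geocode_url(place_name: str) -> str:
    """Build a geocoding request URL for one place-name string."""
    # urlencode escapes spaces and punctuation found in subject headings.
    return GEOCODE_ENDPOINT + "?" + urlencode({"address": place_name})


print(geocode_url("Georgia"))
```

The slide's own example is a useful test case because "Georgia" names both a country and a U.S. state, so the geocoder has to pick one without any help from the catalog record.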

Page 26:

What to geocode? parsing {tokens}

651 - Geographic Name

260 – Place of Publication

245 – Title
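
One way to picture the parsing step is to pull a candidate place string out of each of the three fields. The sketch below is a Python illustration under several assumptions: the record is a hypothetical, simplified dictionary rather than real MARC, the splitting rules are guesses at common display punctuation, and `candidate_places` is an invented name.

```python
import re

# Fields named on the slide, in the order they are tried.
GEOCODABLE_TAGS = ("651", "260", "245")

# Hypothetical, simplified record: MARC tag -> list of display strings.
record = {
    "245": ["The care of books"],
    "260": ["Cambridge, 1901"],
    "651": ["Georgia -- History"],
}


def candidate_places(rec):
    """Return (tag, token) pairs that could be sent to a geocoder."""
    candidates = []
    for tag in GEOCODABLE_TAGS:
        for value in rec.get(tag, []):
            if tag == "651":
                # Keep only the leading geographic name of the heading.
                token = value.split("--")[0]
            elif tag == "260":
                # Assume the place precedes the first ':', ',' or ';'.
                token = re.split(r"[:,;]", value)[0]
            else:
                # Titles are free text; pass the whole string through.
                token = value
            token = token.strip()
            if token:
                candidates.append((tag, token))
    return candidates


print(candidate_places(record))
```

Keeping the source tag with each token matters because a place of publication and a subject place mean different things on a map.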

Page 27:

Mapping options: georss → openlayers serializing {feeds}

http://goo.gl/UTpwyt
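
As a rough illustration of the serializing step, geocoded records can be written out as Atom entries carrying a GeoRSS-Simple point. This Python sketch is an assumption about what such a feed might look like, not the feed behind the link above: the title and coordinates are invented, and a complete Atom feed would need further required elements (such as id and updated) that are left out.

```python
import xml.etree.ElementTree as ET

ATOM = "http://www.w3.org/2005/Atom"
GEORSS = "http://www.georss.org/georss"
ET.register_namespace("atom", ATOM)
ET.register_namespace("georss", GEORSS)


def georss_entry(title, lat, lon):
    """Build one Atom entry with a GeoRSS-Simple point."""
    entry = ET.Element(f"{{{ATOM}}}entry")
    ET.SubElement(entry, f"{{{ATOM}}}title").text = title
    # GeoRSS-Simple writes a point as "latitude longitude".
    ET.SubElement(entry, f"{{{GEORSS}}}point").text = f"{lat} {lon}"
    return entry


feed = ET.Element(f"{{{ATOM}}}feed")
# Hypothetical record and coordinates, for illustration only.
feed.append(georss_entry("Hypothetical catalog record", 40.1, 94.8))
xml_text = ET.tostring(feed, encoding="unicode")
print(xml_text)
```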

Page 28:

Mapping options: json → leaflet serializing {json}

http://goo.gl/NGLfh6
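
The slide does not say which JSON structure was used. One common choice that Leaflet can render is GeoJSON, so the Python sketch below assumes that format; the record titles and coordinates are invented for illustration.

```python
import json


def to_feature(title, lat, lon):
    """Wrap one geocoded record as a GeoJSON point feature."""
    return {
        "type": "Feature",
        # GeoJSON orders coordinates as [longitude, latitude].
        "geometry": {"type": "Point", "coordinates": [lon, lat]},
        "properties": {"title": title},
    }


def to_feature_collection(records):
    """Serialize (title, lat, lon) tuples as a FeatureCollection."""
    return {
        "type": "FeatureCollection",
        "features": [to_feature(*r) for r in records],
    }


# Hypothetical records: (title, latitude, longitude).
records = [("Record A", 40.1, 94.8), ("Record B", 52.1, 6.2)]
collection = to_feature_collection(records)
print(json.dumps(collection, indent=2))
```

The longitude-first ordering is the usual source of markers landing in the wrong place when latitude and longitude arrive from a geocoder in the opposite order.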

Page 29:

Mapping options: customized from Metacarta geocoding product

Page 30:

Mapping options: geoHollis beta demonstration

http://goo.gl/jorMvG

Page 31:

Issues raised: geoHollis beta

geocoding

- Identification of placenames in metadata varies in accuracy

- Ambiguous matches cannot be easily fixed

- Geocoding many types of information can be confusing

- No explicit way to match geocoded words to point on map

clustering

- Max number of search results greatly impacts the clusters

- The geocoded words don’t explain how the cluster was decided

facets

- adding and removing facets is essential
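
The effect of the result limit on clusters can be shown with a toy example. The Python sketch below uses simple grid-cell counting, which is an assumption for illustration and not necessarily how geoHollis clusters results; the coordinates are hypothetical.

```python
import math
from collections import Counter


def grid_clusters(points, cell_deg=10.0, max_results=None):
    """Count points per grid cell, optionally truncating the result list."""
    if max_results is not None:
        # A search interface typically returns only the first N hits.
        points = points[:max_results]
    counts = Counter()
    for lat, lon in points:
        cell = (math.floor(lat / cell_deg), math.floor(lon / cell_deg))
        counts[cell] += 1
    return counts


# Hypothetical geocoded hits as (latitude, longitude), in ranked order.
points = [(42.4, -71.1), (42.3, -71.0), (41.9, -70.5), (33.7, -84.4), (41.7, 44.8)]

all_hits = grid_clusters(points)
first_two = grid_clusters(points, max_results=2)
print(len(all_hits), "clusters with all hits")
print(len(first_two), "cluster(s) with only the first two hits")
```

With every hit included, three cells are occupied; cutting the list to the first two hits leaves a single cluster, so the map shows a different picture of the same query.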

Page 32:

Moving forward: the semantic web linking data {rdf}

Richard Cyganiak, Linked Open Data Cloud (2011).

Page 33:

linking geodata {rdf}
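
At its simplest, linking geodata in RDF means stating that one resource is related to a place identified by a URI. The Python sketch below writes such a statement as an N-Triples line. It assumes the Dublin Core `spatial` term as the predicate, which is only one possible choice, and both `example.org` URIs are placeholders rather than real identifiers.

```python
# Dublin Core term for spatial coverage; its use here is an assumption.
DCTERMS_SPATIAL = "http://purl.org/dc/terms/spatial"


def ntriple(subject, predicate, obj):
    """Format one triple of URIs as an N-Triples line."""
    return f"<{subject}> <{predicate}> <{obj}> ."


# Placeholder URIs for a catalog record and a gazetteer place.
triple = ntriple(
    "http://example.org/catalog/record/1",
    DCTERMS_SPATIAL,
    "http://example.org/gazetteer/place/1",
)
print(triple)
```

Because the place is a URI rather than a text string, two collections that point at the same gazetteer entry become connected without either one knowing about the other.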

Page 34:

Locations in Classical Texts and Atlases

Perseus Project

Pleiades

Google Ancient Places

Page 35:

Pelagios - linking historical objects to places - using Pleiades

Pelagios API http://pelagios.dme.ait.ac.at/api

Documentation http://pelagios-project.blogspot.com/

Pleiades

Page 36:

Pelagios - linking historical objects to places - using CHGIS

Pelagios API http://pelagios.dme.ait.ac.at/api

Documentation http://pelagios-project.blogspot.com/

Pleiades CHGIS

Page 37:

XML web service for Chinese historical placenames:

Page 38:

http://chgis.hmdc.harvard.edu/tgaz/api

http://chgis.hmdc.harvard.edu/xml/

CHGIS Gazetteer Web Service

Temporal Gazetteer Web Service

Faceted search web service for historical placenames

Page 39:

RDF gazetteer interchange format

Page 40:

Potential connections: LOD projects in DH world

Pelagios

Page 41:

Implications for the future

Transformation of the catalog:

- metadata describing resources → direct access to digital resources

- users expect instant, machine-readable information

- human readable forms & facets → APIs

- users expect both discovery and deeper analysis of content

- aggregated data and visualizations of data rather than lists

- linked open data and the semantic web will encroach into library science

- graph databases and structures will get applied to search tools

For Geographic data:

- search and visualization using maps will be more commonplace

- catalog metadata will be geocoded (internally or on-the-fly)

Page 42:

Map to catalog spatial indexing {metadata}
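
Going from a map to the catalog amounts to asking which geocoded records fall inside the area the user has selected. The Python sketch below shows the idea with a plain bounding-box filter. It is a linear scan rather than a true spatial index, it ignores boxes that cross the antimeridian, and the records and coordinates are hypothetical.

```python
def in_bbox(lat, lon, south, west, north, east):
    """True if the point lies inside the box (no antimeridian handling)."""
    return south <= lat <= north and west <= lon <= east


def search_by_map(records, bbox):
    """Return records whose coordinates fall inside bbox."""
    south, west, north, east = bbox
    return [
        r for r in records
        if in_bbox(r["lat"], r["lon"], south, west, north, east)
    ]


# Hypothetical geocoded catalog records.
records = [
    {"title": "Record A", "lat": 40.0, "lon": 94.8},
    {"title": "Record B", "lat": 52.1, "lon": 6.2},
]

# Map extent as (south, west, north, east) in degrees.
hits = search_by_map(records, (35.0, 90.0, 45.0, 100.0))
print([r["title"] for r in hits])
```

A production catalog would more likely delegate this to a spatially indexed search backend, but the query shape stays the same: a map extent goes in and catalog records come out.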

Page 43:

Kalev Leetaru, “Fulltext Geocoding Versus Spatial Metadata for Large Text Archives: Towards a Geographically Enriched Wikipedia,” D-Lib Magazine (Sep-Oct 2012).

http://goo.gl/VUWsvO

Unbound and linked analyzing {metadata}

Page 44:

Interesting examples using spatial search:

- New York Public Library – Maps Division

- Ed Summers ici webmap for Wikipedia

- Europeana.eu

- Spatial History Project - Stanford

- Spatial Humanities – Scholar’s Lab - UVA

- MapRankSearch – Klokan Tech

- Digital Silk Road – Stein Placename Database - GeoCLEF 2008

- Biodiversity of the Hengduan Mountains

Page 45:

Bibliography on big data and spatial humanities

https://www.zotero.org/groups/dh_and_big_data/items

Temporal Gazetteers and Linked Geodata

http://www.fas.harvard.edu/~chgis/gazetteer