Knowledge maps for libraries and archives - uses and use cases
International UDC Seminar "Classification & Authority Control: Expanding Resource Discovery” , The National Library of Portugal in Lisbon, 29-30
October 2015.
Andrea Scharnhorst, eHumanities DANS/KNAW (Netherlands)Richard P. Smiraglia, University of Wisconsin, Milwaukee (USA)
Christophe Guéret, eHumanities DANS/KNAW (Netherlands)Alkim Almila Akdag Salah, eHumanities DANS/KNAW (Netherlands)
Functions of Knowledge MapsRaising awarenessBeing used as reference system and for navigationBeing used in the process of data curation and management
Being used as heuristic device in research
Raising awarenesshttp://mtchl.net/tag/generousinterfaces/ http://www.slq.qld.gov.au/showcase/discover-the-queenslander#/grid
http://www.drasticdata.nl/ProjectDANSEasy/index.htm
Macroscopes as heuristic devices in research on the evolution of KOS
How is the collection of library situated in time and (topical) space?
What can we learn from the instantiation of UDC numbers is catalogue data
about the UDC?about the use of the UDC?via the UDC about the collection?
Former research
Comparison of the topical profile of Wikipedia/EN and UDC Evolution of the UDC according to the Master Reference Files Instantiations in libraries Sample of OCLC data (records with UDC numbers) Sample of the library KU Leuven
The Portuguese case PORBASE (the union catalogue of the Portuguese libraries) http://oai.bn.pt/servlet/OAIHandler set: porbase
Catálogo BNP (the catalogue of the BNP) http://oai.bn.pt/servlet/OAIHandler set: catalogo
BND domínio público (a dataset of the National Digital Library public domain files) http://oai.bn.pt/servlet/OAIHandler set: bndlivre
Courtesy of the BNP, thanks to Maria Ines Cordeiro
Baseline statisticsData set # records #records with
a xxxx year format
#records with more than one UDC
classBNP porbase 1 115 278 520 412 349 029
BNP catalogo 877 677 338 505 369 718
BNP bndlivre 21 526 2 670 12 437
PORBASE – The Portuguese Union Catalog
Social Sciences
Applied Sciences. Medicine, Technology
Linguistics. Literature
Comparing profiles - Publication Dates
Dates in the WorldCat ranged from 1714 to 2009, with the majority clustered between 1973 and 2009.Dates in Leuven ranged from 1599 to 2011, with the majority post-dating 1960. Dates in PORBASE start end of the the 19th century, with most entries between 1960 and 2000.In the BNP we see a wider spectrum of years: 1700, 1775, 1825 and 1900 and with peaks in 1875 and 2000.Records from the BND span the period between1500 and 2000, with the largest number of records in the years 1875 and 2000.
UDC Main Classes – across setsIn the WorldCat all main classes occurred1 and 2 (4%); 0 5 7 9 (8-10%); 3 (22%), 6 (18%), 8 (16%)
In Leuven8, 1, (1-2%); 7, 9, 5 (8-11%); 6 (16%), 2 (18%), 3 (21%)
PORBASE 3 is the largest UDC class, 6, 7, 8, 9, 0 have much smaller number of records, few can be found in 1 or 2
BNPMost of the UDC numbers are in class 9 and almost as many in class 7, small groups in classes 3 and 6, very few in 0, 1, 2 and 8
BNDMost records are in 9 and 7, and a small batch can be found in 3.
0 = Science and Knowledge….1 = Philosophy, Psychology2 = Religion. Theology3 = Social sciences5 = Mathematics, Natural sciences6 = Applied sciences, Medicine, Technology7 = Arts, Entertainment, Sports8 = Linguistics, Literature9 = Geography, History
Conclusions (I) – functions of visual explorations and knowledge maps Curation and management
Showcasing and raising awareness
Navigation
Research
Conclusions II – functions of visual explorations and knowledge maps Research
Structure and evolution of KOS – large scale and comparative analysis
New ways for library statistics, and Library Catalogue Analysis
Big Data analysis (incl. visual analytics) to curate ‘messy data’
Topical analysis, Science Dynamics and Digital Humanities
Ginda, Börner,Scharnhorst 2015
References• Börner, Katy (2010) Atlas of Science, MIT Press; ~ (2014) Atlas of Knowledge, MIT Press
• Scharnhorst, A. (2015). Walking through a library remotely. Why we need maps for collections and how KnoweScape can help us to make them. Les Cahiers Du Numérique, 11(1), 103–127. doi:10.3166/lcn.11.1.103-127
• Smiraglia, R. P., Scharnhorst, A., Akdag Salah, A., & Gao, C. (2013). UDC in Action. In A. Slavic, A. Akdag Salah, & S. Davies (Eds.), Classification and visualization: interfaces to knowledge (pp. 259–270). Würzburg: Ergon Verlag. Preprint available at http://arxiv.org/abs/1306.3783
• Akdag Salah, Almila, Cheng Gao, Krzysztof Suchecki, Andrea Scharnhorst, and Richard P. Smiraglia. 2012. “The Evolution of Classification Systems: Ontogeny of the UDC.” In Categories, Contexts and Relations in Knowledge Organization: Proceedings of the Twelfth International ISKO Conference 6-9 August 2012 Mysore, India, ed. A. Neelameghan and K.S. Raghavan, 51–57. Würzburg: Ergon Verlag.
• Ginda, M., Scharnhorst, A., & Borner, K. (2015). Modelling the Structure and Dynamics of Science Using Books. Digital Libraries; Physics and Society. Retrieved from http://arxiv.org/abs/1503.03287 (preprint)
• Christine L. Borgman, Andrea Scharnhorst, Henk Van den Berg, Herbert Van de Sompel, and Andrew Treloar. "Who Uses the Digital Data Archive? An Exploratory Study of DANS" Information Today (2015).