linking literature to astronomy data with dois · •make telescope bibliographies easier...

14
Linking Literature to Astronomy Data with DOIs Some Challenges Sarah Weissman – STScI/MAST Code4Lib DMV 2017

Upload: others

Post on 08-Jun-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

LinkingLiteraturetoAstronomyDatawithDOIs

SomeChallengesSarahWeissman – STScI/MAST

Code4LibDMV2017

Page 2: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

STScI/MAST– whoarewe?

• Hubble!JWST!• …inBaltimore!

Imagecredits:HST,NASA2009viahubblesite.org;JWST,NorthrupGrummanviawebbtelescope.org;DivinefromPinkFlamingos,viabaltimoreorless.com;Mikulski,STScI viaYouTube

Page 3: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

DataDOIs

• DOI=DigitalObjectIdentifier.It’sapermanentlinktoadigitalobject.• URL+ID+metadatacontainer

• DataDOIdemo

Page 4: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

WhatareDOIsgoodfor?

• Globallyresolvable(kindoflikeURLs)• Machinereadable(likeURLs)• Persistentlinks(aslongasyouupdatethem)• Usuallycomepackagedwithsomekindofmetadata(ifyoucanfindit)• Noteasilybookmarked• Obscuretheirdestination(likebit.ly)• Havetobeupdated• Meantformachines,nothumans(10.7059/T9G0bled33)

ByWilliams,HenrySmith,1863-;Williams,EdwardHuntington,1868-1944,jointauthor[Norestrictions],viaWikimediaCommons

Page 5: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

WhyDataDOIs

• Allowastronomerstolinktotheirdatainastandardizedway• Convenience• Reproducibility,openness

• MakeTelescopebibliographieseasier• Largelyamanualprocesscurrently• Archiveplanning• Justifyfunding

Page 6: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

DataDOIsatMAST

• CollaborationbetweenMAST,AASPublishingandtheSTScI Library• DebutedourDOIserviceinApril2016,currentlyinBetamode• ~14STScI authorshavecreatedDOIsforpublication• Fall/Winter2017- Plantoopenserviceto12otherinstitutions.• Links:• http://archive.stsci.edu/doi/search/ (MainDOIentrypoint)• https://mast.stsci.edu/portal/DOI/help (DOIAPIdocumentation)

Page 7: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building
Page 8: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

Challenges– Permanence&Uniqueness

• HowtomakesurethatyourDOIlinkskeepworking?• Landingpage+service(demo)• YALI– yetanotherlevelofindirection

• Howtolinkto”data”?• Dataisoftenthenot-well-definedglobofstuff(files,databaserecords)

• Whatleveltolinktoyourdata?• Observation,dataproduct

• Onceascientistdownloadsdata,theytypicallytransformit,soitnolongerresemblesitsforminthearchive.

• Hadadatamodel(CAOM),soweusedit• Evengiventhis,thingsaremessy.IDformatsnotwell

definedandnotactuallyunique!

Page 9: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

Challenges– Buyinfrompublishers

• LuckilywehadagoodworkingrelationshipwithAASJournalsandEJpress• Luckilytheworldofastronomywrt dataandpublishingisrelativelyopen.(E.g.http://adswww.harvard.edu/)• Publishersaren’tbuildingtheirownsoftware.• Buildrelationshipswitheachpublisher• (Publisherhastobuildupprogramswitheachdatacenter.)

Page 10: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

Challenges– Metadata

• Areanumberofstandardsformetadata– DataCite,ERC,DC,CrossRef• WewentwithDataCite **BUT**wedon’twantdataDOIstobefirstclasscitableobjects.• Adhoccollectionsofdatathatcouldintheorychange• UsuallywhenanastronomerpublishesadatasetthereisapaperandTHATshouldbecited

• DataCite hasdomain-specificelements(relatedIdentifierType),whichmakesithardtouseforgeneralpurposemetadata.

Page 11: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

Challenges– LargeData

• Limitationsonoursoftware(Javascript)onlyallowuserstoworkwithsomuchdataatonce.• Largedatasetslikecatalogscancontainmillions,evenbillionsofrows,howtoefficientlyrepresentanysubsetofthisdata?

RobertWilliamsandtheHubbleDeepFieldTeam(STScI)and NASA

Page 12: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

Challenges– API

TFW:• YouwantyourDOIlandingpageURLtocontaintheDOI,butitdoesn’texistyet.• YourAPIisreallyathinlyveiledproxyforanotherAPI.• YourAPIissupposedtobegenerallyapplicable,butit’sactuallyinextricablylinkedtothisgddmnJavaScriptGUI.• Youridentifiersaren’trecognizedbyDataCite soyouhavetouseacustomizedmetadataformat.

Page 13: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

FutureWork

• GetoutofBeta.Expandtomoreusers.• Fullyintegratewithourdatasearchtool(mast.stsci.edu).Rightnowwearejustaclient.• ProvidemorelinksbetweenrelatedDOIs,relatedliterature.• **Usedatataggingtobuildtoolsfordataenrichment.**

Page 14: Linking Literature to Astronomy Data with DOIs · •Make Telescope bibliographies easier •Largely a manual process currently •Archive planning ... •Publishers aren’t building

Questions?

• sweissman [at]stsci.edu