the worldwidescience alliance: an international partnership to improve access to scientific and...
TRANSCRIPT
The WorldWideScience Alliance: An International Partnership to Improve Access to Scientific and
Technical Information
Lorrie A. JohnsonUnited States Department of Energy
Office of Scientific and Technical Information
• What is WorldWideScience.org?• Launch of WorldWideScience.org and Formation of the
WorldWideScience Alliance• Search Technology• Multilingual Translations• Integration of Multimedia and Scientific Research Data• Role in Public Access• Future Directions
What is WorldWideScience.org?
A global science gateway comprised of over 100 national and international scientific databases and portals
More than 70 countries are represented
History and Formation of the WorldWideScience Alliance
• WorldWideScience.org concept emanated from Science.gov model (2006)
• Initial partnership between U.S. Department of Energy and the British Library (2007)
• Transition to multilateral governance (WorldWideScience Alliance) and ICSTI* sponsorship (2008)
*International Council for Scientific and Technical Information*International Council for Scientific and Technical Information
The Search Technology Behind WorldWideScience.org
Federated search technology: Searches the “deep web,” which may not be searchable by
major search engines Performs a real-time, simultaneous search of participating
databases Overcomes barrier of needing to know about all resources Little or no burden on database owners Returns a consolidated, relevance-ranked results list Links to original records at source databases, including full text
if available Integrates symbiotic technologies to further accelerate
scientific discovery
A Measure of WorldWideScience.org’s Uniqueness
90%“Unique”
33 sample queries launched in Google, Google Scholar, and WorldWideScience.org
Similar quantities in the numbers of results, but very little overlap
Among the “top 50” results from each search engine, only ~10% overlap – or 90% uniqueness – in WorldWideScience.org results
WorldWideScience.org Topic Pages
Utilizing the power of Google, Bing, Yahoo, and other search engines:
Millions of Topic Pages created
Exposes distributed WorldWideScience.org content to commercial search engines via site map protocols
Multilingual Translations
The world’s first “one to many” and “many to one” multilingual translations tool in science
• Most automatic translations are limited to translating from a single language into another single language
• WorldWideScience.org partnering with Microsoft® Translator enables true multilingual functionality
Multilingual Translations
Translating ten languages, with potential for more:
ArabicChinese 中文 GermanDeutsch EnglishSpanishEspañol FrenchFrançaisJapanese 日本語Korean 한국어Portuguese PortuguêsRussianРусский
User enters search term and selects language
Query is translated into languages of all databases, searches are completed, and results are combined into a single, relevance-ranked results list
User translates results into their preferred language
Translated bibliographic record (English to German)
Multimedia and Speech Indexing
Multimedia (e.g., video, audio, images) represents a major emerging form of scientific information
Multimedia presents special opportunitiesand challenges – lack of written transcripts, minimal metadata, scientific/technical/medical terminology, lengthy videos (>1 hour)
Multimedia Search Results
Videos accessible
Search terms identified via snippets
Users can play the exact segments of the videos where search terms were spoken
Integration of Scientific Research Data
Scientific Research Data can be:
Difficult to find…. and Hard to navigate
TTT F Phe TTC F Phe TTA L Leu TTG L Leu i TCT S Ser TCC S Ser TCA S Ser TCG S Ser TAT Y Tyr TAC Y Tyr TAA * Ter TAG * Ter TGT C Cys TGC C Cys TGA * Ter TGG W Trp CTT L Leu CTC L Leu CTA L Leu CTG L Leu i CCT P Pro CCC P Pro CCA P Pro CCG P Pro CAT H His CAC H His CAA Q Gln CAG Q Gln CGT R Arg CGC R Arg CGA R
Results containing research and numeric datasets
A data result, showing the landing page for this dataset
Research Data, via the landing page, is accessible
Data can be viewed or downloaded
WorldWideScience.org and Public Access
Within the United States,
White House Office of Science and Technology Policy (OSTP) Directive – February 2013 (addressed both scholarly publications and digital research data)
U.S. Department of Energy released public access plan in July 2014, and launched the Public Access Gateway for Energy & Science (PAGESBeta) in August 2014
Public access results from the U.S. Department of Energy’s PAGESBeta
Record from PAGESBeta
The Accepted Manuscript will be available 12 months after publication
Public Access as a Growth Opportunity for WorldWideScience.org
Other U.S. federal agencies with public access plans include: U.S. Department of Agriculture Department of Defense National Aeronautics and Space Administration National Institute of Standards and Technology Centers for Disease Control Food and Drug Administration Agency for Healthcare Research and Quality National Institutes of Health National Oceanic and Atmospheric Administration National Science Foundation Office of the Assistant Secretary for Preparedness and Response
Canada: “Tri-agency Open Access Policy on Publications” released in February 2015
United Kingdom: Research Councils’ open access policy, in place for several years
Other countries in progress….
As public and open access content grows, new resources will be incorporated into WorldWideScience.org (scholarly publications and digital research data)
With the extension of federated searching on an international scale, combined with multilingual translations, WorldWideScience.org enables access to a diverse array of scientific and technical content in traditional and non-traditional formats