Metadata towards an e-research cyberinfrastructure



Page 1: Metadata towards an  e-research cyberinfrastructure

Introduction DC 2006 1

Metadata towards an e-research cyberinfrastructure

The case of French ETDs

Page 2: Metadata towards an  e-research cyberinfrastructure

Summary

Introducing ARTIST and the authors of this collective work

Main actors operating on French ETDs

Their roles; Their metadata

3 case studies

Creating metadata: thinking about reusability

Thematic survey around biodiversity: thinking vocabularies

Institutional survey: thinking ontologies

Conclusion

Page 3: Metadata towards an  e-research cyberinfrastructure

Authors

Jacques DUCLOY – INIST, ARTIST

Yann Nicolas – ABES

Diane Le Hénaff – INRA

Muriel FOULONNEAU – now CCSD

Luc GRIVEL – Univ. Paris 1

Jean-Paul DUCASSE – Univ. Lyon 2

+ several smaller contributions and comments

Page 4: Metadata towards an  e-research cyberinfrastructure

ARTIST members

Networked workshop

Appropriation by Research Communities of Technologies of Scientific & Technical Information

Community of researchers and engineers in information science and information technologies, working in the research world

http://artist.inist.fr/

Page 5: Metadata towards an  e-research cyberinfrastructure

ARTIST topics

How to build digital library or scholarly publishing applications dealing with e-science?

New approaches to doing research or science in a cyberinfrastructure

Page 6: Metadata towards an  e-research cyberinfrastructure

ARTIST e-Science experiments

Scientific forum

Sample: cooperative linguistic discussion

Carl Lagoze paper about DL: What is a Digital Library?

Scientific journal: AMETIST

Appropriation, Mutualisation, Experimentation

Digital writing

Richer on-line version: experiments become "reproducible"

Paper view: scientific focus and evaluation purpose

Digital Library experiments (metadata -> DL)

Cooperative writing: this article

Page 7: Metadata towards an  e-research cyberinfrastructure

This article: metadata for e-science

Not only for: (scientific) information retrieval

But also for:

+ research evaluation

+ federated digital libraries

+ research policy oriented studies

+ scientific surveys

Page 8: Metadata towards an  e-research cyberinfrastructure

Main entities dealing with French ETDs

Universities, and their national organization

EPST (National Research Institute). Sample: CNRS, 30,000 people

European framework: each country has its own organization, plus European actions (Delos, DRIVER…)

Francophone framework

International framework: Networked Digital Library of Theses and Dissertations (NDLTD), ePrints application profile

Page 9: Metadata towards an  e-research cyberinfrastructure

Translation difficulties showing different approaches

Thèse = ETD: thesis (English context), dissertation (US context)

Veille scientifique = scientific survey (using informetric tools), in order to discover innovations

Pilotage de la recherche = research policy oriented studies; strong role of the French administration

Page 10: Metadata towards an  e-research cyberinfrastructure

Actor: Cyberthèses

[Diagram labels: Cyberdocs, OpenOffice + XSLT, Word + style, XML / TEI, PDF, Jury, ETD]

Page 11: Metadata towards an  e-research cyberinfrastructure

Cyberthèses metadata

Thesis document: XML, TEI-Lite

The XML version must be readable by a human being

Metadata: DC ETD-ms (Electronic Thesis & Dissertation Metadata Standard)

Further related directions: TEI header, LaTeX -> MathML
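
To make the metadata layer concrete, here is a minimal sketch of a thesis description built from plain Dublin Core elements, which ETD-ms extends. The Dublin Core namespace is the standard one; the `record` wrapper element, the helper name `build_thesis_record`, and the sample values are assumptions made for illustration. A real ETD-ms record would add thesis-specific elements that are not shown here.

```python
import xml.etree.ElementTree as ET

# Standard Dublin Core element set namespace.
DC_NS = "http://purl.org/dc/elements/1.1/"
ET.register_namespace("dc", DC_NS)


def build_thesis_record(fields):
    """Build a flat Dublin Core record from a dict of element name -> list of values."""
    # "record" is a hypothetical wrapper element, not part of any standard.
    record = ET.Element("record")
    for name, values in fields.items():
        for value in values:
            child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
            child.text = value
    return record


# Illustrative values only; they do not describe a real thesis.
sample = {
    "title": ["Une thèse d'exemple"],
    "creator": ["Dupont, Marie"],
    "type": ["Electronic Thesis or Dissertation"],
    "language": ["fr"],
}

record = build_thesis_record(sample)
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```

Because every value sits in a named, namespaced element, the same record can be reused by a catalogue, an archive, or a harvester without parsing the thesis itself.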

Page 12: Metadata towards an  e-research cyberinfrastructure

Actor: ABES

Ministry of Education agency

[Diagram labels: Star, University: ETD, Union catalog, Persistent Identifier, Sudoc portal, Preservation, Dissemination (CINES)]

Page 13: Metadata towards an  e-research cyberinfrastructure

Abes / Star metadata

TEF (Thèses Electroniques Françaises)

AFNOR standard (AFNOR is the French member of ISO)

Dublin Core Qualified, with several ETD adaptations (jury…)

METS Rights

Using Schematron

Page 14: Metadata towards an  e-research cyberinfrastructure

Actor: CCSD

Centre pour la Communication Scientifique Directe

[Diagram labels: HAL: national archive, international archive (Arxiv, Driver…), local archive, thematic archive, TEL … Inserm, researcher author, researcher reader]

Page 15: Metadata towards an  e-research cyberinfrastructure

CCSD metadata

At the beginning: local schema

Strong relationship between the author and the university, laboratory, research team

DC export / OAI-PMH
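
A Dublin Core export over OAI-PMH can be sketched as follows. The `ListRecords` verb, the `metadataPrefix=oai_dc` argument, and the namespaces come from the OAI-PMH 2.0 specification; the base URL, the helper names, and the sample response are hypothetical and do not describe the actual CCSD endpoint.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None):
    """Build an OAI-PMH ListRecords request URL."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec
    return base_url + "?" + urlencode(params)


def extract_titles(response_xml):
    """Return every dc:title found in an OAI-PMH response document."""
    root = ET.fromstring(response_xml)
    return [el.text for el in root.iter(f"{{{DC_NS}}}title")]


# Hand-written sample response, reduced to the parts used here.
SAMPLE_RESPONSE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example thesis</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>"""

# Hypothetical repository address.
print(list_records_url("https://repository.example.org/oai"))
print(extract_titles(SAMPLE_RESPONSE))
```

The sketch parses a local string rather than calling a server, so it runs without network access; a real harvester would also follow resumption tokens.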

Page 16: Metadata towards an  e-research cyberinfrastructure

Hal: institutional repository

A French advantage related to open archives: the « Protocole d’accord Universités EPST sur les dépôts par les chercheurs » (agreement between universities and EPST on deposits by researchers)

In CNRS, each researcher must produce an activity report, which is generated by Hal/CCSD

Some scientific heads can require a CCSD deposit

Page 17: Metadata towards an  e-research cyberinfrastructure

Actor: INIST/CNRS

Institut de l’Information Scientifique et Technique

http://www.inist.fr

Pascal & Francis bibliographic databases: 15,000,000 XML records

Scientific portals, scientific information analysis

Vocabularies: termSciences

Page 18: Metadata towards an  e-research cyberinfrastructure

INIST - metadata

Bibliographic records: Exodic

Origin: CCF-based MARC format

Translated into SGML in 1992, now with a DCQ approach

Strong links between authors and affiliations

termSciences: ISO 16642 (TMF)

Page 19: Metadata towards an  e-research cyberinfrastructure

The landscape we would like to have

[Diagram labels: Thesis, Cyberthèses, STAR, Local archive, CCSD, Articles, OAI-PMH, INIST]

Page 20: Metadata towards an  e-research cyberinfrastructure

Case study: sharing theses and their metadata

Thinking about reusability by several actors which interoperate during the ETD life cycle

[Diagram labels: Inra unit, Ecole doctorale, Inra, University, Abes/Star, ETD, Univ. Lab.]

Page 21: Metadata towards an  e-research cyberinfrastructure

Sharing metadata

Administrative metadata are required for a quite complex workflow

Contents must be matched:

A given person could have different names…

Different ways of naming units…

Different classification schemes…
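
The name-matching problem can be illustrated with a small sketch. It reduces each personal name to a crude key (surname plus first initial, accents and case removed) and groups the variants that share a key. The key design, the function names, and the sample names are assumptions made for illustration; multi-word surnames written in "Given Surname" order are not handled, which is one reason real systems rely on authority files or identifiers.

```python
import unicodedata
from collections import defaultdict


def name_key(name):
    """Reduce a personal name to a (surname, first initial) matching key."""
    # Strip accents and case so "Hélène" and "HELENE" compare equal.
    text = unicodedata.normalize("NFKD", name)
    text = "".join(c for c in text if not unicodedata.combining(c)).lower()
    if "," in text:
        # "Surname, Given" order.
        surname, given = [part.strip() for part in text.split(",", 1)]
    else:
        # "Given Surname" order: the last token is taken as the surname.
        parts = text.split()
        if not parts:
            return "", ""
        given, surname = " ".join(parts[:-1]), parts[-1]
    return surname, given[:1]


def group_variants(names):
    """Group name strings that share the same matching key."""
    groups = defaultdict(list)
    for name in names:
        groups[name_key(name)].append(name)
    return dict(groups)


# Hypothetical name variants as they might appear in different systems.
variants = ["Hélène Martin", "MARTIN, H.", "Helene Martin", "Paul Durand"]
groups = group_variants(variants)
for key, members in groups.items():
    print(key, members)
```

The same approach extends to unit names and classification codes, but each needs its own normalization rules and a way to review uncertain matches by hand.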

Page 22: Metadata towards an  e-research cyberinfrastructure

Case study 2: BiodivERsA

BiodivERsA: European research policy network about biodiversity

x * 10 funding agencies

y * 100 research programs

z * 1000 projects

x1 * 10000 results …

Distributed network of CRIS

CRIS: Current Research Information System

Page 23: Metadata towards an  e-research cyberinfrastructure

Vocabulary adaptations

[Diagram labels: CRIS…, Archive, DL, Thematic CRIS, Thematic Archive, Thematic DL, Global DL]

Classification schemes must be matched for computation purposes (funding evaluation)
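
A minimal sketch of why the schemes must be matched before any computation: funding can only be summed per topic once each local class is mapped to a shared class. The crosswalk table, agency names, class codes, and amounts below are all invented for illustration and do not come from BiodivERsA data.

```python
from collections import defaultdict

# Hypothetical crosswalk from two local classification schemes to one shared scheme.
CROSSWALK = {
    ("agency_a", "ECO-01"): "ecosystems",
    ("agency_a", "GEN-02"): "genetics",
    ("agency_b", "Ecosystem services"): "ecosystems",
    ("agency_b", "Population genetics"): "genetics",
}


def funding_by_shared_class(projects, crosswalk):
    """Sum funding per shared class; report projects whose class has no mapping."""
    totals = defaultdict(float)
    unmatched = []
    for project in projects:
        key = (project["source"], project["local_class"])
        shared = crosswalk.get(key)
        if shared is None:
            unmatched.append(project["id"])
            continue
        totals[shared] += project["funding"]
    return dict(totals), unmatched


# Invented project records as two CRIS might expose them.
projects = [
    {"id": "p1", "source": "agency_a", "local_class": "ECO-01", "funding": 100},
    {"id": "p2", "source": "agency_a", "local_class": "GEN-02", "funding": 250},
    {"id": "p3", "source": "agency_b", "local_class": "Ecosystem services", "funding": 75},
    {"id": "p4", "source": "agency_b", "local_class": "Soil fauna", "funding": 40},
]

totals, unmatched = funding_by_shared_class(projects, CROSSWALK)
print(totals)
print(unmatched)
```

Reporting unmatched projects separately, rather than dropping them silently, keeps the gaps in the crosswalk visible to whoever maintains the vocabularies.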

Page 24: Metadata towards an  e-research cyberinfrastructure

Case study 3

Affiliations must be managed with ontologies

[Diagram labels: UHP, CNRS, INRIA; CRIN, Loria, Inria Lorraine, Inria Sophia; YT, Cortex, Oméga, Orpailleur]
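
The affiliation problem can be sketched with a small "part of" graph in which a unit may have several parents. A plain dictionary stands in for an ontology here, and the edges are assumptions chosen for illustration: the flattened diagram does not state which unit belongs to which institution.

```python
# Hypothetical "part of" links between units named on the slide.
PART_OF = {
    "Loria": {"UHP", "CNRS", "Inria Lorraine"},
    "Inria Lorraine": {"INRIA"},
    "Inria Sophia": {"INRIA"},
    "Cortex": {"Loria"},
    "Orpailleur": {"Loria"},
    "Oméga": {"Inria Sophia"},
}


def ancestors(unit, part_of):
    """Return every unit reachable from `unit` by following "part of" links."""
    seen = set()
    stack = [unit]
    while stack:
        current = stack.pop()
        for parent in part_of.get(current, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


# A thesis prepared in one team is credited to all institutions above it.
print(sorted(ancestors("Cortex", PART_OF)))
print(sorted(ancestors("Oméga", PART_OF)))
```

A flat affiliation string cannot express that one team counts for several institutions at once, which is why a structured model is needed for institutional surveys.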

Page 25: Metadata towards an  e-research cyberinfrastructure

A technical conclusion

Metadata (DCQ) is good but not sufficient

We need:

Vocabulary adaptations, ontologies

Sharing several repositories (vocabularies, affiliations…)

Managing metadata history

etc.

Page 26: Metadata towards an  e-research cyberinfrastructure

The final conclusion

We need to help people work together and make compromises

We need researchers to become owners of their scientific information system…

We need librarians to appropriate technologies and to help researchers acquire a librarian's sensibility

We need computer science engineers to appropriate library and publishing issues

That is what we try to help do with ARTIST

Page 27: Metadata towards an  e-research cyberinfrastructure

Thank you for listening

Thank you for your questions…