the future of the journal and applications in an open scientific ecosystem
DESCRIPTION
Vishal Gupta + Anita de Waard at AI Mashup session at ESWC2010TRANSCRIPT
Elsevier- Open to Accelerate Science
The Future of the Journal, and
Applications in an Open Scientific Ecosystem
Anita de Waard , [email protected] Disruptive Technologies Director, Elsevier Labs
Vishal Gupta, [email protected] Head of Developer Programs, Elsevier
7th Extended Semantic Web Conference
May 31, 2010
2
Science is made of information...
2
Science is made of information...
...that gets created...
2
Science is made of information...
...that gets created... ... and destroyed.
3
What is the problem?
3
What is the problem?
1.Researchers can’t keep track of their data.
3
What is the problem?
1.Researchers can’t keep track of their data.
2.Data is not stored in a way that is easy for authors.
3
What is the problem?
1.Researchers can’t keep track of their data.
2.Data is not stored in a way that is easy for authors.
3.For readers, article text is not linked to the underlying data.
4
Workflow tools to the rescue!
4
Workflow tools to the rescue!
http://MyExperiment.org
4
Workflow tools to the rescue!
http://MyExperiment.org
http://VisTrails.org
4
Workflow tools to the rescue!
http://wings.isi.edu/
http://MyExperiment.org
http://VisTrails.org
The Vision
5
Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
All data items created in the lab are added to the workflow system.
The Vision
5
Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
All data items created in the lab are added to the workflow system.
The Vision
5
Each item in the system has metadata (including provenance) and relations to other data items added to it.
Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
All data items created in the lab are added to the workflow system.
The Vision
5
Each item in the system has metadata (including provenance) and relations to other data items added to it.
Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
When a paper is published, a slice of this information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.
Rats were subjected to two grueling tests...
see figure 2 for more details (click on figure to see data)
All data items created in the lab are added to the workflow system.
The Vision
5
Each item in the system has metadata (including provenance) and relations to other data items added to it.
Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
When a paper is published, a slice of this information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.
Rats were subjected to two grueling tests...
see figure 2 for more details (click on figure to see data)
Applications run on this ‘exposed data’ universe. Rats were subjected to two grueling tests...
see figure 2 for more details (click on figure to see data)
Rats were subjected to two grueling tests...
see figure 2 for more details (click on figure to see data)
Some other publisher
6
The vision (same thing, now in words):
6
The vision (same thing, now in words): 1. All data items created in the lab (including all measurements,
graphs, emails, talks: everything!) gets included in a workflow system.
6
The vision (same thing, now in words): 1. All data items created in the lab (including all measurements,
graphs, emails, talks: everything!) gets included in a workflow system.
2. Each item in the system has a proper set of tags - including identification of provenance and authorship - and relations to other data items.
6
The vision (same thing, now in words): 1. All data items created in the lab (including all measurements,
graphs, emails, talks: everything!) gets included in a workflow system.
2. Each item in the system has a proper set of tags - including identification of provenance and authorship - and relations to other data items.
3. When a paper is published, a slice of this information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.
6
The vision (same thing, now in words): 1. All data items created in the lab (including all measurements,
graphs, emails, talks: everything!) gets included in a workflow system.
2. Each item in the system has a proper set of tags - including identification of provenance and authorship - and relations to other data items.
3. When a paper is published, a slice of this information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.
4. Applications run on this ‘exposed data’ universe.
6
The vision (same thing, now in words): 1. All data items created in the lab (including all measurements,
graphs, emails, talks: everything!) gets included in a workflow system.
2. Each item in the system has a proper set of tags - including identification of provenance and authorship - and relations to other data items.
3. When a paper is published, a slice of this information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.
4. Applications run on this ‘exposed data’ universe. 5. Everything lives in the cloud.
7
What is needed to get there?
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
E. Publishing systems that run as application servers.
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
E. Publishing systems that run as application servers.
tool builders
standards bodies
institutes, funding bodies, individuals
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
E. Publishing systems that run as application servers.
tool builders
standards bodies
institutes, funding bodies, individuals
publishers
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
E. Publishing systems that run as application servers.
tool builders
standards bodies
institutes, funding bodies, individuals
publishers
publishers
7
What is needed to get there?
A. Tools: Workflow tools that work for all science, are scalable, safe, and user-friendly
B. Metadata standards: Standards that allow interoperable exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights
C. Social change: Scientists need to realize they should annotate their work
D. Semantic/Linked Data space at the publisher end.
E. Publishing systems that run as application servers.
tool builders
standards bodies
institutes, funding bodies, individuals
publishers
publishers
8
Linked Data for Elsevier
10
8
Linked Data for Elsevier
<ce:section id=#123>
10
8
Linked Data for Elsevier
<ce:section id=#123> mice like cheesethis says
10
8
Linked Data for Elsevier
<ce:section id=#123>
said @anita on May 31 2010
mice like cheesethis says
10
8
but we all know she was jetlagged then
Linked Data for Elsevier
<ce:section id=#123>
said @anita on May 31 2010
mice like cheesethis says
10
8
but we all know she was jetlagged then
Linked Data for Elsevier
<ce:section id=#123>
said @anita on May 31 2010
immutable, $$, proprietary
mice like cheesethis says
10
8
dynamic, personal, task-driven, - open?
but we all know she was jetlagged then
Linked Data for Elsevier
<ce:section id=#123>
said @anita on May 31 2010
immutable, $$, proprietary
mice like cheesethis says
10
9
Semantic annotation grid
11
9
Semantic annotation grid
11
9
Semantic annotation grid
document
claim
triple
entity
collectionGranularity
11
9
Semantic annotation grid
document
claim
triple
entity
collectionGranularity
reader/data miningtypesetter/productionauthor/editorMoment
measure
11
9
Semantic annotation grid
automated
manual
semi-automated
Means
document
claim
triple
entity
collectionGranularity
reader/data miningtypesetter/productionauthor/editorMoment
measure
11
9
Semantic annotation grid
Automated Copy Editing
automated
manual
semi-automated
Means
document
claim
triple
entity
collectionGranularity
reader/data miningtypesetter/productionauthor/editorMoment
measure
11
9
Semantic annotation grid
Automated Copy Editing
Reflect
automated
manual
semi-automated
Means
document
claim
triple
entity
collectionGranularity
reader/data miningtypesetter/productionauthor/editorMoment
measure
11
10
.XMP RDF in all our PDFs: DC + PRISM
12
Scientific Applications, Open APIs and a New Publishing Ecosystem
“ If I were to guess what Web 3.0 is, I would tell you that it’s a different way of building applications…
My prediction would be that Web 3.0 will ultimately be seen as applications which are pieced together.
There are a number of characteristics: the applications are relatively small, the data is in the cloud, the applications can run on any device, PC or mobile phone, the applications are very fast and they’re very customizable.
Furthermore, the applications are distributed virally: literally by social networks, by email. You won’t go to the store and purchase them… That’s a very different application model than we’ve ever seen in computing. ” - Eric Schmidt
CEO Google
We conducted 3,000 interviews with researchers,
librarians and developers
Librarian feedback
“This is just amazing. What faculty is really after is for something that ties this all together, so it’s all in one place. This makes it really easy for them.”
“Apps (interacting) with results are very important to help save time... apps integrated into article such as the pop-up example is also very interesting…”
Researcher feedback
Developer feedback
19
“Holy ****…you are clearly aware of what the web really looks like, I’m very impressed with that. I haven’t seen anything so far that comes anywhere close to what you have done …I would love to help out in any capacity…”
Open APIs for applications
Developers can gain recognition and revenues
Institutions can become focal point for applications
Researchers can save time, improve their information discovery process and innovate
An ecosystem open to accelerate science
App Integration in Science Direct
Thank You
Anita de Waard