readability metrics for network visualization

21
iOpener Workbench: Tools for Rapid Understanding of Scientific Literature Cody Dunne, Ben Shneiderman, Bonnie Dorr & Judith Klavans {cdunne, ben, bonnie}@cs.umd.edu, [email protected] 27 th Annual Human-Computer Interaction Lab Symposium May 27-28, 2010 College Park, MD

Upload: akashag11111

Post on 06-Jul-2015

311 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Readability Metrics for Network Visualization

iOpener Workbench: Tools for Rapid Understanding of Scientific Literature

Cody Dunne, Ben Shneiderman, Bonnie Dorr & Judith Klavans{cdunne, ben, bonnie}@cs.umd.edu, [email protected]

27th Annual Human-Computer Interaction Lab Symposium

May 27-28, 2010 College Park, MD

Page 2: Readability Metrics for Network Visualization

iOpener Workbench

Page 3: Readability Metrics for Network Visualization

Contribution

• Infrastructure for rapidly summarizing scientific endeavor

– Integrate statistics, visualization, reference management, and automatic summarization

– Multiple coordinated views

Page 4: Readability Metrics for Network Visualization

Use Cases

• Learn about new fields

• Understand how communities form

• Analyze citation patterns within communities

• Easily explore & export all papers in a community

Page 5: Readability Metrics for Network Visualization

What we integrate

• Potent network analysis tool – SocialAction

– Citation network statistics & visualization

– Automatic community detection & visualization

• Reference & document management – JabRef

– Powerful reference manager with extensive features for search, grouping, review, annotation, and export

• Document view with citation linking & highlight

• Automatically generated summaries

– Citation text, keywords, abstracts

Page 6: Readability Metrics for Network Visualization

What can you do with a graph?

• Statistics, lists, and text is helpful, but

• Visualizations show unexpected trends, clusters, gaps, outliers

• Data cleaning & verification

• “Information visualization answers questions you didn't know you had” – Ben S.

Page 7: Readability Metrics for Network Visualization

Importance of Survey Articles

• Rapidly expanding disciplines• Large volume of scientific publications• Increasing cross-disciplinary research• Need for accurate surveys of previous work

– Short summaries– In-depth historical notes

• Multiple users– Scientists– Students & Educators– Government decision makers

Page 8: Readability Metrics for Network Visualization

iOPENER

• NSF Info Integration & Informatics program

• Information Organization for PENningExpositions on Research

Page 9: Readability Metrics for Network Visualization

Components

• Bibliometric lexical link mining

• Automatic summarization techniques

• Visualization tools for structure and content

Page 10: Readability Metrics for Network Visualization

Ongoing Work

• Increase preprocessing of citation texts to vastly improve trimmer summary comprehension

• Preliminary case studies with UMD student domain experts

– Dependency parsing subset of the ACL Anthology Network (AAN)

Page 11: Readability Metrics for Network Visualization

Coming Soon

• Multi-dimensional in-depth long-term case studies

– longitudinal case studies with domain experts using their data

– close participant observation

• Software & generated surveys publicly available and presented to academia and wider audiences

Page 12: Readability Metrics for Network Visualization

iOpener Workbench

• Infrastructure to aid rapid summarization of scientific literature

• Integrates

– Statistics

– Visualization

– Reference management

– Automatic summarization

Page 13: Readability Metrics for Network Visualization

iOpener Workbench: Tools for Rapid Understanding of Scientific Literature

Cody Dunne, Ben Shneiderman, Bonnie Dorr & Judith Klavans{cdunne, ben, bonnie}@cs.umd.edu, [email protected]

tangra.si.umich.edu/clair/iopener

This work has been partially supported by NSF grant "iOPENER: A Flexible Framework to Support Rapid Learning in Unfamiliar Research

Domains", jointly awarded to UMD and UMich as IIS 0705832.

Page 14: Readability Metrics for Network Visualization

Network Analysis

Page 15: Readability Metrics for Network Visualization

Reference Manager

Page 16: Readability Metrics for Network Visualization

Document & Citation View

Page 17: Readability Metrics for Network Visualization

Summarization

Page 18: Readability Metrics for Network Visualization

Features – Network analysis

• SocialAction (Perer, Shneiderman)

• Citation network visualization – Force-directed placement (by linkages)

• Scatterplots of paper attributes & statistics

• Statistics rank tables

• Categorial and numerical range coloring

• Automatic community detection – Newman '04 fast heuristic

Page 19: Readability Metrics for Network Visualization

Features – Reference Manager

• Search by field with simple regex– abstract|keywords=nonprojective and year = 2008

• Grouping -- automatic, search results, manual

• DOI/URL, fulltext (annotated PDF, plain text)

• Metadata, abstracts

• User generated reviews

• BibTeX, Word, OpenOffice integration

• HTML, EndNote export

Page 20: Readability Metrics for Network Visualization

Document view - features

• Citation links

• Highlighting

Page 21: Readability Metrics for Network Visualization

Summarization - Features

• Automatically generated summariesCitation text, keywords, abstracts

• Working to substantially improve coherence & relevance