Project Management
NSF DataNet site visit to MITFebruary 8, 2010
DataSpace
February 8. 2010 1NSF Site Visit to MIT DataSpace
• Data Protection, Security• Distributed Policy Management • Data Analysis, Visualization• Data Analytics • Data Sharing Policy and Legal Advise• Data Quality• Data Semantics, Discovery• Data Interoperability, Integration• Data Storage Architecture• Data Curation Workflows
DataSpace Advisory Board (external, international,
10-15 members)
DataSpace Business Development Team
(DBDT)
Research & Prototyping
Development & Operations
DataSpace PI
DataSpace Project Director
Management Board(PIs & Senior Personnel)
Sloan School of Management Finance
& Administration
• Cyberinfrastructure Architecture • Software Design & Development • Infrastructure Planning • Data Curation Operations • Technology Operations • Service Modeling • Business Modeling
Marketing & Outreach
• Communication • Coordination • User Needs Assessment • Usability and Feedback • Public outreach (‘citizen science’) • Educational outreach • Scholarly Publishing outreach
Other Cyberinfrastructure
Partners
Other Sectors (e.g. finance,
pharma, health care, insurance)
DataNet Partners
Outside Partners
DataSpace Organizational Structure (preliminary)
February 8. 2010 2NSF Site Visit to MIT DataSpace
DataSpace Personnel
Management TeamStuart Madnick, Sloan School of Management (PI, faculty)*Hal Abelson, Electrical Engineering & Computer Science (co-PI, faculty)Ed DeLong, Civil & Environmental Engineering (co-PI, faculty)John Gabrieli, Brain & Cognitive Sciences (co-PI, faculty)Marilyn T. Smith, Information Services &Technology (co-PI, administration)MacKenzie Smith, Libraries (co-PI, administration)*Executive Director: to be hired (staff)*Additional Project Management Board Members (Ann Wolpert, Claude
Canizares, others to be named)
*DataNet funding provided
February 8. 2010 3NSF Site Visit to MIT DataSpace
DataSpace Personnel
Research & Prototyping TeamStuart Madnick, MIT Sloan School of Management (PI, faculty)Michael Siegel, MIT Sloan School of Management (research staff)*Tom Malone, MIT Sloan School of Management (faculty)Hal Abelson, Tim Berners-Lee, Danny Weitzner*, MIT CSAIL (faculty)David Karger, MIT CSAIL (faculty)Mei Hsu, Dejan Milojicic, Joe Pato, HP Labs (researcher)Stephen Todd, EMC (researcher)Alon Halevy, Google (researcer)Steve White, Microsoft (researcher)9 MIT Research Assistants* funded by DataNet
*DataNet funding provided
February 8. 2010 4NSF Site Visit to MIT DataSpace
DataSpace Personnel
Development & Operations TeamRichard Rodgers, MIT LibrariesAnne Silvester, MIT IS&TBrad McLean, DuraSpaceLead developer*: to be hiredSoftware developers, MIT (2.5)*: to be hiredSoftware developers, Georgia Tech, Rice, OSU (3)*: to be hiredMIT Data Coordinator, Data Operator*: to be hiredScience Commons expert*: to be hiredMark Pearrow, Neuroscience data expert*Data Curators, MIT (2)*: to be hiredData Curators, Georgia Tech, Rice, OSU (2.5)*: to be hiredData workflow analysts (undergrads), Rice University*: to be hired
*DataNet funding providedFebruary 8. 2010 5NSF Site Visit to MIT DataSpace
DataSpace Personnel
Marketing & Outreach TeamMichele, Kimpton, DuraSpaceDuraSpace outreach coordinator*: to be hired John Wilbanks, Science CommonsMacKenzie Smith, MITGeneva Henry, Rice University*Tyler Walters, Georgia Institute of TechnologyTerry Reese, Oregon State UniversityWei Lee Woon, Masdar InstituteMIT Data Curators (2): to be hiredData Curators, Georgia Tech, Rice, OSU (2.5): to be hiredChris Merlan, OpenCourseWare
*DataNet funding provided
February 8. 2010 6NSF Site Visit to MIT DataSpace
DataSpace Personnel
Business Development TeamMichael Siegel, MIT Sloan School of ManagementDataSpace PI and co-PIsDataSpace Executive Director: to be hiredMichele Kimpton, DuraSpaceGeneva Henry, Rice
Others to be selected from science participants, technology partners, institutional partners, MIT administration, advisory board
February 8. 2010 7NSF Site Visit to MIT DataSpace
Staff Breakdown
• 6 PIs and co-PIs • 23 Senior Personnel (faculty and senior staff)• 9 graduate student RAs, 2 undergrad staff• 16 FTE staff across 6 organizations
– 7.5 data curators– 6.5 developers– 2 specialists (for DuraSpace and Science Commons
February 8. 2010 NSF Site Visit to MIT DataSpace 8
Advisory Board• Christine L. Borgman, Professor & Presidential Chair in Information Studies, Graduate School of Education
and Information Science, University of California, Los Angeles
• Randy Buckner, Principal Investigator of the Cognitive Neuroscience Laboratory (CNL) and Professor of Psychology at Harvard University
• Scott Doney, Senior Scientist in Marine Chemistry & Geochemistry at Woods Hole Oceanographic Institution
• Keith Jeffery, President, European Research Consortium of Informatics and Mathematics (ERCIM) and Director, Information Technology and International Strategy, UK Rutherford Appleton Laboratory
• Liz Lyon, Director, UKOLN and Associate Director, UK Digital Curation Centre (DCC)
• Ed Roberts, David Sarnoff Professor of Management of Technology, MIT Sloan School of Management; MIT Technological Innovation & Entrepreneurship Program; and MIT Entrepreneurship Center
• Pam Samuelson, Professor, University of California at Berkeley School of Information and School of Law
• Dan Schutzer, Director, Financial Services Technical Consortium (FSTC)
• Andrew Treloar, Director and Chief Architect, ARCHER Project, Australian National Data Service (ANDS), Monash University, Clayton, Australia
• Wanda Orlikowski, Eaton-Peabody Professor of Communication Sciences at MIT, and Professor of Information Technologies and Organization Studies at MIT Sloan School of Management
February 8. 2010 9NSF Site Visit to MIT DataSpace
DataSpace Year 1 Project Plan in Development
February 8. 2010 NSF Site Visit to MIT DataSpace 10
DataSpace 5-Year Timelinein Development
February 8. 2010 NSF Site Visit to MIT DataSpace 11
Apr-11OSS release
7/1/2010 7/31/2015
Mar-11Initial ingest MIT fMRI,
oceanographic data
Jan-11DataSpace 0.1 operational;
DataSpace 1.0 architecture complete
Jul-12New domain data
ingested (e.g. nanotechnology)
Jan-13DataSpace 1.2
operational
Jul-11Data enhancement cycle;
Microsoft federates MIT data
Sep-11GT, OSU, Rice
operational
Mar-12New Partners
added
Jan-12DataSpace 1.1
operational
Dec-11GT, OSU, Rice
data loaded Dec-12Ad hoc data
submissions begin
Mar-14Non- MIT DataSpace
implementation
Jan-14DataSpace 1.3
operational
Jan-15DataSpace 1.4
operational
Jul-12Non-MIT DataSpace
operational
Mar-15Non-research-sector
DataSpace operational
Nov-10MIT data appraisal complete;
initial ontologies identified
Jul-13New partner data federated
Jul-14New partner data federated
Jun-11Base catalog
deployed
Mar-12Data Enhancement cycle;
Federated data launch
Nov-11Masdar
operational
Jul-11DataSpace 1.0
operational
Some Key Goals for Year 1• Complete staffing, project plan• Deploy DataSpace v0.1 (interim architecture)
– Build on existing software base (DSpace 2.0, Fedora)• Ingest of initial Neuroscience and Biological Oceanography data
– Selection/development of ontologies– Recording of metadata (including preservation policies, etc.)
• Design, Develop and Deploy DataSpace v1.0– Addition of new DataSpace middleware– Service models defined with partners
• Initial Dataset Catalog Deployed • Initial results of Business Development Management Team• Begin Educational and Outreach efforts (i-schools, OCW)
February 8. 2010 12NSF Site Visit to MIT DataSpace