building national cyberinfrastructure alan blatecky office of cyberinfrastructure

15
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21, 2012 1

Upload: lark

Post on 10-Feb-2016

28 views

Category:

Documents


0 download

DESCRIPTION

Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21, 2012. 1. Framing the Challenge: Science and Society Transformed by Data. Modern science Data- and compute-intensive Integrative, multiscale - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

1

Building National Cyberinfrastructure

Alan BlateckyOffice of Cyberinfrastructure

EPSCoR MeetingMay 21, 2012

1

Page 2: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Framing the Challenge:Science and Society Transformed by

Data Modern science

Data- and compute-intensive

Integrative, multiscale Multi-disciplinary

Collaborations for Complexity Individuals, groups,

teams, communities Sea of Data

Age of Observation Distributed, central

repositories, sensor- driven, diverse, etc 2

Page 3: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Explosive Growth in Size, Complexity, and Data Rates

Enormous data sets are being generated by modern experiments and observations

Automatic extraction of new knowledge about the physical, biological and cyber world continues to accelerate

Infusion of data-intensive computation into science, engineering and education is revolutionizing research

Multi-cores, concurrent and parallel algorithms, virtualization and advanced server architectures will enable data mining and machine learning, and new approaches for innovation and discovery

Page 4: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

4

Computer Architecture Trends Continuing growth in number of cores Increased use of hybrid accelerators Advances in interconnect technologies will

slow; more complex memory subsystems will be deployed

Power consumption becoming ever more important because of cost and performance

Application performance will be dominated by data movement

Clouds and data centers will play an increasingly larger role in data and compute infrastructure

Page 5: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

5

Software Challenges Simulation and model scalability is a major

requirement for algorithm research and development

Parallel programming research is required to address order of magnitude changes in compute resources

New operating systems, architectures, file systems research, fault tolerance, verification and validation, complex simulation, and cybersecurity

Inadequate numbers of software workforce and expertise being produced

Focus on sustainability and usability is essential

Page 6: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Grand Challenge

CommunitiesLearning & WorkforceDevelopment

Campus Bridging,Cybersecurity

AdvancedComputationalInfrastructure

Data

Scientific Instruments

Software

Innovation,Discovery

CIF21: Cyberinfrastructure Framework for 21st Century Science and Engineering

6

Page 7: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Scientific Data Challenges

7

Byt

es p

er d

ay

2012 2020

Genomics

LHC

TeraGrid, BlueWaters

SquareKilometer

Array

Genomics

LHC

Climate, Environment

LSST

ExaBytes

PetaBytes

TeraBytes

GigaBytes

Climate, Environment

Volume/Growth

Useful

Lifetime

Distribution

Data Access

Many smaller datasets…

Page 8: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

NSF Data strategy Establish a national data infrastructure to support

science, engineering and education Ensure that this infrastructure stays at the most

advanced state of sophistication and is sustainable

Support transformative interdisciplinary and collaborative research stimulated by data

Development of the next generation of compute and data intensive workforce

Development of a suite of policies for data, software, publications and other digital outputs

8

Page 9: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Advanced Computing Infrastructure Strategy

Foundational research to fully exploit parallelism and concurrency through innovations

Applications research and development in high end computing resources

Building, testing and deploying innovative resources in a collaborative environment

Development of comprehensive education and workforce programs

Development of grand challenge community programs

9

Page 10: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Scientific Software Elements:Small groups, individuals

Scientific Software Integration:Research Communities

Scientific Software Innovation Institutes:Large Multidisciplinary GroupsMulti-year

Creating Scalable SoftwareDevelopment Environments

Create a software ecosystem that scales from individual or small groups of software innovators to large hubs of software excellence

Focus on innovation Focus on sustainability10

Page 11: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Cyber-infrastructure: EarthCube

Goal: to transform the conduct of research in geosciences by supporting community-based cyberinfrastructure to integrate data and information for knowledge management across the Geosciences.

Community: More than 900 members subscribed to EarthCube web site. Second Charette: June 12-14

GEO-OCI Partnership11

Page 12: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Some observations Science and Scholarship are team sports Collaboration/partnerships will change significantly

Growth of dynamic coalitions and virtual organizations International collaboration becomes ever more important

Innovation and discovery will be driven by analysis Mining vast amounts of new and disparate data Collaboration and sharing of information

Mobility and personal control will continue to drive innovation and research communities

Gaming, virtualization and social networking will transform the way we do science, research and education

12

Page 13: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

EPSCoR Cyberinfrastructure Suggestions

Become a provider rather than just a userContribute to XSEDE as a resource Build and coordinate data collections &

resources Issue is not scale, but capability,

diversity and multi-disciplinarity Coordinate efforts, develop

collaborative projects, practice community building

Focus on Education, especially CDS&E 13

Page 14: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

• Data Infrastructure Building Blocks (DIBBs)• Software Infrastructure for Sustained Innovation (SI2)• EarthCube• Core Techniques and Technologies for Advancing Big Data

Science & Engineering (BIG DATA)• Computational and Data Intensive Science and Engineering in

the Mathematical and Physical Sciences (CDS&E)• Campus Cyberinfrastructure - Network Infrastructure and

Engineering Program (CC-NIE)• Science, Engineering and Education for Sustainability NSF-

Wide Investment (SEES)• Integrative Graduate Education and Research Traineeship, or

IGERT (CIF21 Track)

14

Solicitations that build Cyberinfrastructure

Page 15: Building National  Cyberinfrastructure Alan Blatecky Office of  Cyberinfrastructure

Discussion

15