research overview gagan agrawal associate professor

9

Research Overview Gagan Agrawal Associate Professor

Upload: lorin-watkins

Post on 18-Jan-2018

216 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

An Overall Vision Our world will be full of distributed and dynamic data sources High speed networking (Grid computing) Sensor networks, mobile systems, embedded devices Processing this information involves many challenges A lot of data, distributed Often, continuous data streams (can’t store all data, real- time processing constraint) Complex interplay of communication and computational costs Application programmers want more transparency

TRANSCRIPT

Page 1: Research Overview Gagan Agrawal Associate Professor

Research Overview

Gagan Agrawal Associate Professor

Page 2: Research Overview Gagan Agrawal Associate Professor

Personnel Involved Ph.D student

Liang Chen Wei Du Ruoming Jin Feng Li (Jointly with Joel Saltz) Xiaogang Li

Masters (thesis) student Ge Yang

Undergrad student Leo Glimcher

Faculty collaborations: Joel Saltz, Tahsin Kurc, Umit Catalyurek, Srini Parthasarathy, Raghu Machiraju

Page 3: Research Overview Gagan Agrawal Associate Professor

An Overall Vision Our world will be full of distributed and dynamic

data sources High speed networking (Grid computing) Sensor networks, mobile systems, embedded devices

Processing this information involves many challenges

A lot of data, distributed Often, continuous data streams (can’t store all data, real-

time processing constraint) Complex interplay of communication and computational

costs Application programmers want more transparency

Page 4: Research Overview Gagan Agrawal Associate Professor

Research Projects Compilers: Compiling XQuery (Query Language for

XML data), Compiling for a distributed heterogeneous (grid) environment, parallelizing scientific data intensive and data mining codes

Middleware and Runtime Support: FREERIDE (Framework for Rapid Implementation of Datamining Engines), ongoing work on distributed processing of data streams

Data mining and OLAP algorithms: Mining for streaming data, Parallel and scalable mining algorithms, OLAP algorithms

Page 5: Research Overview Gagan Agrawal Associate Professor

Compiling Data Intensive Applications for a Grid Environment

Page 6: Research Overview Gagan Agrawal Associate Professor

Compiling XQuery Vision: XML has become an accepted standard

for distribution of datasets XQuery is the well-accepted high-level query

language for querying and processing XML datasets

Compiling complex data-intensive reduction operations written in XQuery

Reductions written using recursion Data-centric execution strategies Using XML Schemas to describe the datasets -

Page 7: Research Overview Gagan Agrawal Associate Professor

System Support for Data Mining in a Parallel Environment

Clusters of SMPs

Data Parallel Java

Compiler Techniques

MPI+Posix Threads+File I/O

FREERIDE(middleware)

Runtime Techniques

Page 8: Research Overview Gagan Agrawal Associate Professor

Distributed Processing of Data Streams Processing continuous data streams arising from

distributed sources A number of system and algorithmic challenges

Real time requirement on processing rate – tradeoffs between accuracy of analysis and efficiency

Placement of data – obviously want to process an individual stream close to the source of data

Feedback based control of accuracy – cannot allow any computational or communication stage to become the bottleneck

Performance modeling: impact of output size, level of sampling etc. on performance

Recently started work in this area ….

Page 9: Research Overview Gagan Agrawal Associate Professor

Algorithms for Mining and OLAP Decision tree construction for streaming data:

new one-pass algorithm with statistical accuracy bound

Parallel and scalable decision tree construction: use sampling, but without losing accuracy

Data cube construction: Parallel algorithms with optimal communication

volume Tiling based algorithms for scaling output sizes

A Tool for Supporting Integration Across Multiple Flat-File Datasets Xuan Zhang, Gagan Agrawal Ohio State University

Transitioning to Semesters CSE MS Program Prof. Gagan Agrawal Grad Studies Chair

Light-Weight Data Management Solutions for Scientific Datasets Gagan Agrawal, Yu Su Ohio State Jonathan Woodring, LANL

Shared Memory Parallelization of Decision Tree Construction Using a General Middleware Ruoming Jin Gagan Agrawal Department of Computer and Information

Introduction to CSE PhD Program Prof. Gagan Agrawal Grad Studies Chair

Performance Issues in Parallelizing Data-Intensive applications on a Multi-core Cluster Vignesh Ravi and Gagan Agrawal {raviv,agrawal}@cse.ohio-state.edu

Elastic Cloud Caches for Accelerating Service-Oriented Computations Gagan Agrawal Ohio State University Columbus, OH David Chiu Washington State University

1 Data Mining over the Deep Web Tantan Liu, Gagan Agrawal {liut,agrawal}@cse.ohio-state.edu Ohio State University April 12, 2011

CSE PhD Program Prof. Gagan Agrawal Grad Studies Chair

Effective Automatic Parallelization and Locality ... · and entertaining. I would also like to thank Gagan (Prof. Gagan Agrawal) for being a very helpful and accessible Graduate Studies

Ohio State University Department of Computer Science and Engineering 1 Tools and Techniques for the Data Grid Gagan Agrawal

Implementing Data Cube Construction Using a Cluster Middleware: Algorithms, Implementation Experience, and Performance Ge Yang Ruoming Jin Gagan Agrawal

1 A Grid-Based Middleware for Processing Distributed Data Streams Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering

Data-Intensive Computing: From Multi-Cores and GPGPUs to Cloud Computing and Deep Web Gagan Agrawal u

Efficient Evaluation of XQuery over Streaming Data Xiaogang Li Gagan Agrawal The Ohio State University

Computer Science and Engineering FREERIDE-G: A Grid-Based Middleware for Scalable Processing of Remote Data Leonid Glimcher Gagan Agrawal

Ex-MATE: Data-Intensive Computing with Large Reduction Objects and Its Application to Graph Mining Wei Jiang and Gagan Agrawal

A Dynamic Scheduling Framework for Emerging Heterogeneous Systems Vignesh Ravi and Gagan Agrawal

1 Using Tiling to Scale Parallel Datacube Implementation Ruoming Jin Karthik Vaidyanathan Ge Yang Gagan Agrawal The Ohio State University

Smita Vijayakumar Qian Zhu Gagan Agrawal 1. Background Data Streams Virtualization Dynamic Resource Allocation Accuracy Adaptation Research

HiPC 2010 AN INTEGER PROGRAMMING FRAMEWORK FOR OPTIMIZING SHARED MEMORY USE ON GPUS Wenjing Ma Gagan Agrawal The Ohio State University

Assigning Schema Labels Using Ontology and Heuristics Xuan Zhang, Rouming Jin, Gagan Agrawal

Modeling and Adaptive Scheduling of Large-Scale Wide-Area Data Transfers Raj Kettimuthu Advisors: Gagan Agrawal, P. Sadayappan

HPDC 2014 Supporting Correlation Analysis on Scientific Datasets in Parallel and Distributed Settings Yu Su*, Gagan Agrawal*, Jonathan Woodring # Ayan

Compiler Supported High-level Abstractions for Sparse Disk-resident Datasets Renato Ferreira Gagan Agrawal Joel Saltz Ohio State University

Doctor of Philosphy in Computer and Cyber Sciences...GAGAN AGRAWAL Gagan Agrawal is a Professor in School of Computer and Cyber Sciences at Augusta University. Agrawal received his

ValuePack: Value-Based Scheduling Framework for CPU-GPU Clusters Vignesh Ravi, Michela Becchi, Gagan Agrawal, Srimat Chakradhar

High-level Interfaces and Abstractions for Data-Driven Applications in a Grid Environment Gagan Agrawal Department of Computer Science and Engineering

Auspice: AUtomatic Service Planning in Cloud/Grid Environments David Chiu Dissertation Defense May 25, 2010 Committee: Prof. Gagan Agrawal, Advisor Prof

Towards Methods for Systematic Research On Big Data Manirupa Das, Renhao Cui, David R. Campbell, Gagan Agrawal, Rajiv Ramnath

Compiler Support for Exploiting Coarse-Grained Pipelined Parallelism Wei Du Renato Ferreira Gagan Agrawal Ohio-State University

Ohio State University 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan Ferhatosmanoglu Xutong Niu Ron Li Keith Bedford

Compiler (and Runtime) Support for CyberInfrastructure Gagan Agrawal (joint work with Wei Du, Xiaogang Li, Ruoming Jin, Li Weng)

Graphic1 Kondhwa Road Katra Kondhwa Road Gagan Legacy Gagan Centrum Gagan Renaissance Gagan LaWish Happinest Gagan©3a Gagan Nurlfe Vestawoods Bishops School O Cascades Gagan institute

A Map-Reduce System with an Alternate API for Multi-Core Environments Wei Jiang, Vignesh T. Ravi and Gagan Agrawal