programming in hadoop guangda hu [email protected] huayang guo [email protected]

15

Programming in Hadoop Guangda HU [email protected] Huayang GUO [email protected]

Upload: grant-moody

Post on 17-Jan-2016

214 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

TRANSCRIPT

Page 1: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Programming in Hadoop

Guangda [email protected]

Huayang [email protected]

mailto:[email protected]

mailto:[email protected]

Page 2: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Hadoop Overview

• About Hadoop– Apache Hadoop is a Java software framework that

supports data-intensive distributed applications under a free license. It enables applications to work with thousands of nodes and petabytes of data.

Page 3: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Hadoop Overview

• Architecture– HDFS (Hadoop Distributed File System)– Job Tracker– Task Tracker

Page 4: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Hadoop Overview

• Mechanism– Map and Reduce

Page 5: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Hadoop Overview

• Applications– Facebook (Hadoop, Hive, Scribe)– Yahoo! (Hadoop in Yahoo Search)– Veritas (San Point Direct, Veritas File System)– IBM Transarc (Andrew File System)– UW Computer Science Alumni (Condor Project)

Page 6: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Our Work

• Setup running environment– Single node setup– Multi-node cluster setup– Network access

• Experiments and analysis– Word count– Integration– Largest number

Page 7: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Environment Setup

• Hardware– Two multi-core machines with Linux– Ethernet connection

• Software– Ubuntu 9.04– Hadoop 0.20.1– Five virtual machine on VirtualBox

Page 8: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Environment Setup

• Cluster structure– Two machines

• 166.111.69.85• 59.66.132.161

– One master node– Three slave nodes

Page 9: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Experiments

• Benchmark– Word count (default example)– Super word count (SuperWordCount.java)– Integration (Integration.java)– Largest numbers (LargestGen.java)

Page 10: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Benchmark Analysis

0 5000000000 10000000000 150000000000

50

100

150

200

250

300

Four nodes

Computation

Tim

e

Page 11: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Benchmark Analysis

0 5000000000 10000000000 150000000000

50

100

150

200

250

300

Two nodes

Computation

Tim

e

Page 12: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Benchmark Analysis

• More experiments

Files Computation Time (s)

24 2.4 * 109 102

120 2.4 * 109 179

Nodes Files Slope (sec/109)

4 24 ≈ 30

2 24 ≈ 40

Page 13: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Challenges & Acquirements

• Network & virtual cluster communication• Hadoop technique survey• Cooperation

Page 14: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

References

• http://www.ibm.com/developerworks/cn/• http://en.wikipedia.org/wiki/Hadoop• http://www.michael-noll.com/wiki/• Linux Man Pages• Hadoop source code and Java Doc

http://www.ibm.com/developerworks/cn/

http://en.wikipedia.org/wiki/Hadoop

http://www.michael-noll.com/wiki/

Page 15: Programming in Hadoop Guangda HU tarlou.gd@gmail.com Huayang GUO dragonghy@gmail.com

Thanks

gmail.c om [email protected] [email protected] [email protected] [email protected] pawandeharu001 (@gmail.com [email protected] [email protected]

Spectrum Printing - Siemens€¦ · 14 Huayang Printing & Packaging Machinery: technology leadership in the Chinese flexo printing market with Simotion Indian manufacturers go global

Huayang Guo 1,2 , Ming Wu 1 , Lidong Zhou 1 , Gang Hu 1,2 , Junfeng Yang 2 , Lintao Zhang 1 1 Microsoft Research Asia 2 Columbia University

Introduction to Refractory Products of GuangDa Industry · With strategies of “technical innovation and win-win cooperation”, GuangDa ... Sinosteel Engineering & Technology Co.,

HUA YANG BERHADhuayang.listedcompany.com/newsroom/HUAYANG... · Any reference to persons shall include corporations, unless otherwise specified. Any reference in this Circular to

[email protected] [email protected] Association « Laëticlown » [email protected] [email protected]

CBMRS · 2017. 11. 9. · [email protected] [email protected] [email protected] [email protected] [email protected] [email protected]

Effectively Model Checking Real-World Distributed Systems Junfeng Yang Joint work with Huayang Guo, Ming Wu, Lidong Zhou, Gang Hu, Lintao Zhang, Heming

Efficient Deterministic Multithreading through Schedule ...huayang/files/peregrine.pdfdown [42]) because most code does not control synchronization and can still run in parallel, but

agrup.rodolfowalsh@gmail - WordPress.com · [email protected]. [email protected]. [email protected]

AFFORDABLE HOMES FOR ALLhuayang.listedcompany.com/newsroom/HUAYANG-AnnualReport201… · CONTENTS 02 Corporate Information 04 Notice of Annual General Meeting 08 Chairman’s Statement

Practical Software Model Checking via Dynamic Interface ...junfeng/papers/demeter-sosp11.pdf · Practical Software Model Checking via Dynamic Interface Reduction Huayang Guo† Ming

ORIGINAL ARTICLE ﬁc DsbA-L Overexpression Promotes ...€¦ · Lijun Zhou,2 Hongzhi Chen,4 Guangda Xiang,2 Christi A. Walter,2,5,6,7 Steven N. Austad,2,6 Nicolas Musi,3 Ralph A

d15k2d11r6t6rl.cloudfront.net...Guangda Thermoelectric / Shanghai Power / Design Institutes/ Research Institutes Shanghai Academy of Environmental Sciences / Shanghai Tongji Urban

Microsoft Research Asia Ming Wu, Haoxiang Lin, Xuezheng Liu, Zhenyu Guo, Huayang Guo, Lidong Zhou, Zheng Zhang MIT Fan Long, Xi Wang, Zhilei Xu

Consultas · 2 days ago · Consultas [email protected] [email protected] Seño Camila [email protected] Seño Carina [email protected] [email protected]

HUAYANG Catalog .pdfx

[email protected]

· 2018-09-22 · Wafangdian Guangda Bearing Manufacturing Co., Ltd. (WSW bearing) is located at Wafangdian City, in the north of Dalian City, Liaoning Province, which is famous

Scanned by CamScannergeominjk.nic.in/pdf/ContactNosWithEmailIDs_03072018.pdf · Email ID [email protected] [email protected] [email protected] [email protected] [email protected]

Practical Software Model Checking via Dynamic Interface ... · Practical Software Model Checking via Dynamic Interface Reduction Huayang Guo∗† Ming Wu† Lidong Zhou† Gang Hu∗†

· 2016-01-31 · adamkane13.ak gmail.com myersj yahoo.com devaughn93mcgee gmail.com [email protected] [email protected] davidhenson13 gmail.com joshua.maxwe1172 yahoo.com

The Ancient Tangluo Road and Huayang Township · Paper Reference: Zhou Zhongqing (2008). The Ancient Tangluo Road and Huayang Township. In: Collected Papers of the Symposium on the

IC - 02 Sep 2010 - Hua Yang Bhdhuayang.listedcompany.com/misc/research_reports/HuaYang...2010/09/02 · Equity Research Initiating Coverage Please refer to important disclosures on

Dalian JinYu HuaYang Trade Development Co., Ltd.€¦ · Dalian JinYu HuaYang Trade Development Co., Ltd. ... CAT HEUI SOLENOID 2 CAT C11/13/15/18 ... 61 EUI CAM BOX 62 TEST BENCH

[email protected] [email protected]

[email protected]/images/Contact_Details.pdf · [email protected] [email protected] [email protected] [email protected] [email protected]

March 20, 2019 TensorRT Inference - Nvidia · TensorRT Inference with TensorFlow Pooya Davoodi (NVIDIA) Chul Gwon (Clarifai) Guangda Lai (Google) Trevor Morris (NVIDIA) March 20,

Development of Non-imaging Spectral Library via Field ... · [email protected], [email protected], [email protected], [email protected], 4 [email protected]

[email protected] [email protected] [email protected] [email protected] [email protected]

DOPS: Learning to Detect 3D Objects and Predict Their 3D ......DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes Mahyar Najibi1 Guangda Lai2 Abhijit Kundu2 Zhichao Lu2

Learn Chinese Easily in Yunnan Presented By Huayang Academy

Consultas - bibliotecarivadavia.edu.ar · Consultas [email protected] [email protected] [email protected] [email protected] [email protected] [email protected]

PEREGRINE: Efficient Deterministic Multithreading through Schedule Relaxation Heming Cui, Jingyue Wu, John Gallagher, Huayang Guo, Junfeng Yang Software

KM-1100M H KM-1100MH Item # 13359 MODULAR CRESCENT CUBER [email protected] [email protected] [email protected] [email protected] [email protected] [email protected]