presentation title goes heredownload.microsoft.com/download/c/f/f/cff0a653-6cd6-4e52-b97… ·...
TRANSCRIPT
HDInsight-微软Azure云中的Hadoop大数据解决方案概览及案例分享 孙巍 资深项目经理 微软亚太研发集团
课程代码DBI-B308
Hadoop?
Apache 开源项目
高扩展性分布式文件系统 (HDFS)
分布式数据处理框架
Microsoft解决方案
Hadoop 2.2 and 2.4
80% data compression with ORC
Hadoop
on
Windows
Hive 100x Query Speed Up
30,000+ code line contributions
HDFS in Cloud
(Azure)
10,000+ engineering hours
Committers to Hadoop
Hadoop 2.0
Data Node Data Node Data Node Data Node
Task Tracker Task Tracker Task Tracker Task Tracker
Name Node
Job Tracker
HMaster Coordination
Region Server Region Server Region Server Region Server
Stream processin
g
Search and query
Data analytics (Excel)
Web/thick client
dashboards
Devices to take action
RabbitMQ /
ActiveMQ
HDInsight on Hadoop 2.2 April 2014
HDInsight on Hadoop 1.1.2 Oct 2013
HDInsight on Hadoop 2.4 June 2014
O/S Upgrades
O/S Patching
$£€¥
Cloud
案例
课后提醒
https://channel9.msdn.com/Events/Ignite/Microsoft-Ignite-China-2015
http://aka.ms/IgniteChina2015