DBI 210 Harnessing Big Data with Hadoop Dipti
DBI 210 Harnessing Big Data with Hadoop Dipti Sangani; Madhu Reddy
BIG HYPE? OR REAL DEAL?
May 21, 2012 VC firms pour money into big-data vendors
title Microsoft’s Big Data Vision and Platform
Dipti Sangani diptis@microsoft. com Madhu Reddy Sr. Product mareddy@microsoft. com Planner Sr. Program Manager
Relational Data
New Economics Open Source Software Commodity Hardware. Cloud Scale.
What’s the social sentiment for my brand or products? LIVE DATA FEEDS SOCIAL & WEB ANALYTICS How do I optimize my fleet based on weather and traffic patterns? How do I better predict future outcomes? ADVANCED ANALYTICS
IMMERSIVE INSIGHT , WHEREVER YOU ARE CONNECTING WITH THE WORLD’S DATA ANY DATA, ANY SIZE ANYWHERE
DATA MANAGEMENT 1 01 RELATIONAL NON-RELATIONAL STREAMING
DATA ENRICHMENT DISCOVER AND RECOMMEND SHARE AND GOVERN TRANSFORM AND CLEAN DATA MANAGEMENT 1 01 RELATIONAL NON-RELATIONAL STREAMING
INSIGHT SELF-SERVICE | COLLABORATIVE | MOBILE | REAL-TIME DATA ENRICHMENT DISCOVER AND RECOMMEND TRANSFORM AND CLEAN SHARE AND GOVERN DATA MANAGEMENT RELATIONAL NON-RELATIONAL 1 01 STREAMING
INSIGHT SELF-SERVICE | COLLABORATIVE | MOBILE | REAL-TIME DATA ENRICHMENT DISCOVER AND RECOMMEND TRANSFORM AND CLEAN SHARE AND GOVERN DATA MANAGEMENT 1 01 RELATIONAL NON-RELATIONAL STREAMING
Make Hadoop Enterprise Ready • Submit changes back to Apache Foundation • ‘Just works’ on Windows Azure and Server • Integration with Visual Studio • • Performance, Scale, High Availability Management, Ease of use Security, Data Governance Integration with AD and SC. • Integration with SQL Server
Use Case: Microsoft BI Tools • Klout provides a score to measure customer influence • Klout used SSAS OLAP Cube to speed queries and offer custom BI OLAP Cube • OLAP Cube loads data from Hive Data Warehouse • New solution analyzes 35 billions rows of data and delivers fast queries in under 10 seconds!
DEMOS!
Oozie (Workflow) Hive (Warehouse and Data Access) Karmasphere (Development Tool) Apache Mahout HBase (Column DB) Map. Reduce (Job Scheduling/Execution System) Hadoop = Map. Reduce + HDFS (Hadoop Distributed File System) HBase / Cassandra (Columnar No. SQL Databases) Flume Sqoop Avro (Serialization) Zookeeper (Coordination) Pig (Data Flow) Traditional BI Tools
Self-Service BI Data Warehouse & Analytics Digital Shoebox ETL & Data Mgmt
demo Hadoop on Azure Name Title Group
Benefits Map. Reduce programs in Java. Script Key Features Simplified Programming Simplified Deployment of Map. Reduce jobs JS Integration with. NET and new Java. Script libraries for Hadoop Deploy Java. Script Hadoop jobs from a simple web browser on any supported device
demo Big Data App Development
Benefits Key Features Familiar self service BI tools Hive ODBC Driver integrates Hadoop to SQL Server Analysis Services, Power. Pivot, and Power View, Hive Add-in for excel
demo Big Data Analytics with Hive and Excel
CALL TO ACTION Checkout: http: //Hadoop. On. Azure. com
THANK YOU!
mva
Learning Connect. Share. Discuss. Microsoft Certification & Training Resources http: //northamerica. msteched. com www. microsoft. com/learning Tech. Net Resources for IT Professionals Resources for Developers http: //microsoft. com/technet http: //microsoft. com/msdn
Complete an evaluation on Comm. Net and enter to win!
Scan the Tag to evaluate this session now on my. Tech. Ed Mobile
- Slides: 34