Senior Project Manager & Architect Love Your Data
Distributed Storage (HDFS) Query (Hive) Distributed Processing (MapReduce) Scripting (Pig) NoSQL Database (HBase) Metadata (HCatalog) Data Integration ( ODBC / SQOOP/ REST) Machine Learning (Mahout) Graph (Pegasus) Stats processing (RHadoop) Event Pipeline (Flume) Pipeline / Workflow (Oozie) Legend Red = Core Hadoop Blue = Data processing Purple = Microsoft integration points and value adds Yellow = Data Movement Green = Packages
EDW Data Mart
EDW Analytics Mart Data Mart
Blob Storage Storage Container HDInsight Cluster PowerShell Azure SQL DB
* http :// http ://
Power BI Stack Power Query Power View Power Pivot Power Map