Azure HDInsight Hadoop Meets the Cloud Microsoft’s managed Hadoop as a Service 100% open source Apache Hadoop Built on the latest releases across Hadoop (2.4) Up and running in minutes with no hardware to deploy Supported by Microsoft
Sentiment Clickstream Machine/Sensor Server Logs Geo-location Monitor real- time data to… Prevent Optimize Securities Fraud Compliance violations Security breaches Network Outages ---Machine failuresDriver & fleet issues Application failures Operational issues Order routing Pricing Bandwidth allocation Customer service Offers Pricing Supply chain Routes Pricing Site content FinanceTelcoRetailManufactur- ing TransportationWeb Common Scenarios
Tuples Core Unit of Data Immutable Set of Key/Value Pair Bolts Core functions of a streaming computation Receive tuples and do stuff Optionally emit additional tuples Spouts Source of Streams Wraps a streaming data source and emits Tuples Core Components of Apache Storm Topology Arrangement of Spouts and Bolts Unit of deployment & management
TridentTopology topology = new TridentTopology(); FixedBatchSpout spout = new FixedBatchSpout(…); Stream stream = topology.newStream(“words”, spout); stream.each(…, new Myfunction()).groupBy().each(…, new MyFilter()).persistentAggregate(…); Trident Fluent, Stream-Oriented API