Machine Learning – Apache Storm: Learn by Example

$25.00

In this ’Apache Storm: Learn by Example’ online course, you will learn how to use Storm to build applications which need you to be highly responsive to the latest data, and react within seconds and minutes, such as finding the latest trending topics on Twitter, or monitoring spikes in payment gateway failures.  Supplemental material included!

Length: 4 hrs 4 min

This title is available in the
Total Training All-Access library.

Subscribe Now

Description

Storm is to real-time stream processing what Hadoop is to batch processing.  From simple data transformations to applying machine learning algorithms on the fly, Storm can do it all.

What’s covered in this Apache Storm: Learn by Example online training course?

  1. Understanding Spouts and Bolts, which are the building blocks of every Storm topology
  2. Running a Storm topology in the local mode and in the remote mode
  3. Parallelizing data processing within a topology using different grouping strategies: Shuffle grouping, Fields grouping, Direct grouping, All grouping, Custom grouping
  4. Managing reliability and fault-tolerance within Spouts and Bolts
  5. Performing complex transformations on the fly using the Trident topology: Map, Filter, Windowing, and Partitioning operations
  6. Applying ML algorithms on the fly using libraries like Trident-ML and Storm-R

What are the requirements?

  • Experience in Java programming and familiarity with using Java frameworks
  • A Java IDE such as IntelliJ Idea should be installed

What am I going to get from this course?

  • Build a Storm Topology for processing data
  • Manage reliability and fault tolerance of the topology
  • Control parallelism using different grouping strategies
  • Perform complex transformations using Trident
  • Apply Machine Learning algorithms on the fly in Storm applications

What is the target audience?

  • Engineers looking to set up end-to-end data processing pipelines that react to changes in real time
  • Folks familiar with Batch processing technologies like Hadoop who want to learn more about Stream processing

Sample clip

 

Chapter 01: You, This Course, and Us 02:06

  1. Introduction

Chapter 02: Stream Processing with Storm 25:29

  1. How does Twitter compute Trends?
  2. Improving Performance using Distributed Processing
  3. Building blocks of Storm Topologies
  4. Adding Parallelism in a Storm Topology
  5. Components of a Storm Cluster

Chapter 03: Implementing a Hello World Topology 25:20

  1. A Simple Hello World Topology
  2. Ex 1: Implementing a Spout
  3. Ex 1: Implementing a Bolt
  4. Ex 1: Submitting the Topology

Chapter 04: Processing Data using Files 34:08

  1. Ex 2: Reading Data from a File
  2. Representing Data using Tuples
  3. Ex 3: Accessing data from Tuples
  4. Ex 4: Writing Data to a File

Chapter 05: Running a Topology in the Remote Mode 14:42

  1. Setting up a Storm Cluster
  2. Ex 5: Submitting a topology to the Storm Cluster

Chapter 06: Adding Parallelism to a Storm Topology 24:36

  1. Ex 6 : Shuffle Grouping
  2. Ex 7: Fields Grouping
  3. Ex 8: All Grouping
  4. Ex 9: Custom Grouping
  5. Ex 10: Direct Grouping

Chapter 07: Building a Word Count Topology 10:04

  1. Ex 11: Building a Word Count Topology

Chapter 08: Remote Procedure Calls Using Storm 12:48

  1. Ex 12: A Storm Topology for DRPC calls

Chapter 09: Managing Reliability of Topologies 10:31

  1. Ex 13: Managing Failures in Spouts

Chapter 10: Integrating Storm with Different Sources/Sinks 15:33

  1. Ex 14: Implementing a Twitter Spout
  2. Ex 15: Using a HDFS Bolt

Chapter 11: Using the Storm Multilang Protocol 08:24

  1. Ex 16: Building a Storm Topology using Python

Chapter 12: Complex Transformations using Trident 01:00:05

  1. Ex 17: Building a basic Trident Topology rs Classifier
  2. Ex 18: Implementing a Map Function
  3. Ex 19: Implementing a Filter Function
  4. Ex 20: Aggregating data Classifiers
  5. Ex 21: Understanding States
  6. Ex 21: Understanding States
  7. Ex 23: Joining data streams
  8. Ex 24: Building a Twitter Hashtag Extractor

Janani Ravi, Vitthal Srinivasan, Swetha Kolalapudi, and Navdeep Singh have honed their tech expertise at Google and Flipkart. Together, they have created dozens of training courses and are excited to be sharing their content with eager students. The team believes it has distilled the instruction of complicated tech concepts into enjoyable, practical, and engaging courses.

Janani: 7 years at Google (New York, Singapore); Studied at Stanford; also worked at Flipkart and Microsoft

Vitthal: Also Google (Singapore) and studied at Stanford; Flipkart, Credit Suisse and INSEAD too

Swetha: Early Flipkart employee, IIM Ahmedabad and IIT Madras alum

Navdeep: Longtime Flipkart employee too, and IIT Guwahati alum

You may also like…