Course Overview
Connecting to Twitter Using Spark:
Spark Streaming is an extension of the core Spark API that enables scalable, high-throughput, fault-tolerant stream processing of live data streams. Data can be ingested from many sources like Kafka, Flume, Twitter, ZeroMQ, Kinesis, or TCP sockets, and can be processed using complex algorithms expressed with high-level functions like map, reduce, join and window. Finally, processed data can be pushed out to filesystems, databases, and live dashboards.
The training will help you learn about
- Introduction to Spark streaming
- How to create Dstreams and Windowed Dstreams
- How to create Checkpoint Directory
- Creating Twitter Application
- Connecting To twitter Using Spark
Target Customers:
- Students/Professionals Interested in learning about Spark Streaming
- Data Engineers
- Analysts
- Software Engineers
- IT operations
- Technical managers
Pre-Requisites:
- Basic computer knowledge
- Basic understanding of Java/Scala/Python
- Knowledge of Mapreduce paradigm