Big Data needs proper tools and skills, and this workshop brings you “from zero to hero,” that is, provides the student with the necessary knowledge of Hadoop, Spark, and NoSQL. With these three fundamentals, you will be able to build systems processing massive amounts of data, in archival, batch, interactive and finally real-time manner. The workshop also lays foundations for proper analytics, allowing to extract insights from data.
Before taking this course, students should be: • Comfortable with Java programming language (most programming exercises are in Java) • Comfortable in Linux environment (be able to navigate Linux command line, edit files using vi / nano)
5 Days/Lecture & Lab
This course was designed for Developers.
- Introduction to Hadoop
- SparkSpark Basics
- RDDs In Depth
- Spark API programming
- Introduction to Spark API / RDD API
- Spark Streaming
- Cassandra Basics
- Cassandra Drivers
- Data Modeling – Part 1
- Data Modeling – Part 2
- Data Modeling Labs : Group Design Sessions