Spark Summit 2015: Intro to Apache Spark Training



I had the pleasure of leading a workshop at Spark Summit 2015 organized by Databricks.

In the full-day workshop, we covered the core Spark APIs, and technical exercises designed to get developers up to speed using Spark for data exploration, analysis, and building Big Data applications

Video and slides from my talk are available below. If you're interested in learning more about Spark, I'm also teaching an upcoming Spark Foundations course in Philadelphia on October 27th.

Topics covered include:

  • Overview of Big Data and Spark
  • Installing Spark Locally
  • Using Spark’s Core APIs in Scala, Java, Python
  • Building Spark Applications
  • Deploying on a Big Data Cluster
  • Combining SQL, Machine Learning, and Streaming for Unified Pipelines

Video: Part 1


Video: Part 2


Slides


Published July 14, 2015