Apache Hadoop is one of the most popular frameworks for processing Big Data on clusters of servers. This course delves into data management in HDFS, advanced Pig, Hive, and HBase. These advanced programming techniques will be beneficial to experienced Hadoop developers.Format: Lectures and hands on labs. (50% lecture + 50% labs). Pace of the class is determined by the students.
Before taking this course, students should have the following skills:Be comfortable with Java programming language (most programming exercises are in java)Be comfortable in Linux environment (be able to navigate Linux command line, edit files using vi/nano)Have attended "Hadoop for Developers" or have similar knowledge
3 Days/Lecture & Lab
This course was designed for developers.
- Data Management in HDFS
- Advanced Pig
- Advanced Hive
- Advanced HBase