Course Overview
Introduction to Apache Pig – Hadoop Training:
Pig is a simple, high-level data flow language used to analyze large data sets. Pig can execute its Hadoop jobs in MapReduce, Apache Tez, or Apache Spark.
This course is intended for anyone who wants to learn Apache Pig and, in particular, how to work with data sets easily. The training covers core Pig concepts and the commands for loading, storing, grouping, and joining data. We will look at combining and splitting data with the UNION and SPLIT operators, then at filtering, sorting, and limiting data with the FILTER, ORDER BY, and LIMIT operators, and finally at Pig's built-in and custom (user-defined) functions.
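To give a feel for the operators listed above, here is a minimal Pig Latin sketch. It is only an illustration and not taken from the course material: the file paths, relation names, and the schema (id, name, dept, salary) are all hypothetical.

```pig
-- Hypothetical input files: comma-separated employee and department records.
emp_a = LOAD 'input/employees_a.csv' USING PigStorage(',')
        AS (id:int, name:chararray, dept:chararray, salary:double);
emp_b = LOAD 'input/employees_b.csv' USING PigStorage(',')
        AS (id:int, name:chararray, dept:chararray, salary:double);
depts = LOAD 'input/departments.csv' USING PigStorage(',')
        AS (dept:chararray, location:chararray);

-- UNION combines two relations that share the same schema.
employees = UNION emp_a, emp_b;

-- FILTER keeps only the rows that satisfy a condition.
well_paid = FILTER employees BY salary > 50000.0;

-- GROUP collects rows by key; AVG is a built-in function applied per group.
by_dept    = GROUP employees BY dept;
avg_salary = FOREACH by_dept GENERATE group AS dept,
                                      AVG(employees.salary) AS avg_sal;

-- JOIN matches rows from two relations on a common key.
with_location = JOIN employees BY dept, depts BY dept;

-- ORDER sorts a relation; LIMIT keeps the first N rows of it.
sorted = ORDER employees BY salary DESC;
top5   = LIMIT sorted 5;

-- SPLIT routes rows into several relations based on conditions.
SPLIT employees INTO low_paid IF salary < 30000.0,
                     others   IF salary >= 30000.0;

-- STORE writes a relation back to the file system.
STORE avg_salary INTO 'output/avg_salary' USING PigStorage(',');
STORE top5       INTO 'output/top5'       USING PigStorage(',');
```

A script like this could be run in local mode with `pig -x local script.pig` for experimentation, assuming the input files exist at the paths shown.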
Course Objective:
- To understand basic Pig concepts and commands
- To learn working with data in Pig
Target Customers:
- Students
- Professionals
- Data analysts/developers
- Anyone who wants to learn about Pig
Pre-Requisites:
- Basic Computer Knowledge
- Coding experience
- Basic knowledge of Hadoop Distributed File System (HDFS)