Course Overview

2h 1m | 18 Videos | 70811 Views

Expert | English [Auto-generated]

Introduction to Apache Pig – Hadoop Training:

Pig is a simple data flow language used to analyze very large data sets. Pig can execute its Hadoop jobs on MapReduce, Apache Tez, or Apache Spark.


This course is intended for users who want to learn Apache Pig and, in particular, how to work with data sets easily. The training covers core Pig concepts and commands: loading and storing data, grouping data, and joining data. We will look at combining and splitting data using the UNION and SPLIT operators, then at filtering, sorting, and limiting data, and finally at Pig's built-in and custom functions.
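
To give a feel for the operators listed above, here is a minimal Pig Latin sketch. It is not taken from the course videos: the file names, field names, and the 100.0 threshold are all hypothetical, and the script assumes two comma-separated input files exist.

```pig
-- Hypothetical inputs: users.csv (id,name,city) and orders.csv (order_id,user_id,amount)
users  = LOAD 'users.csv'  USING PigStorage(',') AS (id:int, name:chararray, city:chararray);
orders = LOAD 'orders.csv' USING PigStorage(',') AS (order_id:int, user_id:int, amount:double);

-- FILTER: keep only orders with a positive amount
valid_orders = FILTER orders BY amount > 0.0;

-- SPLIT: route records into two relations by condition (threshold is an assumption)
SPLIT valid_orders INTO small_orders IF amount < 100.0, large_orders IF amount >= 100.0;

-- UNION: combine the two relations back into one
all_orders = UNION small_orders, large_orders;

-- JOIN: match each order to its user
joined = JOIN users BY id, all_orders BY user_id;

-- GROUP plus the built-in SUM function: total order amount per city
by_city = GROUP joined BY city;
totals  = FOREACH by_city GENERATE group AS city, SUM(joined.amount) AS total;

-- ORDER and LIMIT: sort descending and keep the top five cities
sorted = ORDER totals BY total DESC;
top5   = LIMIT sorted 5;

-- STORE: write the result to an output directory (path is hypothetical)
STORE top5 INTO 'output/top_cities' USING PigStorage(',');
```

Each statement names a new relation built from the previous one, which is what makes Pig a data flow language: the script describes a pipeline, and Pig turns it into jobs on the chosen execution engine.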