Course Overview
Web Log Analysis Using Apache Pig:
Apache Pig is a high-level platform for creating programs that run on Apache Hadoop. The language for this platform is called Pig Latin. Pig can execute its Hadoop jobs on MapReduce, Apache Tez, or Apache Spark. Pig Latin abstracts the programming away from the Java MapReduce idiom into a higher-level notation, much as SQL does for relational database management systems. Pig Latin can be extended with user-defined functions, which the user can write in Java, Python, JavaScript, or Ruby and then call directly from the language.
The training will help you learn about:
- Basic Pig concepts and commands
- How to find the most viewed page
- How to find the total hits per day: group the log entries by calendar day and count the hits in each group
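The two analyses listed above both come down to grouping log entries by a key and counting each group, which is the kind of work Pig Latin's GROUP and COUNT operations express. As a rough illustration of that logic outside of Pig, here is a minimal Python sketch. It assumes the web log is in the Common Log Format; the sample lines, the regular expression, and the function names are all made up for this example and are not part of the course material.

```python
import re
from collections import Counter

# Hypothetical sample entries, assuming the Common Log Format.
SAMPLE_LOG = [
    '10.0.0.1 - - [01/Jan/2024:10:00:00 +0000] "GET /index.html HTTP/1.1" 200 512',
    '10.0.0.2 - - [01/Jan/2024:10:05:00 +0000] "GET /about.html HTTP/1.1" 200 256',
    '10.0.0.3 - - [01/Jan/2024:11:00:00 +0000] "GET /index.html HTTP/1.1" 200 512',
    '10.0.0.1 - - [02/Jan/2024:09:00:00 +0000] "GET /index.html HTTP/1.1" 200 512',
]

# Captures the day part of the timestamp and the requested page.
LINE_RE = re.compile(
    r'\S+ \S+ \S+ \[(?P<day>[^:]+):[^\]]+\] '
    r'"\S+ (?P<page>\S+)[^"]*" \d{3} \S+'
)


def parse(lines):
    """Yield (day, page) pairs, skipping lines that do not match."""
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            yield match.group("day"), match.group("page")


def most_viewed_page(lines):
    """Return (page, hits) for the most requested page, or None if empty."""
    counts = Counter(page for _, page in parse(lines))
    return counts.most_common(1)[0] if counts else None


def hits_per_day(lines):
    """Return a dict mapping each day to its total number of hits."""
    return dict(Counter(day for day, _ in parse(lines)))


if __name__ == "__main__":
    print(most_viewed_page(SAMPLE_LOG))
    print(hits_per_day(SAMPLE_LOG))
```

In a Pig script the same idea would be written as a load step, a grouping step, and a count per group; the sketch only shows the shape of the computation on a handful of lines.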
Target Customers:
- Students
- Professionals
- Data analysts/developers
- Anyone who wants to learn about Pig
Prerequisites:
- Basic computer knowledge
- Some coding experience
- Basic knowledge of Hadoop Distributed File System (HDFS)