Course Overview
Youtube Data Analysis Using Hadoop:
This course is basically intended for users who are interested to learn about application of Hadoop technologies and how to work on the data sets with ease in particular. We are going to learn analyzing YouTube’s video upload data using Hadoop.
We can analyze YouTube Video’s in following scenarios
1. Identify top 5 categories in which most of the vedio’s are uploaded
2. Top 10 highest rated video’s
3. Top 10 most viewed
4. Top 10 lengthy vedio’s
5. How many peoples are less than 18 yrs age (not adult) and uploaded vedio’s
6. How many child (less than 18 yrs age) in which categories they uploaded vedio’s (basically to check not uploaded in adult categories)
7. Number of comments per vedio’s
8. Sorting of data on basis of uploader name
9. Sorting of data on basis of uploaded age
You will analyze this data in different big data tools like MR, PIG and Hive and can generate some useful information for YouTube to take certain strategic decisions.
Target Customers:
- Students
- Professionals
- Data analysts/developers
- Anyone who wants to learn about application of Hadoop
Pre-Requisites:
- Basic Computer Knowledge
- Experience of coding
- Basic knowledge of Hadoop Distributed File System (HDFS)