The trainer needs to deliver training on the topics below:
Big Data
What is Big Data
Dimensions of Big Data
Big Data in Advertising
Big Data in Banking
Big Data in Telecom
Big Data in eCommerce
Big Data in Healthcare
Big Data in Defense
Processing options of Big Data
Hadoop as an option
Hadoop
What is Hadoop
How Hadoop 1.0 Works
How Hadoop 2.0 Works
HDFS
MapReduce
What is YARN
How YARN Works
Advantages of YARN
How Hadoop has an edge
Hadoop Ecosystem
Sqoop
Oozie
Pig
Hive
Flume
Hadoop Hands On
Running HDFS commands
Running your MapReduce program on Hadoop 1.0
Running your MapReduce program on Hadoop 2.0
Running Sqoop Import and Sqoop Export
Creating Hive tables directly from Sqoop
Creating Hive tables
Querying Hive tables
Advanced MapReduce
MapReduce Code Walkthrough
ToolRunner
MRUnit
Distributed Cache
Combiner
Partitioner
Setup and Cleanup methods
Using Java API to access HDFS
Custom Types
Input Types in MapReduce
Output Types in MapReduce
Custom Input Data types
Custom Output Data types
Multiple Reducer MR program
Zero Reducer Mapper Program
MapReduce Design Patterns:
Searching
Sorting
Filtering
Inverted Index
TF-IDF
Word Co-occurrence
Pig
What is Pig
How Pig Works
Simple processing using Pig
Advanced Processing Using Pig
Pig Hands On
Oozie
What is Oozie
How Oozie Works
Oozie Hands On
Joins Using MapReduce
Map Side joins
Reduce side joins
Advanced MapReduce Hands On
MRUnit hands on
Distributed Cache hands on
Partitioner hands on
Combiner hands on
Accessing files using HDFS API hands on
Map Side joins hands on
Reduce side joins hands on
MapReduce Design Patterns Hands On:
Distributed Grep
Bloom Filters
Average Calculation
Standard Deviation
Map Side joins
Reduce Side joins
Hive
What is Hive
How Hive Works
Simple processing using Hive
Advanced processing using Hive
Hive Hands On
Impala
What is Impala
How Impala Works
Where Impala is better than Hive
Impala's shortcomings
Impala Hands On
Evaluation Test