Course Duration in Hours
27
27
Introduction to Big Data and Analytics
Introduction to Hadoop
Hadoop ecosystem - Concepts
Hadoop Map-reduce concepts and features
Developing the map-reduce Applications
Pig concepts
Hive concepts
Sqoop concepts
Flume Concepts
Oozie workflow concepts
Impala Concepts
Hue Concepts
HBASE Concepts
ZooKeeper Concepts
Real Life Use Cases
Hadoop
Why Hadoop?
Scaling
Distributed Framework
Hadoop v/s RDBMS
Brief history of hadoop
Setup hadoop
Pseudo mode
Cluster mode
Ipv6
Ssh
Installation of java, hadoop
Configurations of hadoop
Hadoop Processes ( NN, SNN, JT, DN, TT)
Temporary directory
UI
Common errors when running hadoop cluster, solutions
HDFS- Hadoop distributed File System
HDFS Design and Architecture
HDFS Concepts
Interacting HDFS using command line
Interacting HDFS using Java APIs
Dataflow
Blocks
Replica
Hadoop Processes
Name node
Secondary name node
Job tracker
Task tracker
Data node
Map Reduce
Developing Map Reduce Application
Phases in Map Reduce Framework
Map Reduce Input and Output Formats
Advanced Concepts
Sample Applications
Combiner
Joining datasets in Mapreduce jobs
Map-side join
Reduce-Side join
Map reduce customization
Custom Input format class
Hash Partitioner
Custom Partitioner
Sorting techniques
Custom Output format class
Hadoop Programming Languages :-
HIVE
Introduction
Installation and Configuration
Interacting HDFS using HIVE
Map Reduce Programs through HIVE
HIVE Commands
Loading, Filtering, Grouping.
Data types, Operators..
Joins, Groups.
Sample programs in HIVE
PIG
Basics
Installation and Configurations
Commands.
OVERVIEW HADOOP DEVELOPER
Introduction
The Motivation for Hadoop
Problems with traditional large-scale systems
Requirements for a new approach
Anyone who has knowledge on Java, basic UNIX and basic SQL can opt for the Big Data and Hadoop training course.
Key features
Center for Advanced Skills and Technologies, Indore,IN