Course Duration in Hours
80
Hadoop Distributions
Apache
Cloudera
HDInsight
Hortonworks
Course Objective Summary
Introduction to Big Data and Hadoop
Hadoop Cluster
HDFS
Hadoop MapReduce Concepts and Features
Developing MapReduce Applications
Sqoop Concepts
Hive Concepts
Pig Concepts
HBase Concepts
Flume Concepts
Real Time Use Cases
Introduction to Big Data and Hadoop
What is Big Data?
What are the challenges for processing big data?
What technologies support big data?
What is Hadoop?
Why Hadoop?
History of Hadoop
Use Cases of Hadoop
Hadoop Ecosystem
HDFS
MapReduce
Understanding the Cluster
Introduction to and Installation of a Hadoop Custom VM (Single Node/Multi Node)
Using Cloudera
Using HDInsight
HDFS
HDFS Overview and Architecture
NameNode
CheckpointNode
DataNode
Data Replication
Configuration Files
HDFS Data Flows
Read
Write
HDFS Commands
Rack Awareness
Advanced HDFS Features
HDFS Federation
HDFS High Availability
Let's Talk MapReduce
Before MapReduce
MapReduce Overview
Word Count Problem
Word Count Flow and Solution
MapReduce Flow
Algorithms for simple problems
Algorithms for complex problems
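The word count flow listed above (map, shuffle and sort, reduce) can be sketched in plain Python without a cluster. This is only an illustrative, in-memory simulation: the function names map_phase, shuffle_phase, reduce_phase and word_count are hypothetical and are not part of any Hadoop API.

```python
from collections import defaultdict


def map_phase(line):
    """Mapper: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle_phase(pairs):
    """Shuffle and sort: group all values by key, with keys in sorted order."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return sorted(grouped.items())


def reduce_phase(key, values):
    """Reducer: sum the counts collected for one word."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in sequence over a list of text lines."""
    mapped = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(key, values) for key, values in shuffle_phase(mapped))


if __name__ == "__main__":
    sample = ["Hadoop stores data", "Hadoop processes data"]
    print(word_count(sample))
```

On a real cluster the mapper and reducer would run as separate tasks on different nodes and the framework would perform the shuffle; here all three phases run in one process purely to show the data flow.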
Developing the MapReduce Application
Data Types
File Formats
Driver, Mapper and Reducer Code Explained
Configuring the Development Environment - Eclipse
Writing Unit Tests
Running Locally
Running on a Cluster
Hands-on Exercises
How MapReduce Works
Anatomy of a MapReduce Job Run
Job Submission
Job Initialization
Task Assignment
Job Completion
Job Scheduling
Job Failures
Shuffle and Sort
Hands-on Exercises
MapReduce Types and Formats
MapReduce Types
Input Formats - input splits and records, text input, binary input, multiple inputs and database input
Output Formats - text output, binary output, multiple outputs, lazy output and database output
Hands-on Exercises
MapReduce Features
Counters
Sorting
MapReduce Combiner
MapReduce Partitioner
Hands-on Exercises
MapReduce Sample Algorithms
Word Count
Average Word Length
Word Co-occurrence
Combiner
Partitioner
Hospital Bill Processing
Inverted Index
Twitter
Sqoop
Sqoop Introduction
Import Data From Traditional Systems to HDFS
Export Data From HDFS to Traditional Systems
Hive
Hive Introduction
Hive Vs RDBMS
Hive Interpreter
HiveQL
Hive Tables
Internal
External
Hive Partitions
Hive UDFs
Pig
Pig Introduction
Pig Vs Hive Vs MapReduce
Pig Interpreter
Pig Latin
Running Pig Scripts
Local Mode
MapReduce Mode
Pig UDFs
HBase
HBase Introduction
RDBMS Vs HBase (NoSQL)
HBase Commands
Integrating MR with HBase
Other Hadoop Ecosystem Tools
Flume
Oozie
HBase
B.Tech, M.Tech, M.C.A, B.C.A, B.Sc and M.Sc (Computers)
IT professionals and students with Java knowledge
Lara Tech, Ameerpet, Hyderabad, IN