Course Duration in Hours
60
60
Hadoop Development Course Content
INTRODUCTION TO BIGDATA
What is Big Data?
Examples of Big Data
Reasons of Big Data Generation
Why Big Datadeserves your attention
Use cases ofBig Data
Different options of analyzingBig Data
INTRODUCTION TO HADOOP
What is Hadoop
History of Hadoop
How Hadoop name was given
Problems with Traditional Large-Scale Systems and Need for Hadoop
Understanding Hadoop Architecture
Fundamental of HDFS (Blocks, Name Node, Data Node, Secondary Name Node)
Rack Awareness
Read/Write from HDFS
HDFS Federation and High Availability
STARTING HADOOP
Setting up single node Hadoop cluster(Pseudo mode)
Understanding Hadoop configuration files
Hadoop Components- HDFS, MapReduce
Overview Of Hadoop Processes
Overview Of Hadoop Distributed File System
The building blocks of Hadoop
Hands-On Exercise: Using HDFS commands
MAPREDUCE-1(MR V1)
Understanding Map Reduce
Job Tracker and Task Tracker
Architecture of Map Reduce
Map Function
Reduce Function
Data Flow of Map Reduce
How Map Reduce Works
Anatomy of Map Reduce Job (MR-1)
Submission & Initialization of Map Reduce Job
Assigning & Execution of Tasks
Monitoring & Progress of Map Reduce Job
Hadoop Writable and Comparable
Map Reduce Types and Formats
Understand Difference Between Block and Input Split
Role of Record Reader
Different File Input Formats
Map Reduce Joins
MAPREDUCE-2(YARN)
Limitations of Current Architecture
YARN Architecture
Application Master, Node Manager&Resource Manager
Job Submission and Job Initialization
Task Assignment and Task Execution
Progress and Monitoring of the Job
Failure Handling in YARN
Task Failure
HIVE
Introduction to Apache Hive
Architecture of Hive
Installing Hive
Hive data types
Hive-HQL
Types of Tables in Hive
Partitions
Parquet file
Sequence file
RC FILE
ORC file
SERD
Buckets& Sampling
Indexes
Views
Executing hive queries from Linux terminal
Executing hive queries from a file
Creating UDFs in HIVE
Hands-On Exercise
Hive with Hbase Integration
Security
PIG
Introduction to Apache Pig
Install Pig
Architecture
Data types
Working with various PIG Commands covering all the functions in PIG
Working with un-structured data
Working with Semi-structured data
Creating UDFs
Hands-On Exercise
Pig with Hbase Integration
Pig with
SQOOP
Introduction to SQOOP& Architecture
Installation of SQOOP
Import data from RDBMS to HDFS
Importing Data from RDBMS to HIVE
Exporting data from HIVE to RDBMS
Hands on exercise
HBASE
Introduction to HBASE
Installation ofHBASE
Exploring HBASE Master & Region server
Exploring Zookeeper
CRUD Operation of HBase with Examples
HIVE integration with HBASE
Hands on exercise on HBASE
OOZIE
BOTH STANDALONE AND CLUSTER
OOZIE META DATA MANAGEMENT
FLUME
HADOOP WITH TWITER DATA
TWITER DATA WITH ANALYTICS
SPARK
INTRODUCTION
SPARK CORE
SPARK STREAMING
SPARK SQL
SPARK INSTALLATION
INTEGRATION WITH HADOOP
SPARK MLIB
KAFKA
INTRODUCTION
KAFKA INSTALATION
PRODUCER
PRODUCER CONFIGURATION
BROKER
MULTIPLE BROKER CONFIGURATION
CONSUMER
KAFKA CLIENT
R:- ANALYTIC
R-HDFS
R-HBASE
R-HIVE
R-RMR
RHIPE
INSTALLATION
HOSTING HADOOP ON GOOGLE CLOUD
HOSTING HADOOP ON AWS EC2 CLOUD
STORM
CASSANDRA
MONGO DB
SCALA
Machine Learning
REPORTING
Tableau
FAQS, REAL TIME ENVIRONMENT & REAL TIME SCENARIOS
REAL TIME PROJECT
We will be providing raw data & requirements for the project & you will have to work. Finally we will have one Project execution session where we will be explaining the steps for execution.
Any graduation
Krishvi Technologies, Munekollal (Bangalore),Bangalore,IN