Course Duration in Hours
60
60
HADOOP COURSE CONTENTS
Hadoop Distributed File System (HDFS)
Design of HDFS, HDFS Concepts
Command Line Interface
Java Interface
NameNode, DataNode and Secondary NameNode
Data Flow (Anatomy of a File Read, Anatomy of a File Write, Coherency Model)
Parallel Copying with DISTCP
Hadoop Archives
Setting up Hadoop Cluster
Cluster Specification
Cluster Setup and Installation
SSH Configuration
Hadoop Configuration
MapReduce
Writing a MapReduce job
Anatomy of a MapReduce Job Run (JobTracker and TaskTracker)
MapReduce Configuration
Hadoop Data Types
Functional - Concept of Mappers
Functional - Concept of Reducers
Functional - Concept of Partitioners
Functional - Concept of Combiners
Input Formats (Input Splits and Records, Text Input, Binary Input, Multiple Inputs, Database Input and Output)
Output Formats (Text Output, Binary OutPut, Multiple Outputs, Database output)
Hands-On Exercise
PIG
Installing and Running Pig
Pig s Data Model
Pig Latin
Developing & Testing Pig Latin Scripts
Making Pig Fly
Writing Evaluation
Filter
Load & Store Functions
User Defined Functions (UDF)
Hands-On Exercise
HIVE
Installing Hive
Running Hive (Configuring Hive, Hive Services, MetaStore)
Comparison with Traditional Database (Schema on Read Versus Schema on Write, Updates, Transactions and Indexes)
Hive QL (Data Types, Operators and Functions)
Tables (Managed Tables and External Tables, Partitions and Buckets, Storage Formats, Importing Data, Altering Tables, Dropping Tables)
Querying Data (Sorting And Aggregating, Map Reduce Scripts, Joins & Sub queries & Views, Map and Reduce site Join to optimize Query)
User Defined Functions
Appending Data into existing Hive Table
Custom Map/Reduce in Hive
Hands-On Exercise
HBASE
Introduction
Installation
HBase Architecture
Exploring HBase Master & Regionservers
HBase Shell Commands
Intro to Zookeeper
Client API Basics
Client API - Advanced Features
Client API - Administrative Features
MapReduce Integration
Hands-On Exercise
Building Applications with Zookeeper (Zookeeper in Production)
SQOOP
Introduction to Sqoop
Installation of Sqoop on your system
Import/Export data from RDBMS to HDFS
Import/Export data from RDBMS to HBase
Import/Export data from RDBMS to Hive
Hands-On Exercise
NEXT-GEN HADOOP
Intro to High level concepts in Hadoop 2.0
YARN (MapReduce 2.0)
DAN IT SERVICES, BTM Layout (Bangalore),Bangalore,IN