Course Duration in Hours
70
Introduction to Hadoop
What is Hadoop
Why Hadoop
History of Hadoop
The Motivation of Hadoop
Hadoop architecture
Overview of HDFS (Hadoop Distributed File System) and MR (Map Reduce) framework
Overview of problems solved by Hadoop
Data Mining
Web Mining
Natural Language Processing
Sentiment Analysis
Setting up Hadoop
Pseudo-distributed Mode
Cluster
Common Errors when running Hadoop cluster
Incompatible namespace IDs
Protocol version mismatch
Safe mode exception
HDFS (Hadoop Distributed File System)
HDFS Design and Architecture
HDFS Concepts
Interacting with HDFS using the command line
Interacting with HDFS using Java APIs
Running MR applications on the local file system, in pseudo mode, and in cluster mode
Creating and deleting directories on HDFS
Creating and deleting files on HDFS
Reading and writing files on HDFS
Moving files within HDFS
Renaming files
Submitting Jobs
Map Reduce Programming Model
Developing Map Reduce Application
Phases in Map Reduce Framework
Map Reduce Input and Output Formats
Advanced Concepts
Hadoop Programming Languages (6 hrs)
Hive
Installation
Creating tables
Writing Hive Queries
Pig
Installation
Concepts
Data processing operators
Writing UDFs
NoSQL Databases (2-4 hrs)
Concepts: SQL vs. NoSQL
Cassandra
Architecture
Concepts
Installation
Performing CRUD (Create, Read, Update, and Delete)
Case Studies / Projects (10-12 hrs)
Data Mining on Wikipedia data set using
Batch Mode Processing (MR)
Using HBase and Hive
Integrating Tools with Hadoop
Informatica
Tableau
Pentaho
HADOOP ADMINISTRATION
Hadoop Architecture
Hadoop Installation
Cluster Maintenance
Starting and stopping jobs
File system check
Backup and restore
Upgrades
Scheduling jobs
FIFO Scheduler
Fair Scheduler
Cluster monitoring
Logging
Metrics
Audit Logging
B.Tech/B.E
M.Tech/M.E
MCA
Degree
QMinds Technologies, Marathahalli (Bangalore), Bangalore, IN