Course Duration in Hours
60
60
Course Contents
Hadoop Development
i. Introduction
ii. Hadoop Installation - configuration
iii. HDFS
iv. HDFS design considerations
v. Mapreduce
vi. Pig Installation, Configuration
vii. Grunt shell
viii. Data Model of Pig
ix. Advance features of pig latin
x. Development of PL scripts
xi. Performance tuning in pig
xii. Load-store functions in pig
xiii. Hive Introduction, Installation, Configuration
xiv. Data types and file formats
xv. HiveQL - DDL
xvi. HiveQL DML
xvii. Views in hive
xviii. Indexes in hive
xix. Performance tuning in hive
xx. Sqoop Installation
xxi. Sqoop - import data
xxii. Sqoop free form query import
xxiii. Sqoop export data
NoSQL
i. Introduction to NoSQL
ii. Interacting with NoSQL
iii. Storage Architecture
iv. CRUD operations
v. Query NoSQL stores
vi. Modifying data stores
vii. Indexing
viii. Managing Transactions
ix. NoSQL in cloud
x. Parallel processing
xi. Performance tuning
xii. Tools and Utilities
Pig
i. Introduction
ii. Installation, Configuration
iii. Grunt
iv. Data Model of Pig
v. Pig Latin
a. I/O
b. Relational operations
c. UDFs
vi. Advance features of pig latin
vii. Development of PL scripts
viii. Testing of PL scripts
ix. Performance tuning
x. PL in Python
xi. Filter functions
xii. Load-store functions
xiii. Pig and NoSQL
Hive
i. Introduction, Installation, Configuration
ii. Data types and file formats
iii. HiveQL - DDL
1. Database
2. Table &Index
3. Partitions
iv. HiveQL DML
1. Joins
2. Where clause
3. Group by
4. Casting
v. Views
vi. Indexes
vii. Schema design
viii. Performance tuning
ix. Compression
x. Hive development building hive from source
xi. Functions in hive
xii. Streaming
xiii. Hive thrift service
xiv. Storage handlers
xv. Security
xvi. Locking
xvii. Hive Integration
xviii. HCatalog
xix. Serialization
HBASE
i. HBASE fundamentals
ii. Data manipulation
iii. Data coordinates
iv. Data Models
v. ACID semantics
vi. Distributed HBASE
vii. HBASE &MapReduce
viii. HBASE schema design
ix. HBASE table design
x. De-normalization
xi. Heterogeneous data
xii. I/O considerations
xiii. Advanced column family configurations
xiv. Extending HBASE
xv. HBASE clients
xvi. HBASE deployment
xvii. HBASE distribution and configuration
xviii. Monitoring cluster
xix. Backup
xx. Replication
xxi. Migration
Freshers and Experienced
IGeekS Technologies, Malleswaram (Bangalore),Bangalore,IN