Module |
Objective |
Topics |
1: Introduction to Big Data & Hadoop |
In this module, you will learn the context of Big Data, its definition, and how it integrates with Hadoop for storage and processing. |
Big Data context with case studies, Big Data definition, structured vs unstructured, Business analytics lifecycle, Hadoop basics, Hadoop characteristics, Hadoop ecosystem, Hadoop core components, Secondary NameNode. |
2: HDFS Internals and YARN |
In this module, you will learn the concept of Blocks, Rack awareness, HDFS architecture and a few Hadoop commands. This module also covers HDFS High Availability, HDFS Federation and YARN. |
Blocks, Data Replication and Rack Awareness, HDFS File Write Anatomy, HDFS File Read Anatomy, HDFS Architecture, Common Hadoop Shell Commands, High Availability, HDFS Federation, YARN, Running our First Map Reduce Job, Checking the output of the M/R Job and Understanding the dump of the Map Reduce Job. |
3: Introduction to Map Reduce |
In this module, you will learn about the different Hadoop modes, configuration files, Split vs Blocks, and traditional and Hadoop-based distributed computing techniques. You will also learn advanced concepts like Combiners & Partitioners and Shuffle & Sort. |
Hadoop Cluster Modes, Configuration Files, Web URLs, Split vs Blocks, Map Reduce use-cases, Solving a problem the traditional way, Understanding the Map Reduce way, Map Reduce Anatomy, Advantages of Map Reduce, and Map Reduce Flow. |
4: Advanced Map Reduce Concepts |
In this module, you will learn advanced concepts such as MRUnit, Counters, Distributed Cache and Joins. |
MRUnit, Counters, Distributed Cache, Joins, Secondary Sort and Total Order Sort. |
5: Pig and Advanced Pig |
In this module, you will get a complete understanding of Pig and its association with Map Reduce. |
Pig Background, Need for Pig, Pig Vs M/R, Pig Definition, Pig Latin, Pig users, Pig usage at Yahoo, Pig Interaction Modes, Pig program execution, Pig data model, Pig data types, Pig operators and specialized Joins |
6: Hive and Advanced Hive |
In this module, you will learn about Hive background, how Hive compares with RDBMS, and Hive design and architecture. |
Hive Background, Hive Definition, Pig vs Hive, RDBMS vs Hive, Hive components, Hive Architecture, Hive Meta Store, Hive Design, Hive Data Model, Partition and Buckets |
7: HBase and Advanced HBase |
In this module, you will learn about NoSQL background, how HBase compares with RDBMS, and the HBase data model and architecture. |
NoSQL background and description, Real time scenarios, NoSQL landscapes, HBase definition, HBase characteristics, HBase history, HBase vs. RDBMS, HBase Data Model, HBase Data Model – Graphical representation, HBase Data Model – Logical Vs. Physical representation, Version concepts, Region and Region Servers and Zookeeper |
8: Oozie and Sqoop |
In this module, you will learn the concepts of Oozie as a Hadoop Workflow Framework and how it orchestrates the execution of Hadoop Components. |
Oozie workflow, Oozie server, Oozie co-ordinator, Oozie Bundles, Configuration XML and Properties file, Creating Oozie application, Oozie web console, Oozie scheduling, Sqoop Setup between Hadoop and RDBMS, Exporting Data from Hadoop into RDBMS, Importing Data from RDBMS into Hadoop. |
9: Zookeeper and Flume |
In this module, you will learn about Zookeeper as a distributed cluster co-ordination system and how it can be used to keep clusters synchronized and avoid race conditions. You will also learn about Flume as a Hadoop sub-component to pull data from unstructured data sources. |
Zookeeper Master, Zookeeper Slave, concept of Ephemeral Nodes, Persistent and Optional Sequential Numbering, Configuration Management. Flume agent, Source, Sink, Defining the Flume flow, configuring individual components, configuring entire Flume set-up. |
10: Project Set-up Discussion |
In this session, you will get familiar with the project you will be working on for your certification, along with several other topics. |
Project Discussions, Evaluating Individual Approaches, Finalizing Optimal Approach |