BIG Data & Hadoop
£280.00 £273.00 Add to cart
Sale!

BIG Data & Hadoop

£273.00

-3%

This is a live instructor led program featuring 30 hours of live instructor led training, 60+ hours of practicals, 10+ live industry projects, 24/7 lifetime support, 100% placement assistance and recognized certification.

THE NEXT BATCH DATES ARE

  • July 22 2017
  • July 29 2017

WHAT IS INCLUDED IN COURSE?

  • Live Instructor led Training
  • Project & Practical Bundles
  • 24/7 Lifetime Support
  • Unlimited LMS Access
  • Virtual Environments

COURSE CURRICULUM:

Module Objective Topics
1: Introduction to Big Data & Hadoop In this module, you will be able to get big data context, its definition and its integration with Hadoop in terms of storage and processing. Big Data context with case studies, Big Data definition, structured vs unstructured, Business analytics lifecycle, Hadoop basics, Hadoop characteristics, Hadoop ecosystem, Hadoop core components, Secondary name node.
2: HDFS Internals and YARN In this module, you will learn the concept of Blocks, Rack awareness, HDFS architecture and a few Hadoop commands. This module also covers HDFS High Availability, HDFS Federation and YARN. Blocks, Data Replication and Rack Awareness, HDFS File Write Anatomy, HDFS File Read Anatomy, HDFS Architecture, Common Hadoop Shell Commands, High vailability, HDFS Federation, YARN, Firing our First Map Reduce Job, Checking the output of M/R Job and Understanding the dump of Map Reduce Job.
3: Introduction to Map Reduce In this module, you will learn about different Hadoop modes, configuration files, Split vs Blocks, traditional and Hadoop based distributed computing techniques. You will also learn advanced concepts like Combiners & Partitioners and Shuffle & Sort Hadoop Cluster Modes, Configuration Files, Web URLs, Split vs Blocks, Map Reduce use-cases, solving a problem in Traditional way, Understanding Map Reduce way, Map Reduce Anatomy, Advantages of Map Reduce, and Map Reduce Flow
4: Advanced Map Reduce Concepts In this module, you will learn the advanced concepts like MR Unit, Counters, Distributed Cache and Joins MR Unit, Counters, Distributed Cache, Joins, Secondary Sort and Total Order Sort.
5: Pig and Advance Pig In this module, you will be get a complete understanding of PIG & its association with Map Reduce. Pig Background, Need for Pig, Pig Vs M/R, Pig Definition, Pig Latin, Pig users, Pig usage at Yahoo, Pig Interaction Modes, Pig program execution, Pig data model, Pig data types, Pig operators and specialized Joins
6: Hive and Advance Hive In this module you will learn about Hive Background, Hive comparison with RDBMS and Hive design and architecture. Hive Background, Hive Definition, Pig vs Hive, RDBMS vs Hive, Hive components, Hive Architecture, Hive Meta Store, Hive Design, Hive Data Model, Partition and Buckets
7: Hbase and Advanced HBase In this module you will learn about NoSQL Background, Hive comparison with RDBMS and Hive design and architecture. NoSQL background and description, Real time scenarios, NoSQL landscapes, HBase definition, HBase characteristics, HBase history, HBase vs. RDBMS, HBase Data Model, HBase Data Model – Graphical representation, HBase Data Model – Logical Vs. Physical representation, Version concepts, Region and Region Servers and Zookeeper
8: Oozie and Scoop In this module, you will learn the concepts of Oozie as a Hadoop Workflow Framework and how it orchestrates the execution of Hadoop Components. Oozie workflow, Oozie server , Oozie co-ordinator, Oozie Bundles, Configuration XML and Properties file, Creating Oozie application, Oozie web console, Oozie scheduling, Scoop Setup between Hadoop and RDBMS, Exporting Data from Hadoop into RDBMS, Importing Data from RDBMS into Hadoop.
9: Zookeeper and Flume In this Module, you will learn about Zookeeper as a distributed cluster co-ordination system and how it can be used to keep clusters synchronized and avoid race conditions. You will also learn about Flume as a Hadoop sub-component to pull data from unstructured data sources. Zookeeper Master, Zookeeper Slave, concept of Ephemeral Nodes, Persistent and Optional Sequential Numbering, Configuration Management. Flume agent, Source, Sync, Defining the Flume flow, configuring individual components, configuring entire Flume set-up
10: Project Set-up Discussion In this session, you will get familiar with the project you would be working for your certification along with several other topics. Project Discussions, Evaluating Individual Approaches, Finalizing Optimal Approach

Reviews

There are no reviews yet.

Be the first to review “BIG Data & Hadoop”

Your email address will not be published. Required fields are marked *