Big Data-Hadoop Administration

Course Content


  • Big Data Facts
  • The three V’s of Big Data
  • What is Big Data?

Understanding Hadoop
  • What is Hadoop?
  • Why learn Hadoop?
  • Relational Database vs Hadoop
  • Motivation for Hadoop

HDFS (Hadoop Distributed File System)

  • What is HDFS?
  • HDFS Components
  • Understanding Block Storage
  • The Name Node
  • The Data Node
  • Data Node Failure
  • HDFS Commands
  • HDFS File Permissions

The MapReduce Framework
  • Overview of MapReduce
  • Understanding MapReduce
  • The Map Phase
  • The Reduce Phase
  • Word Count in MapReduce
  • Running a MapReduce Job

Planning for Hadoop Cluster

  • Single Node Cluster Configuration
  • Multi-Node Cluster Configuration

Cluster Maintenance
  • Checking HDFS Status
  • Breaking The Cluster
  • Copying Data between Clusters
  • Adding and Removing Cluster Nodes

Installing & Managing Hadoop Ecosystem Projects

  • Sqoop
  • Flume
  • Hive
  • Pig
  • HBase
  • Oozie

Managing & Scheduling Jobs
  • Managing Jobs
  • The FIFO Scheduler
  • The Fair Scheduler
  • How to start and stop jobs running on the cluster

Populating HDFS from External Sources
  • How to use Sqoop to import data from an RDBMS into HDFS
  • Features of Hive, HBase and Pig

Job Opportunities

IT and software companies are struggling to hire Hadoop talent. Organizations adopting Hadoop want assurance that the people they hire can handle petabytes, or even zettabytes, of data. CEG's training in the Big Data Hadoop framework for administrators serves as proof of capability and provides that assurance, making you a reliable and responsible custodian of their data.

Why CNC?
  • 1 Major Project
  • 1 Live Project (for Client)
  • 100% Job Assistance
  • An ISO 9001:2015 Certified Company
  • Running for 10 Years
  • Experienced Developers as Faculty

  • Course Duration: 60 Hours (60 Days)
    Track: Regular Track

    Email or Call 9649902444