Big Data with Hadoop

Start Date Batch Name Timings Days Duration Mode Amount Slots Available Cart
Nov 12,2018 Nov 2018 05:00 PM - 06:00 PM (PST) Monday - Friday 8 Weeks Online (Instructor Led) $499 ($399.00) 25

Big Data with Hadoop

The online Big data and Hadoop certification course also covers real life use-cases, multiple POCs, live Hadoop project and create foundation of Apache Spark for distributed data processing.

  • Curriculumestimated time

  • Introduction to Big Data and Hadoop
    • Common Big Data Customer Scenarios
    • Core Components of Hadoop
    • Hadoop Cluster Modes
  • Hadoop Distributed File System
    • HDFS Files and Blocks
    • Replication across multiple data node centers
    • HDFS read/write Operations
    • Configuration Files in Hadoop
    • HDFS Admin Commands
  • MapReduce
    • Data Flow in MapReduce
    • MapReduce Programming Model
    • Map and Reduce Operations
    • Job Submission Flow of MapReduce
    • Functioning of Job Tracker and Task Tracker
  • Advanced MapReduce I
    • GenericOptionsParser, Tool and ToolRunner
    • Writables in Hadoop
    • JUnit and MRUnit Testing
    • Counters and Schedulers
    • Data compression Techniques in Hadoop
  • Advanced MapReduce II
    • Profiling Map and Reduce Tasks
    • Custom Practitioner in MapReduce
    • Data Serialization using Protocol Buffers, Thrift and Avro
    • Joining in MapReduce: MapSide and ReduceSide
  • Apache Pig
    • Group, JOIN and COGROUP operator
    • Pig Latin-File Loaders
    • Pig Latin-Creating UDF
  • Apache Hive
    • Hive Metastore and Hive QL
    • External tables, HiveQL: Data Manipulation and Queries
    • Different kinds of UDF’s in Hive.
  • Apache Zookeeper
    • Group Membership in Zookeeper
    • Zookeeper API
    • Zookeeper Service : Data Model, Operation, Implementation
    • Zookeeper State transitions
  • Apache HBase
    • Heartbeat Sends by RegionServers
    • Working of HBaseArchitecture
    • Compaction in HBase
    • Data Loading Techniques used in HBase
  • Apache Sqoop
    • Sqoop Connectors
    • Sqoop Commands
    • SqoopImport and Export Processes
    • Import Data from MySQL to HDFS, Hive and HBase
    • SqoopConnectors
  • Hadoop 2.x with YARN
    • Hadoop 2.x Cluster Architecture
    • Different Processing Applications in YARN
    • Job Execution Flow on YARN
  • Apache Oozie
    • OozieArchitecture
    • OozieWorkflow, OozieCoordinator and OozieBundle Jobs
    • Action and Control Nodes in Oozie
    • OozieCoordinator Lifecycle Operations
  • Cloudera Impala and Project
    • Impala Architecture
    • Impala Storage
    • Working of ClouderaImpala Engine
    • File Formats, supported by Impala