Hadoop advanced

19,900.00₹

2 students enrolled

Hadoop Introduction

• What is Big Data?
• Source of Data
• Characteristics of Big Data
• Benefits of Big Data analysis
• Challenges in Big Data processing
• Why Hadoop for Big Data?
• Introduction to Hadoop
• Hadoop not good for …
• Hadoop Ecosystem
• Hadoop Installation

Hadoop Distribute File System (HDFS)

• Hadoop Distributed File System (HDFS)
• HDFS Architecture
• Types of Nodes in HDFS
• Data Flow
• HDFS Block
• HDFS Federation
• HDFS High Availability (HA)
• HDFS Commands
• Hadoop Archives
• HDFS Accessibility

MapReduce Framework

• MapReduce Introduction
• How does MapReduce work?
• MapReduce Program
• MapReduce program execution
• MapReduce program Unit Testing
• Behind the Scenes : MapReduce
• Hadoop streaming
• Combiner
• Partitioner
• Counters

Hive

• Hive Introduction
• Installing & Running Hive
• Hive Components
• Hive Metastore
• HiveQL
• Hive Data Model
• Querying Data
• User-Defined Functions

Pig

• Pig Introduction
• How it works?
• Execution Types
• Running Pig Programs
• Pig Latin
• User-Defined Functions
• Data Processing Operators
• Pig Best Practices

Additional Concepts

• Introduction to HBase
• HBase Architecture
• HBase Practical
• Introduction to SQOOP
• Import data into Hadoop using SQOOP
• Introduction to Flume
• Practical example with Flume

  •   0/6

    • Additional Concepts
    • Hadoop Distribute File System (HDFS)
    • Hadoop Introduction – big data
    • Hive
    • MapReduce Framework
    • Pig