Кому следует посетить
-
Big data practitioners
-
Big data related industry practitioners
Предварительные требования
-
Have basic knowledge of Linux
-
With IT project experience
-
Have Hadoop basics
Цели курса
On completion of this program, the participants will be able to:
-
Master the principles of big data components
-
Master the usage of big data components
Classroom training
Длительность 5 дней
Цена
Currently no classroom training dates
Online training
Длительность 5 дней
Цена
Currently no online training dates
* Расчеты в рублях по курсу ЦБ РФ
Программа курса
1.MapReduce - Distributed Off-line Batch Processing and Yarn - Resource Negotiator
-
Introduction to MapReduce and YARN
-
Functions and Architectures of MapReduce and YARN
-
Resource Management and Task Scheduling of YARN
-
Enhanced Features
2.HBase - Distributed NoSQL Database
-
Introduction to HBase
-
Functions and Architecture of HBase
-
Key Processes of HBase
-
Huawei Enhanced Features of HBase
3.HDFS - Hadoop Distributed File System
-
HDFS Overview and Application Scenarios
-
Position of HDFS in FusionInsight HD
-
HDFS System Architecture
-
Key Features
4.Streaming - Distributed Stream Computing Engine
-
Introduction to Streaming
-
System Architecture
-
Key Features
-
Introduction to StreamCQL
5.Kafka - Distributed Message Subscription System
-
Introduction to Kafka
-
Architecture and Functions of Kafka
-
Key Processes of Kafka
6.Zookeeper - Cluster Distributed Coordination Service
-
Introduction to ZooKeeper
-
Position of ZooKeeper in FusionInsight
-
System Architecture
-
Key Features
-
Relationship with Other Components
7.Big Data Industry and Technological Trends
-
Big Data Era
-
Big Data Application Scope
-
Opportunities and Challenges in the Big Data Era
-
Huawei Big Data Solution
8.FusionInsight HD Solution Overview
-
FusionInsight Overview
-
FusionInsight Features
-
Success Cases of FusionInsight
9.Flume - Massive Logs Aggregation
-
Flume Overview and Architecture
-
Key Characteristics of Flume
-
Flume Applications
10.Hive - Distributed Data Warehouse
-
Introduction to Hive
-
Hive Functions and Architecture
-
Basic Hive Operations
11.Spark2x - In-memory Distributed Computing Engine
-
Spark Overview
-
Spark Principles and Architecture
-
Spark Integration in FusionInsight HD
12.Loader - Data Transformation
-
Introduction to Loader
-
Loader Job Management
13.Flink – Stream Processing and Batch Processing Platform
-
Flink Overview
-
Technical Principles and Architecture of Flink
-
Flink Integration in FusionInsight HD