IT manager

Training Course: Cloudera Certified Associate (CCA) Data Analyst



IT234731

10 - 21 Nov 2024

Dubai (UAE)

Hotel : Residence Inn by Marriott Sheikh Zayed Road, Dubai

Cost: €7300

Introduction

This Cloudera Certified Associate (CCA) Data Analyst training course teaches you to apply traditional data analytics and business intelligence skills to big data. It presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages.

Advance Your Ecosystem Expertise

Apache Hive makes transformation and analysis of complex, multi-structured data scalable in Cloudera environments. Apache Impala enables real-time interactive analysis of the data stored in Hadoop using a native SQL environment. Together, they make multi-structured data accessible to analysts, database administrators, and others without Java programming expertise.

Course Objectives of Cloudera Certified Associate (CCA) Data Analyst

Throughout this training course participants will get to know:

  • How the open source ecosystem of big data tools addresses challenges not met by traditional RDBMSs
  • Using Apache Hive and Apache Impala to provide SQL access to data
  • Hive and Impala syntax and data formats, including functions and subqueries
  • Create, modify, and delete tables, views, and databases; load data; and store results of queries
  • Create and use partitions and different file formats
  • Combining two or more datasets using JOIN or UNION, as appropriate
  • What analytic and windowing functions are, and how to use them
  • Store and query complex or nested data structures
  • Process and analyze semi-structured and unstructured data
  • Techniques for optimizing Hive and Impala queries
  • Extending the capabilities of Hive and Impala using parameters, custom file formats and SerDes, and external scripts
  • How to determine whether Hive, Impala, an RDBMS, or a mix of these is best for a given task

Target Audience of Cloudera Certified Associate (CCA) Data Analyst  

This course is designed for:

  • Data analysts
  • Business intelligence specialists
  • Developers
  • System architects
  • Database administrators

Prerequisites of Cloudera Certified Associate (CCA) Data Analyst

Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. Prior knowledge of Apache Hadoop is not required.

Course Outline for Cloudera Certified Associate (CCA) Data Analyst

Day 1

Apache Hadoop Fundamentals

  • The Motivation for Hadoop
  • Hadoop Overview
  • Data Storage: HDFS
  • Distributed Data Processing: YARN, MapReduce, and Spark
  • Data Processing and Analysis: Hive and Impala
  • Database Integration: Sqoop
  • Other Hadoop Data Tools
  • Exercise Scenario Explanation

Day 2

Introduction to Apache Hive and Impala

  • What Is Hive?
  • What Is Impala?
  • Why Use Hive and Impala?
  • Schema and Data Storage
  • Comparing Hive and Impala to Traditional Databases
  • Use Cases

Day 3

Querying with Apache Hive and Impala

  • Databases and Tables
  • Basic Hive and Impala Query Language Syntax
  • Data Types
  • Using Hue to Execute Queries
  • Using Beeline (Hive's Shell)
  • Using the Impala Shell

Day 4

Common Operators and Built-In Functions

  • Operators
  • Scalar Functions
  • Aggregate Functions

Data Management

  • Data Storage
  • Creating Databases and Tables
  • Loading Data
  • Altering Databases and Tables
  • Simplifying Queries with Views
  • Storing Query Results

Day 5

Data Storage and Performance

  • Partitioning Tables
  • Loading Data into Partitioned Tables
  • When to Use Partitioning
  • Choosing a File Format
  • Using Avro and Parquet File Formats

Day 6

Working with Multiple Datasets

  • UNION and Joins
  • Handling NULL Values in Joins
  • Advanced Joins

Analytic Functions and Windowing

  • Using Common Analytic Functions
  • Other Analytic Functions
  • Sliding Windows

Day 7

Complex Data

  • Complex Data with Hive
  • Complex Data with Impala

Analyzing Text

  • Using Regular Expressions with Hive and Impala
  • Processing Text Data with SerDes in Hive
  • Sentiment Analysis and n-grams

Day 8

Apache Hive Optimization

  • Understanding Query Performance
  • Bucketing
  • Hive on Spark

Apache Impala Optimization

  • How Impala Executes Queries
  • Improving Impala Performance

Days 9 & 10

Extending Apache Hive and Impala

  • Custom SerDes and File Formats in Hive
  • Data Transformation with Custom Scripts in Hive
  • User-Defined Functions
  • Parameterized Queries

Choosing the Best Tool for the Job

  • Comparing Hive, Impala, and Relational Databases
  • Which to Choose?

  About Dubai

Dubai, located on the Persian Gulf, is one of the seven emirates of the United Arab Emirates and one of the most popular tourist destinations in the world. The discovery of oil in the region has made Dubai extremely wealthy, allowing it to build the glittering skyscrapers that it is now famous for. That wealth is strongly in evidence in Dubai, and visitors will see luxurious buildings and supercars aplenty. Perfect beaches and endless shopping opportunities are key to Dubai's attractions. Flights to Dubai open up the city's cultural attractions to tourists, with beautiful mosques, museums and art galleries scattered throughout this ultra-modern metropolis.


  Things to do and places to visit in Dubai

Dubai's wealth has made it famous for building ever taller buildings and creating artificial islands off its shores. The city's hotels are luxurious and shoppers will love its extensive shopping malls which showcase all the world's top brands. Dubai's attractions don't end there. Dubai also caters to adventure lovers, who can jump in a 4x4 or on a board to speed over dunes outside the city. Local culture mustn't be forgotten either, and visitors have wonderful mosques to visit and old districts to explore. All that combined means that a flight to Dubai is sure to lead to an unforgettable holiday.

When visiting Dubai, be sure to:

  • Go to the observation deck of the Burj Khalifa, the tallest building in the world.
  • Admire the intricately beautiful Grand Mosque, which has the tallest minaret in the city.
  • Understand the local history and culture with a visit to the Dubai Museum.
  • Discover objects from the 6th century at Jumeirah Archaeological Site.
  • Go skiing. That's not a joke: the Mall of the Emirates houses a snowdome.
  • Go shopping at the Mall of the Emirates or the Dubai Mall.
  • Explore the desert surrounding the city – either by 4x4 or atop a camel.
  • Eat fantastic seafood at Dubai Marina.
  • Cool off at the Wild Wadi Waterpark.
  • Marvel at gorgeous Arabic calligraphy at Jumeirah Mosque, the biggest in the city.
  • Take a yacht tour around the artificial islands of Palm Jumeirah.
  • Haggle for souvenirs in one of the city's souks.
  • Wander around the traditional buildings in Bastakiya District.
Copyright Global Horizon Training Center © 2019