Big Data Training

Available at our training facilities, in the cloud, or on-site at your location.

What makes our training special
  • 3 delivery options (in our NY office, in the cloud, or as a private class)
  • All courses have live instructors
  • Valuable reference books (course & lab books)
  • Certificate of Completion

Course Title

Advanced Big Data Testing using Hive and HQL

Course Code

BD103

Length

1 day

Price

$495 now only $375*

*Extended: Price valid until 01/31

Course Summary

This one-day course combines lectures and hands-on training to give students the advanced techniques needed to test big data environments. It covers advanced HQL transformations and the challenges they pose when testing big data scenarios.

Intended Audience
  • Data Quality Teams
  • Data Warehouse Analysts
  • Automation Engineers
  • Quality Assurance Analysts
  • Project Managers
  • Anyone involved in assuring software quality for big data projects
Course Objectives

At the end of the course, you will be able to:

  • understand big data structures and architectures
  • implement a successful process for big data testing
  • create and execute more sophisticated transformation tests
  • utilize regular expressions for data comparisons
  • create and utilize subqueries
  • work with derived tables and inline views
  • take advantage of advanced techniques for big data testing
  • create tests for unstructured or semi-structured data
Prerequisites
  • Understanding of basic ETL testing processes
  • Basic HQL knowledge, or completion of Introduction to Big Data Testing using HQL
Course Outline

Big Data Overview

  • Understanding Big Data Architecture
  • Understanding the Challenges of Big Data Testing
  • Understanding ETL Mapping Documents
  • Overview of Transformation Types
  • Big Data Comparison Methods

Calculated Fields Transformation Test

  • Aggregate functions with the GROUP BY clause
  • Compare calculated source fields, with grouping, to the target field
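
As a rough illustration of this kind of test, the sketch below sums a source column per group and lists the groups whose total differs from the target. It uses Python's built-in sqlite3 module as a stand-in for Hive so that it runs anywhere. The table and column names (src_orders, tgt_totals, customer_id, amount, total_amount) are hypothetical, and the query sticks to standard SQL constructs (SUM, GROUP BY, LEFT JOIN).

```python
import sqlite3

def find_total_mismatches(conn):
    """Return (customer_id, source_total, target_total) rows that disagree."""
    query = """
        SELECT s.customer_id, s.src_total, t.total_amount
        FROM (
            -- calculated field: total amount per customer in the source
            SELECT customer_id, SUM(amount) AS src_total
            FROM src_orders
            GROUP BY customer_id
        ) s
        LEFT JOIN tgt_totals t ON s.customer_id = t.customer_id
        WHERE t.total_amount IS NULL OR s.src_total <> t.total_amount
        ORDER BY s.customer_id
    """
    return conn.execute(query).fetchall()

# Hypothetical sample data: customer 2 has a wrong total, customer 3 is missing.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src_orders (customer_id INTEGER, amount INTEGER);
    CREATE TABLE tgt_totals (customer_id INTEGER, total_amount INTEGER);
    INSERT INTO src_orders VALUES (1, 10), (1, 15), (2, 7), (3, 4);
    INSERT INTO tgt_totals VALUES (1, 25), (2, 9);
""")
mismatches = find_total_mismatches(conn)
print(mismatches)
```

An empty result means every grouped source total matches its target field; any returned row points at a group to investigate.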

Derived Fields Transformation Test

  • Discuss the differences between calculated and derived fields
  • Implement variations of subqueries (nested, scalar, correlated, non-correlated, inline)
  • Compare a target field to a derived field from the source data

Field Length Limits Transformation Test

  • Get table information using the DESCRIBE command
  • Calculate the maximum size of field merges
  • Calculate the maximum size of field splits
  • Validate the maximum size of source data split into separate fields in the target database

Field Padding Transformation Test

  • String Padding Functions
  • SQL Regular Expression Functions
  • Verify that erroneous source data has been padded correctly in the target table
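
A minimal sketch of a padding check, written in plain Python rather than HQL so that it is self-contained. It assumes a hypothetical rule that source values are trimmed and left-padded with zeros to an 8-character target field. The width, the rule, and the sample pairs are illustrative assumptions, not part of the course material.

```python
import re

WIDTH = 8  # hypothetical width of the target field
PADDED = re.compile(r"[0-9]{8}")  # target must be exactly 8 digits

def check_padding(pairs):
    """Return (source, target, expected) for each pair that fails the rule."""
    failures = []
    for src, tgt in pairs:
        # Expected value: trimmed source, left-padded with zeros to WIDTH.
        expected = src.strip().rjust(WIDTH, "0")
        if tgt != expected or not PADDED.fullmatch(tgt):
            failures.append((src, tgt, expected))
    return failures

# Hypothetical (source, target) pairs; the third one was never padded.
pairs = [
    ("42", "00000042"),
    (" 1234", "00001234"),
    ("777", "777"),
    ("12345678", "12345678"),
]
failures = check_padding(pairs)
print(failures)
```

The regular expression catches targets that have the right length but contain unexpected characters, which a simple length comparison would miss.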

XML Transformation Test

  • Usage of the Extract function
  • Discuss relevance of XPATH
  • Database-specific casting functions
  • Utilizing XML functions to form result set from XML content
  • Compare source tables to XML content in a target table

Transpose Transformation Test

  • Utilization of Self Joins
  • Compare transposed source data to a target table
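
One way such a test might look: the sketch below uses a self join to turn quarterly source rows into one row per store, then compares the result with the target table. Python's built-in sqlite3 module stands in for Hive, and the tables and columns (src_sales, tgt_sales, store_id, quarter, sales) are hypothetical.

```python
import sqlite3

# Self join: pair each store's Q1 row with its Q2 row to build one wide row.
TRANSPOSE_QUERY = """
    SELECT q1.store_id, q1.sales AS q1_sales, q2.sales AS q2_sales
    FROM src_sales q1
    JOIN src_sales q2 ON q1.store_id = q2.store_id
    WHERE q1.quarter = 'Q1' AND q2.quarter = 'Q2'
"""

def transpose_mismatches(conn):
    """Return rows present in only one of: transposed source, target."""
    expected = set(conn.execute(TRANSPOSE_QUERY).fetchall())
    actual = set(conn.execute(
        "SELECT store_id, q1_sales, q2_sales FROM tgt_sales").fetchall())
    return sorted(expected ^ actual)

# Hypothetical sample data: store 2 has a wrong Q2 value in the target.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src_sales (store_id INTEGER, quarter TEXT, sales INTEGER);
    CREATE TABLE tgt_sales (store_id INTEGER, q1_sales INTEGER, q2_sales INTEGER);
    INSERT INTO src_sales VALUES (1, 'Q1', 100), (1, 'Q2', 120),
                                 (2, 'Q1', 50),  (2, 'Q2', 60);
    INSERT INTO tgt_sales VALUES (1, 100, 120), (2, 50, 65);
""")
print(transpose_mismatches(conn))
```

Each mismatch appears twice in the output, once as the expected row and once as the actual target row, which makes the differing column easy to spot.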

Match and Merge Transformation Test

  • Utilization of Unions
  • Compare multiple source records that need to be matched and then merged into a target table
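
As an illustrative sketch, the check below uses UNION to build the de-duplicated set of keys expected after a merge and compares it with the target. Python's built-in sqlite3 module stands in for Hive. The tables (src_crm, src_billing, tgt_customers) and the choice of a lower-cased email as the match key are assumptions made for the example.

```python
import sqlite3

def merge_check(conn):
    """Return (missing, unexpected, duplicated) keys in the target table."""
    # UNION (not UNION ALL) removes duplicates, giving one key per customer.
    expected = {row[0] for row in conn.execute("""
        SELECT LOWER(email) FROM src_crm
        UNION
        SELECT LOWER(email) FROM src_billing
    """)}
    target_rows = [row[0] for row in conn.execute(
        "SELECT LOWER(email) FROM tgt_customers")]
    target = set(target_rows)
    duplicated = {e for e in target if target_rows.count(e) > 1}
    return sorted(expected - target), sorted(target - expected), sorted(duplicated)

# Hypothetical sample data: "cy" was never merged and "bob" was loaded twice.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE src_crm (email TEXT);
    CREATE TABLE src_billing (email TEXT);
    CREATE TABLE tgt_customers (email TEXT);
    INSERT INTO src_crm VALUES ('Ann@example.com'), ('bob@example.com');
    INSERT INTO src_billing VALUES ('ann@example.com'), ('cy@example.com');
    INSERT INTO tgt_customers VALUES ('ann@example.com'), ('bob@example.com'),
                                     ('bob@example.com');
""")
print(merge_check(conn))
```

Three empty lists mean every matched source record landed in the target exactly once.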

Registering 3+ people? Receive 10% off!
(No promo code needed)

Upcoming Course Schedule

Start Date: Feb 08th, 2019
Time: 9:30AM - 4:30PM Eastern
Location: In The Cloud, Web

Start Date: May 10th, 2019
Time: 9:30AM - 4:30PM Eastern
Location: New York, NY

Start Date: Aug 30th, 2019
Time: 9:30AM - 4:30PM Eastern
Location: In The Cloud, Web
