Big Data Training

Available at our training facilities, in the cloud, or on-site at your location.

What makes our training special
  • 3 Delivery Options (in our NY office, in the Cloud, or Private class) more>>
  • All courses have Live Instructors more>>
  • Valuable reference books (course & lab books)
  • Certificate of Completion click to view>>

Course Title

Big Data and ETL Testing Fundamentals

Course Code



1 day


$495 now only $375*

*Extended: Price valid until 02/29

Course Summary

This one day course is designed to familiarize business professionals in the Big Data and ETL space with the basics of testing and validating. This course focuses on getting professionals the knowledge required in order to successfully test and validate Big Data and ETL processes.

Intended Audience
  • Manual Testers
  • Automation Engineers
  • Quality Assurance Analysts
  • Developers
  • Project Managers
  • anyone involved with providing software quality for Big Data 
Course Objectives

At the end of the course, you will be able to:

  • Describe the purpose of a Big Data and the ETL process
  • Determine an appropriate testing strategy
  • Understand a source-target mapping document
  • Describe an approach to test each business rule
  • Recognize the different testing methods
  • Determine appropriate sample sizes and data permutations
  • Explain the different data error types
  • Have knowledge of the different testing tools
  • Understand the importance automated testing
Course Outline


  • Data Concepts
  • Big Data Concepts
  • What is ETL?
  • What is Business Intelligence (BI)?
  • What is Data Science and Analytics?
  • Transactional vs. Analytical Databases vs. Big Data Stores
  • Big Data Concepts
  • What is the Hadoop Ecosystem?
  • How does Hadoop Process Big Data?
  • Resources Types Involved
  • Main Structures
  • Introduction
    • Test points and legs
    • Single Leg strategy
    • Multi leg strategy
    • Single Leg vs Multi Leg
  • Principles of ETL Testing

  • Data Mapping Document
  • Testing methods
    • Visual Compare
    • Record Counts
    • Minus Queries
    • Automation
  • Working with flat files
    • Excel Files
    • Comma delimited files
    • Fixed width files
    • XML Files
  • How much data to test?
  • Testing incremental loads
  • Multiple Sources
    • Selective column and row type
    • Translation
    • Lookups
    • Transpose
    • Field Splitting
    • Field Merging
    • Calculated and Derived
    • Table Splitting
    • Assess: Test Strategy
      • Data Permutations
      • Test Data Sampling
      • Test Points
      • Leveraging Test Tools
    • Plan: Test Planning
      • Test List
      • Resource Estimation
      • Prioritizing
      • Scheduling
      • Defect workflow
      • Test Plan
    • Design: Test Case authoring
      • ETL Manual Test Creation
    • Visual Compare
    • Record Counts
    • Minus Queries
    • Home Grown
      • ETL Automated Test Creation
    • QuerySurge
      • BI Report Test Creation
    • Execute: Test Case Execution
      • Manual Tests
      • Automated Tests
    • Evaluate and Improve
      • Lessons learned
      • New Test cases
    • Missing Data
    • Truncation
    • Type Mismatch
    • Null Translation
    • Misplaced Data
    • Extra records
    • Logic Issues
    • Duplicate Records
    • Precision
    • Sequence
    • Rejected Rows
    • Undocumented Requirements
    • Simple Issues
  • Transformation Types

    Testing Process

    Defect Types

Registering 3+ people? Receive 10% off!
(No promo code needed)

Upcoming Course Schedule

Start Date: May 20th, 2020
Time: 9:30AM - 5:00PM Eastern
Location: New York, NY


Are you interested in learning more or have additional questions?
Please fill out the form below and we will gladly assist you.