image
The Ultimate Drawing Course Beginner to Advanced...
$179
$79
image
User Experience Design Essentials - Adobe XD UI UX...
$179
$79
Total:
$659

Description

What is ETL?
The ETL (extract, transform, load) process is the most popular method of collecting data from multiple sources and loading it into a centralized data warehouse. ETL is an essential component of data warehousing and analytics.
Why Pentaho for ETL?
Pentaho has phenomenal ETL, data analysis, metadata management and reporting capabilities. Pentaho is
faster
than other ETL tools (including Talend). Pentaho has a user-friendly GUI which is
easier
and takes less time to learn. Pentaho is
great for beginners
. Also, Pentaho Data Integration (PDI) is an important skill in data analytics field.
How much can I earn?
In the US, median salary of an ETL developer is $74,835 and in India average salary is Rs. 7,06,902 per year. Accenture, Tata Consultancy Services,

Cognizant Technology Solutions, Capgemini, IBM, Infosys etc. are major recruiters for people skilled in ETL tools; Pentaho ETL is one of the most sought-after skills that recruiters look for. Demand for Pentaho Data Integration (PDI) techniques is increasing day after day.
What makes us qualified to teach you?
The course is taught by Abhishek and Pukhraj. Instructors of the course have been teaching Data Science and Machine Learning for over a decade. We have experience in teaching and implementing Pentaho ETL, Pentaho Data Integration (PDI) for data mining and data analysis purposes.
We are also the creators of some of the most popular online courses - with over 150,000 enrollments and thousands of 5-star reviews like these ones:
I had an awesome moment taking this course. It broaden my knowledge more on the power use of Excel as an analytical tools. Kudos to the instructor! - Sikiru
Very insightful, learning very nifty tricks and enough detail to make it stick in your mind. - Armand
Our Promise
Teaching our students is our job and we are committed to it. If you have any questions about the course content on Pentaho, ETL, practice sheet or anything related to any topic, you can always post a question in the course or send us a direct message.
Download Practice files, take Quizzes, and complete Assignments
With each lecture, there is a practice sheet attached for you to follow along. You can also take quizzes to check your understanding of concepts on Pentaho, ETL, Pentaho Data Integration, Pentaho ETL. Each section contains a practice assignment for you to practically implement your learning on Pentaho, ETL, Pentaho Data Integration, Pentaho ETL. Solution to Assignment is also shared so that you can review your performance.
By the end of this course, your confidence in using Pentaho ETL and Pentaho Data Integration (PDI) will soar. You'll have a thorough understanding of how to use Pentaho for ETL and Pentaho Data Integration (PDI) techniques for study or as a career opportunity.
Go ahead and click the enroll button, and I'll see you in lesson 1 of this Pentaho ETL course!
Cheers
Start-Tech Academy
Who this course is for:
Students who want to have a career in the field of Data warehouse/ETL developer
ETL developers and data process automation developers
Business managers who want to understand the entire ETL process and become capable of implementing it

What you'll learn

Understanding of the entire data integration process using PDI

Extracting data from all popular data sources including Excel, JSON, Zipped files, TXT files and even cloud storage

Cleaning the data using Pentaho Data Integration

Applying business rules on the data in PDI

Different types of Data transformations

Loading the data into different formats

Managing SQL database using PDI

Metadata Injection - a powerful tool offered by PDI

Understanding of the concepts of data marts and data warehouse

Requirements

  • You will need a copy of Adobe XD 2019 or above. A free trial can be downloaded from Adobe.
  • No previous design experience is needed.
  • No previous Adobe XD skills are needed.

Course Content

27 sections • 95 lectures
Expand All Sections
1-Introduction
2
1.1-Welcome to the course
1.2-Course Resources
2-Pentaho Data Integration (PDI) Installation and Setup
4
2.1-Setting up environment and installing PDI
2.2-This is a milestone!
2.3-Opening Spoon - The Graphical UI
2.4-Quiz
3-A Simple ETL Demonstration
4
3.1-The example problem statement
3.2-Demonstration of a PDI transformation
3.3-Demonstration of a PDI Job
3.4-Quizzes
4-Basic concepts - Theory for foundational understanding
5
4.1-What is ETL?
4.2-Check your understanding
4.3-Data Warehouse, Ops Database and Data mart
4.4-Inmon vs Kimball Architecture
4.5-ETL vs ELT
5-The ETL process: The practical part begins here
2
5.1-Data and the ETL process
5.2-Quizzes
6-DATA EXTRACTION: Extracting tabular data
6
6.1-Manually entering data into PDI
6.2-Inputting Data from a TXT (text) file
6.3-Input from multiple CSV files at the same time
6.4-Inputting Data from an Excel file
6.5-Extracting Data from Zipped files
6.6-Quizzes
7-DATA EXTRACTION: Extracting non-tabular data
2
7.1-Extracting from XML
7.2-Extracting from JSON
8-Extracting from an SQL table
4
8.1-Plan for importing sales data
8.2-Installing PostgreSQL and pgAdmin in your System
8.3-Creating Sales table in SQL
8.4-Extracting from an SQL table
9-Storing and Retrieving Data from Cloud storage
2
9.1-Storing Data on AWS S3
9.2-Reading data from AWS S3
10-Merging Data Streams
6
10.1-Concepts: Merging Data Streams
10.2-Sorted Merge Step - Merging customer data
10.3-Merging product data
10.4-Time to check your understanding
10.5-Append data stream - merging sales data
10.6-Time to check your understanding
11-Data Cleansing
11
11.1-Introduction to Data Cleansing
11.2-Value Mapper Step
11.3-Replace in String Step
11.4-Time to check your understanding
11.5-Fuzzy Match concepts
11.6-Fuzzy Match Step in PDI
11.7-Fuzzy Match Algorithms
11.8-Time to check your understanding
11.9-Formula Step and changing data format
11.10-Common Data Cleaning Steps
11.11-Quiz
12-Data Validation
6
12.1-Introduction to Data validation
12.2-Data_validation 1 - String-to-Int and integer range validations
12.3-Data validation 2 - Checking Reference Values using stream look-up
12.4-Data validation 3 - Order date < shipping date using calculator step
12.5-Common Data Validation steps
12.6-Quiz
13-Error Handling
6
13.1-Correcting the errors and merging with main stream
13.2-Time to check your understanding
13.3-Writing the errors to the log
13.4-Time to check your understanding
13.5-Writing the errors to a separate file
13.6-Time to check your understanding
14-Transformation and Analytics steps
5
14.1-Concatenating Address Fields
14.2-Data Aggregation using Group-by
14.3-Normalization and Denormalization
14.4-Number Range Step
14.5-Quiz
15-PDI SQL Connection
4
15.1-Introduction to PDI - SQL connection
15.2-Reading and filtering data from DB into PDI
15.3-Updating and Inserting data into DB from PDI
15.4-Deleting data from SQL DB using PDI
16-Conceptual understanding for Loading Data
7
16.1-Facts and Dimensions tables
16.2-Time to check your understanding
16.3-Surrogate Keys in Dimension tables
16.4-Type 1 & 2 Slowly Changing Dimensions
16.5-Time to check your understanding
16.6-Schemas
16.7-Quiz
17-Loading the data into a Data Mart
4
17.1-Creating tables in DB
17.2-Loading Customer Data using combination lookup/ update step
17.3-Loading product data using dimension lookup step
17.4-Loading sales data after database lookup steps
18-Running Java and Javascript
1
18.1-Scripting Steps
19-PDI Jobs
7
19.1-PDI Jobs vs Transformation
19.2-Controlling the flow of execution
19.3-Setting variables using set variables step
19.4-File and Folder Management
19.5-Sending Email Step
19.6-Abort Job Step
19.7-Time to check your understanding
20-Scheduling a job for production environment
1
20.1-Running using command prompt and scheduling
21-Metadata injection
1
21.1-Metadata injection
22-Regex Notation
1
22.1-Regular Expressions for advanced String Matching
23-Congratulations and about your certificate
4
23.1-Alternative to Pentaho
23.2-The final milestone!
23.3-About your certificate
23.4-Bonus Lecture