Whizlabs
AWS: Feature Engineering Data Transformation & Integrity
Whizlabs

AWS: Feature Engineering Data Transformation & Integrity

Whizlabs Instructor

Instructor: Whizlabs Instructor

Included with Coursera Plus

Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

5 hours to complete
Flexible schedule
Learn at your own pace
Gain insight into a topic and learn the fundamentals.
Intermediate level

Recommended experience

5 hours to complete
Flexible schedule
Learn at your own pace

What you'll learn

  • Apply data cleaning, transformation, and feature engineering techniques to prepare datasets for machine learning.

  • Recognize methods to detect and reduce bias in data preparation and securely manage PII using AWS tools like DataBrew.

  • Implement ETL workflows using AWS Glue, Glue Crawlers, and DataBrew for data preparation.

  • Process large-scale datasets using Apache Spark on Amazon EMR for machine learning workloads.

Details to know

Shareable certificate

Add to your LinkedIn profile

Recently updated!

September 2025

Assessments

4 assignments

Taught in English

See how employees at top companies are mastering in-demand skills

 logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is part of the Exam Prep MLA-C01: AWS Machine Learning Engineer Assocaite Specialization
When you enroll in this course, you'll also be enrolled in this Specialization.
  • Learn new concepts from industry experts
  • Gain a foundational understanding of a subject or tool
  • Develop job-relevant skills with hands-on projects
  • Earn a shareable career certificate

There are 2 modules in this course

Welcome to Week 1 of the AWS: Feature Engineering, Data Transformation & Integrity course. This week, you’ll dive into the foundational steps of preparing high-quality data for machine learning workflows. We’ll begin with data cleaning and transformation techniques to ensure consistency and accuracy in your datasets. You’ll then explore feature engineering methods that help extract meaningful insights, followed by encoding techniques such as One-Hot Encoding, Label Encoding, and Tokenization to prepare categorical and textual data for modeling. Finally, we’ll focus on ensuring data integrity and fairness by learning how to address bias in data preparation and securely handle sensitive information (PII) using tools like AWS Glue DataBrew.

What's included

5 videos2 readings2 assignments1 discussion prompt

Welcome to Week 2 of the AWS: Feature Engineering, Data Transformation & Integrity course. This week, you'll dive into AWS-native tools for large-scale data processing and transformation. We’ll begin with AWS Glue, where you'll learn how to create Glue Crawlers, configure ETL jobs, and validate outputs for structured and semi-structured data. You'll explore AWS Glue DataBrew, a no-code tool that simplifies data profiling, cleaning, and transformation. We’ll also cover AWS Glue Data Quality to help ensure your datasets meet required standards for ML workflows. In the second half of the week, you’ll work with Amazon EMR to process massive datasets using Apache Spark. You'll launch EMR clusters, submit jobs, and transform data at scale — gaining hands-on experience with distributed data pipelines tailored for machine learning tasks.

What's included

10 videos3 readings2 assignments

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Instructor

Whizlabs Instructor
Whizlabs
132 Courses88,471 learners

Offered by

Whizlabs

Explore more from Algorithms

Why people choose Coursera for their career

Felipe M.
Learner since 2018
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Coursera Plus

Open new doors with Coursera Plus

Unlimited access to 10,000+ world-class courses, hands-on projects, and job-ready certificate programs - all included in your subscription

Advance your career with an online degree

Earn a degree from world-class universities - 100% online

Join over 3,400 global companies that choose Coursera for Business

Upskill your employees to excel in the digital economy

Frequently asked questions