
Data Cleaning Basics: Fix Missing Values, Duplicates & Typos

Data Science & Analytics — Beginner


Turn messy spreadsheets into trustworthy data—step by step.

Beginner data-cleaning · missing-values · duplicates · typos

Make messy data usable—even if you’ve never done this before

Real-world data is rarely neat. It often arrives as a spreadsheet with empty cells, repeated rows, inconsistent labels, and small typing mistakes that quietly break totals, charts, and decisions. This beginner course is a short, book-style guide that teaches data cleaning from the ground up—using plain language and simple steps you can apply in Excel or Google Sheets.

You’ll learn how to recognize the most common issues that appear in everyday datasets: missing values, duplicates, and typos. More importantly, you’ll learn safe ways to fix them without “guessing” or accidentally changing meaning. The goal is not perfection—it’s trustworthy, consistent data that is good enough for analysis and reporting.

What you’ll be able to do by the end

By the final chapter, you’ll have a repeatable workflow you can use on almost any spreadsheet dataset. You’ll know how to inspect data before changing it, apply clear rules for fixes, and verify that your changes improved quality.

  • Identify missing values and decide when to leave, remove, or fill them
  • Find duplicates and keep the correct record (not just delete blindly)
  • Fix typos and standardize text so categories group correctly
  • Run simple “before vs. after” checks to confirm your results
  • Document your cleaning steps so others can trust your work

How the course is structured (like a short technical book)

This course is organized into exactly six chapters. Each chapter builds on the last, so you start with the idea of “clean data,” then learn to profile a dataset, then fix one problem type at a time, and finally combine everything into a single workflow.

You’ll begin with first principles: what rows and columns represent, what counts as an error, and why making a safe copy matters. Next, you’ll learn quick ways to scan a dataset for trouble spots and record what you find. Then you’ll tackle missing values, duplicates, and typos using simple rules and checks designed for beginners. The final chapter brings it all together into a repeatable checklist and a small quality report you can reuse at school, at work, or on home projects.

Who this is for

This course is for absolute beginners—no coding, no statistics, and no prior data science experience required. If you can open a spreadsheet, sort a column, and save a copy of a file, you can follow along.

  • Students learning analytics for the first time
  • New analysts and operations staff working with spreadsheets
  • Teams in business or government who need reliable reporting
  • Anyone who wants fewer errors and more confidence in their data

Get started

If you want to turn messy spreadsheets into clean, dependable datasets, this course will guide you step by step. Register free to begin, or browse all courses to find the right learning path for your goals.

What You Will Learn

  • Explain what data cleaning is and why it affects results and decisions
  • Spot common data problems: missing values, duplicates, typos, and inconsistent formats
  • Choose safe ways to handle missing values (leave, remove, fill) for simple datasets
  • Find and remove duplicates while keeping the correct record
  • Standardize text and fix common typos using simple, repeatable rules
  • Create a basic data cleaning checklist you can reuse on any spreadsheet
  • Document what you changed so others can trust and reproduce your work

Requirements

  • No prior AI, coding, or data science experience required
  • Basic computer skills (copy/paste, saving files)
  • Access to a spreadsheet tool (Excel, Google Sheets, or similar)
  • Willingness to practice with small sample datasets

Chapter 1: What “Clean Data” Means (and Why It Matters)

  • Understand messy vs. clean data with everyday examples
  • Learn the costs of dirty data (wrong totals, wrong decisions)
  • Meet the three core problems: missing values, duplicates, typos
  • Set up your first cleaning workspace and make a safe copy
  • Build your first simple “data dictionary” (what columns mean)

Chapter 2: Profiling Your Dataset Before You Change Anything

  • Scan columns to find likely problem areas quickly
  • Count blanks and unusual values without advanced tools
  • Detect inconsistent formats (dates, phone numbers, casing)
  • Create a simple issue log to track what you find
  • Decide what “good enough” means for your goal

Chapter 3: Fixing Missing Values Without Guessing

  • Learn why values go missing and what it can mean
  • Choose between leaving blank, removing rows, or filling in
  • Fill missing text and categories using simple rules
  • Handle missing numbers with safe beginner methods
  • Validate results so you don’t introduce new mistakes

Chapter 4: Finding and Removing Duplicates the Right Way

  • Understand duplicates: exact vs. “same person, slightly different”
  • Pick the columns that define a unique record
  • Identify duplicate groups and review them safely
  • Remove duplicates while keeping the correct row
  • Prevent duplicates with simple input rules

Chapter 5: Fixing Typos and Standardizing Text

  • Spot common typo patterns and inconsistent naming
  • Standardize casing, spacing, and punctuation
  • Fix frequent misspellings using a reference list
  • Normalize categories so charts and counts make sense
  • Run quality checks to confirm text cleanup worked

Chapter 6: Putting It All Together: A Repeatable Cleaning Workflow

  • Clean a small dataset from start to finish with the full checklist
  • Document every change so it’s transparent and repeatable
  • Create a “cleaned data” output and keep raw data untouched
  • Build a simple quality report (before vs. after metrics)
  • Plan next steps: automation ideas without needing code

Sofia Chen

Data Analytics Instructor, Data Quality & Reporting

Sofia Chen is a data analytics instructor who helps beginners turn messy spreadsheet data into reliable reports. She has supported teams in operations and public programs by building simple, repeatable data quality workflows that reduce errors and rework.

Chapter 1: What “Clean Data” Means (and Why It Matters)

Before you can fix missing values, remove duplicates, or correct typos, you need a clear definition of what you’re aiming for. “Clean data” is not “perfect data.” It is data that is consistent enough, complete enough, and well-understood enough for a specific purpose—like calculating totals, producing a report, or training a simple model—without silently introducing errors.

This chapter establishes the mental model you’ll use throughout the course: data usually lives in tables; problems usually fall into a few repeatable categories; and safe cleaning is as much about workflow (copies, versions, documentation) as it is about changing cells. You’ll also learn why dirty data is expensive: it can lead to wrong totals, wrong conclusions, and wasted time when people argue over numbers instead of improving them.

We’ll keep the examples grounded in everyday spreadsheet scenarios: a customer list with missing emails, a sales report with duplicated invoices, or a product catalog with inconsistent names like “USB-C Cable,” “USBC cable,” and “USB C cabel.” By the end of the chapter, you’ll have your first simple data dictionary and a reusable checklist that helps you clean methodically rather than “click and hope.”

Practice note for Understand messy vs. clean data with everyday examples: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Learn the costs of dirty data (wrong totals, wrong decisions): document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Meet the three core problems: missing values, duplicates, typos: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Set up your first cleaning workspace and make a safe copy: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Build your first simple “data dictionary” (what columns mean): document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.


Sections in this chapter
Section 1.1: Data as a table (rows, columns, and records)

Most beginner-friendly datasets—especially in spreadsheets—are tables. A table is a grid where each row is usually one record (one customer, one invoice, one product), and each column is a field (customer name, invoice date, product category). Cleanliness starts with agreeing on what a row represents. If some rows are “customers” and other rows are “customer contacts,” the table is already ambiguous, and any totals or counts can be misleading.

Here’s a concrete example: imagine a sales table where each row is meant to be one transaction. If someone inserts a “subtotal” row after each month, your pivot table might treat those subtotals as real transactions and inflate revenue. That’s not a “typo” problem—it’s a table-structure problem. Clean data, in practice, is data that behaves predictably when filtered, sorted, grouped, and summarized.

When you open a dataset for the first time, do a quick structural scan:

  • Does every row represent the same kind of thing?
  • Do columns have a single meaning each (not “Notes + Status” combined)?
  • Are there header rows repeated in the middle (common in merged exports)?
  • Are there blank rows, extra title lines, or footers that will break formulas?

This matters because many “cleaning” steps depend on the record definition. You can only detect duplicates if you know what “the same record” means. You can only judge missing values if you know which fields are required for a valid record.
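The course itself requires no code, but for readers curious how a structural scan looks programmatically, here is a minimal Python sketch. The rows, column names, and the "valid record" rule are all hypothetical illustrations of the subtotal-row example above, not part of the course material:

```python
# Structural scan sketch: keep only rows that are real transaction records,
# excluding embedded subtotal rows and blank separator rows.
# Column names ("date", "amount") and the rule below are assumptions.

rows = [
    {"date": "2026-01-05", "amount": "120.00"},
    {"date": "2026-01-18", "amount": "80.00"},
    {"date": "January subtotal", "amount": "200.00"},  # embedded subtotal row
    {"date": "", "amount": ""},                        # blank separator row
    {"date": "2026-02-02", "amount": "55.00"},
]

def is_record(row):
    """A row counts as one transaction only if it has an ISO-style
    date (YYYY-MM-DD) and a non-empty amount."""
    date, amount = row["date"].strip(), row["amount"].strip()
    return len(date) == 10 and date[4] == "-" and date[7] == "-" and amount != ""

records = [r for r in rows if is_record(r)]
print(len(records))  # 3 real transactions; subtotal and blank rows excluded
```

The point is not the code itself but the discipline it encodes: the filter makes the record definition explicit, which is exactly what a spreadsheet "one row = one transaction" check does by eye.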

Section 1.2: What counts as an error vs. a valid value

Not every unusual value is an error. Data cleaning requires judgement: you must decide what is invalid (impossible or out-of-scope) versus what is simply rare. For example, an age of 250 is almost certainly invalid, but an age of 98 might be valid. A quantity of 0 might be an error in some contexts (you can’t sell zero units), but it might be valid in others (a backordered item listed with quantity 0 to reserve a line item).

Missing values are especially tricky because “missing” can be represented in many ways: blank cells, “N/A,” “NA,” “unknown,” “-”, or even a single space. Some systems also use sentinel values like 0 or 9999. Your job is to convert these messy representations into a consistent one, then decide what to do with them. The safe options are usually:

  • Leave missing values as missing when filling would invent information.
  • Remove records only when the missingness makes the record unusable for your task (and you can justify the loss).
  • Fill (impute) with a defensible rule for simple datasets, such as “fill missing country with ‘Unknown’” or “fill missing price with the median price within the same product category.”

Common mistake: filling missing numeric values with 0 “so formulas work.” This can distort totals and averages. If you must keep calculations running, prefer leaving blanks and handling them in formulas (e.g., using IF or COALESCE-like logic), or fill with an explicit placeholder that signals “not provided.” A clean dataset is one where errors are corrected, but uncertainty is not hidden.
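For the spreadsheet-free curious, the same rule can be sketched in a few lines of Python. The list of placeholder tokens is an assumption you would adapt to your own data:

```python
# Sketch: normalize common "missing" placeholders to None instead of 0,
# so totals and averages are not silently distorted.
# The MISSING_TOKENS set is an assumption, not a standard list.

MISSING_TOKENS = {"", "na", "n/a", "null", "-", "unknown"}

def normalize_missing(value):
    """Return None for any recognized missing-value placeholder."""
    if value is None or str(value).strip().lower() in MISSING_TOKENS:
        return None
    return value

ages = ["34", "N/A", " ", "-", "51", "unknown"]
cleaned = [normalize_missing(a) for a in ages]

# Average over present values only (never fill with 0).
present = [int(a) for a in cleaned if a is not None]
print(sum(present) / len(present))  # 42.5, not ~14.2 as zero-filling would give
```

Notice the difference: treating the four missing entries as zeros would drag the average from 42.5 down to about 14.2, which is exactly the "uncertainty hidden as data" failure described above.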

Section 1.3: The idea of “data quality” in plain language

“Data quality” sounds abstract, but you can think of it as answering one practical question: Can I trust this table for the decision I’m about to make? Dirty data is costly because it produces confident-looking outputs that are wrong. Wrong totals can lead to under-ordering inventory, overpaying commissions, or missing revenue targets. In a small business setting, it can mean calling the wrong customers, shipping to old addresses, or counting the same invoice twice.

Data quality has a few simple dimensions that show up in everyday work:

  • Completeness: Are required fields present (email, date, amount)?
  • Uniqueness: Are records duplicated (same customer entered twice, same transaction imported twice)?
  • Consistency: Do formats match (dates, currency symbols, capitalization, units)?
  • Validity: Do values fall within allowed ranges and categories?

In this course, you’ll meet three core problems again and again: missing values, duplicates, and typos/inconsistent text. They often interact. For instance, duplicates might differ only by a typo in a name; missing values might prevent you from matching records correctly; inconsistent formats (“03/04/25” meaning different dates) can create false duplicates or hide real ones.

The practical outcome of thinking in “quality dimensions” is that you stop cleaning randomly. Instead, you clean with a purpose: make the dataset reliable for the next step—reporting, analysis, or automation—and record what you changed so someone else can reproduce it.

Section 1.4: Backups, versions, and working on copies

The fastest way to create a data crisis is to “just fix a few things” in the only copy of your file. A professional cleaning workflow starts with safety: keep the raw data unchanged and do all edits on a copy. This is not bureaucracy—it’s how you avoid irreversible mistakes and how you earn trust when people ask, “Where did that number come from?”

Set up a simple workspace even if you’re only using a spreadsheet:

  • Create a folder with subfolders like raw, working, and output.
  • Save the original export in raw and never edit it.
  • Copy it into working and add a date or version number (e.g., sales_working_v01_2026-03-28).
  • Only share files from output once checks pass.

When you make a change—remove duplicates, standardize a column, fill missing values—write down what you did. In a spreadsheet, a “Change Log” tab is enough. In more advanced tools, you might use scripts and version control, but the principle is the same: cleaning should be repeatable. Common beginner mistake: using “Find and Replace” across the entire sheet and accidentally changing values in the wrong column (e.g., replacing “CA” with “Canada” and corrupting product codes). Working on copies and tracking versions makes these mistakes recoverable.
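The raw/working/output layout above can also be set up with a short script, which is handy if you re-download data often. This is a sketch using only the Python standard library; the folder and file names are illustrative:

```python
# Sketch of the raw/working/output workspace. Names are illustrative.
import shutil
from datetime import date
from pathlib import Path

base = Path("cleaning_project")
for sub in ("raw", "working", "output"):
    (base / sub).mkdir(parents=True, exist_ok=True)

raw_file = base / "raw" / "sales.csv"
raw_file.write_text("invoice_id,amount\n1001,250\n")  # stand-in for a real export

# Never edit raw: copy into working with a version + date stamp.
working_file = base / "working" / f"sales_working_v01_{date.today()}.csv"
shutil.copy(raw_file, working_file)
print(working_file.exists())  # edits happen on the copy; raw stays untouched
```

Whether you do this by hand in a file manager or with a script, the principle is identical: the raw export is read-only, and every working file carries a version you can trace.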

Your goal is to be able to answer: “If I re-download the data tomorrow, can I produce the same cleaned dataset again?” That mindset turns cleaning from one-off tidying into a reliable process.

Section 1.5: Column names, units, and allowed values

Clean data is also understood data. If column headers are vague or inconsistent, every later step becomes guesswork. A column named “Value” could mean revenue, profit, quantity, or a rating score. A column named “Date” might be order date, ship date, or invoice date. Ambiguity creates silent errors because your analysis might run successfully while using the wrong meaning.

This is where you build your first simple data dictionary: a short description of each column’s meaning and rules. Keep it lightweight—one table on a new sheet is enough. For each column, capture:

  • Description: what it represents (e.g., “Order date in customer’s local time”).
  • Type: text, integer, decimal, date/time.
  • Units: USD vs. EUR, kilograms vs. pounds, percentages vs. fractions.
  • Allowed values: especially for categories (e.g., Status ∈ {Pending, Shipped, Cancelled}).
  • Missing allowed? and if not, what to do when missing.

Typos and inconsistent formats often show up as “new categories.” For example, a Status column might include “Shipped,” “shipped,” “Shippped,” and “SHIPPED.” Without a defined set of allowed values, you won’t know which one is correct, and your totals by status will fragment across multiple spellings. Standardizing text becomes much easier once you’ve declared the target values.

Practical tip: rename columns early to be specific and consistent (e.g., order_date, ship_date, amount_usd). Clear names reduce mistakes when filtering, joining, or building charts.
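A data dictionary becomes even more useful when it can check values for you. Here is a minimal Python sketch of that idea, using the Status example above; the column rules are hypothetical:

```python
# Sketch: a lightweight data dictionary as a Python dict, used to flag
# cells outside the allowed set. Column names and rules are hypothetical.

DATA_DICTIONARY = {
    "status": {
        "allowed": {"Pending", "Shipped", "Cancelled"},
        "missing_allowed": False,
    },
}

def check_value(column, value):
    """Return a list of problems for one cell (empty list = OK)."""
    rules = DATA_DICTIONARY[column]
    problems = []
    if value is None or str(value).strip() == "":
        if not rules["missing_allowed"]:
            problems.append("missing not allowed")
        return problems
    if value not in rules["allowed"]:
        problems.append(f"unexpected value: {value!r}")
    return problems

print(check_value("status", "Shipped"))   # []
print(check_value("status", "Shippped"))  # ["unexpected value: 'Shippped'"]
```

In a spreadsheet, the equivalent is a data-validation dropdown built from the allowed-values list; the dictionary is what makes "Shippped" detectable as an error rather than a new category.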

Section 1.6: A beginner’s cleaning checklist overview

A checklist turns “cleaning” from an intimidating, open-ended task into a predictable routine. You’ll refine this checklist throughout the course, but you can start with a beginner-friendly version that works on almost any spreadsheet dataset.

  • 1) Protect the raw file: save an untouched copy; work on a versioned duplicate.
  • 2) Confirm table structure: one header row, one record per row, no embedded subtotals or footers.
  • 3) Understand each column: draft a simple data dictionary with meaning, type, and units.
  • 4) Standardize formats: dates, currency, decimal separators, capitalization, and whitespace.
  • 5) Check missing values: identify how missing is encoded; decide leave/remove/fill with a documented rule.
  • 6) Find duplicates: define the “duplicate key” (e.g., invoice_id, or name+date+amount); remove extras while keeping the correct record.
  • 7) Fix typos and inconsistent text: use repeatable rules (trim spaces, consistent case, mapping table for known misspellings).
  • 8) Validate results: re-check totals, counts, and key distributions to ensure cleaning didn’t introduce new errors.

The key engineering judgement is knowing that “clean” depends on your goal. If you’re emailing customers, missing email addresses are critical; if you’re analyzing store performance, missing emails might not matter. Similarly, removing duplicates is safe only when you can explain why one record is the true one—maybe the newest timestamp wins, or maybe the record with the most complete fields wins. A checklist helps you make these decisions explicitly and consistently, which is what makes your work trustworthy.
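Step 8 of the checklist, validation, is worth a concrete sketch. The idea is to compare simple summary numbers before and after cleaning so that a change you cannot explain becomes visible. The data and column names here are hypothetical:

```python
# Sketch: a "before vs. after" check comparing row count and total amount,
# so a cleaning pass cannot silently delete valid revenue.
# Rows and column names are illustrative.

def summarize(rows):
    return {"rows": len(rows), "total": sum(r["amount"] for r in rows)}

before = [{"id": 1, "amount": 250},
          {"id": 1, "amount": 250},  # duplicate import of the same invoice
          {"id": 2, "amount": 90}]
after = [{"id": 1, "amount": 250}, {"id": 2, "amount": 90}]

b, a = summarize(before), summarize(after)
print(b)  # {'rows': 3, 'total': 590}
print(a)  # {'rows': 2, 'total': 340}

# Expected change: exactly one duplicate removed, total drops by 250.
assert b["rows"] - a["rows"] == 1 and b["total"] - a["total"] == 250
```

In a spreadsheet, the same check is two SUM and COUNT cells on the raw tab compared against the cleaned tab; if the difference is not what your cleaning rule predicts, stop and investigate before sharing the output.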

With this foundation—table structure, error definitions, quality dimensions, safe workflow, and a basic data dictionary—you’re ready to start hands-on cleaning in the next chapter.

Chapter milestones
  • Understand messy vs. clean data with everyday examples
  • Learn the costs of dirty data (wrong totals, wrong decisions)
  • Meet the three core problems: missing values, duplicates, typos
  • Set up your first cleaning workspace and make a safe copy
  • Build your first simple “data dictionary” (what columns mean)
Chapter quiz

1. In this chapter, what best describes “clean data”?

Correct answer: Data that is consistent enough, complete enough, and well-understood enough for a specific purpose without silently introducing errors
The chapter defines clean data as fit-for-purpose (consistent, complete, and understood), not perfect.

2. Why does the chapter say dirty data is expensive?

Correct answer: It can lead to wrong totals, wrong conclusions, and wasted time arguing over numbers
Dirty data creates incorrect results and wastes time because people debate numbers instead of improving them.

3. Which set lists the three core problems introduced in the chapter?

Correct answer: Missing values, duplicates, typos
The chapter centers on three repeatable categories of issues: missing values, duplicates, and typos.

4. What is the main reason to make a safe copy before cleaning?

Correct answer: To avoid irreversible changes and support a versioned, safe workflow
Safe cleaning emphasizes workflow: copies/versions allow you to recover and compare changes.

5. What is the purpose of a simple data dictionary in this chapter’s workflow?

Correct answer: To document what columns mean so the data is well-understood for its intended use
A data dictionary provides shared understanding of columns, supporting consistent, reliable cleaning and use.

Chapter 2: Profiling Your Dataset Before You Change Anything

Before you delete rows, fill blanks, or “fix” spelling, you need a quick, disciplined read of what you actually have. This step is called profiling: a structured scan of the dataset to discover patterns, problems, and risks. Profiling is not busywork. It prevents two common failures in data cleaning: (1) making changes that silently damage the meaning of your data, and (2) spending time polishing columns that do not matter for your goal.

Think of profiling as an inspection checklist you run before repairs. You are looking for likely problem areas (missing values, duplicates, typos, inconsistent formats), counting how often they occur, and deciding what “good enough” means for your task. In a business setting, a dataset used for monthly trend reporting can tolerate different imperfections than a dataset used to send invoices or calculate commissions.

A practical profiling workflow is simple: scan columns quickly, count blanks and placeholders, look for outliers and suspicious values, check formats (dates, numbers, IDs), check consistency (categories and spelling patterns), and write down what you find in an issue log. The issue log is your bridge from observation to action. It keeps your cleaning steps explainable and repeatable, especially if you need to defend decisions later.

In this chapter, you will learn how to profile data using basic spreadsheet actions: sorting, filtering, simple counts, and a careful eye. No advanced tools are required—just a safe approach that avoids accidental edits.

Practice note for Scan columns to find likely problem areas quickly: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Count blanks and unusual values without advanced tools: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Detect inconsistent formats (dates, phone numbers, casing): document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Create a simple issue log to track what you find: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Decide what “good enough” means for your goal: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.


Sections in this chapter
Section 2.1: Quick column scans and sorting safely


Start by scanning your columns like an auditor. Read the column headers, confirm what each field should represent, and note where ambiguity exists (for example, does “Date” mean order date, ship date, or signup date?). This sounds basic, but unclear meaning is a major cause of “clean” data producing wrong results.

Next, do quick column scans using sorting and filtering—but do it safely. The number-one mistake is sorting a single column and breaking the alignment between columns (turning one customer’s email into another customer’s order). To avoid this, always select the entire table (or use your spreadsheet’s “Format as Table” feature) before sorting. Confirm that “Expand the selection” is enabled so all columns move together.

  • Sort A→Z / Z→A to bring blanks, symbols, and odd values to the top or bottom.
  • Filter to view unique values and get a feel for category variety.
  • Freeze the header row so you can scroll without losing context.

When scanning, look for “obvious seams”: sudden shifts in value style (some entries in ALL CAPS, some in Title Case), mixed separators (hyphens vs spaces), and entries that look like they belong to a different field. Also pay attention to columns that are unexpectedly empty or unexpectedly full. A “Phone” column with 95% blanks might be normal for a web signup; it might also mean the column was never populated due to an import issue.

Practical outcome: by the end of this step, you should know which 3–5 columns are most likely to cause trouble in analysis or operations, and you should have avoided any destructive “quick fixes” while exploring.
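The "expand the selection" rule has a direct programmatic analogue: sort whole records, never a single column. A minimal Python sketch, with illustrative data:

```python
# Sketch: sorting rows as whole records keeps columns aligned, which is
# what "Expand the selection" guarantees in a spreadsheet. Data is made up.

rows = [
    {"customer": "Ana",  "email": "ana@example.com"},
    {"customer": "",     "email": "zed@example.com"},   # blank surfaces first
    {"customer": "Marc", "email": "marc@example.com"},
]

# Safe: each row moves as a unit, so emails stay with their customers.
rows_sorted = sorted(rows, key=lambda r: r["customer"])
print([r["customer"] for r in rows_sorted])  # ['', 'Ana', 'Marc']
assert rows_sorted[0]["email"] == "zed@example.com"  # alignment preserved
```

Sorting also doubles as a scanning tool here: the blank customer name rises to the top exactly as described above, without any risk of shuffling one column independently of the others.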

Section 2.2: Counting blanks and spotting placeholders (NA, -, 0)


Missing data is rarely just “blank cells.” Real-world datasets often encode missingness using placeholders: NA, N/A, null, a dash (-), unknown, or even 0. If you only count blanks, you will underestimate missing values and make unsafe decisions about filling or deleting rows.

In a spreadsheet, you can do a basic missingness profile with simple counts. Use a blank count (for example, COUNTBLANK in many tools) and then separately count common placeholders with straightforward criteria (filter for “NA”, filter for “-”, filter for “0”). Keep the counts in a small side table so you can compare columns. You are not trying to be perfect; you are trying to understand magnitude and patterns.

  • Blank vs placeholder: decide whether placeholders should be treated as missing or meaningful. A “0” in Quantity might be real; a “0” in Age is suspicious.
  • Missingness pattern: missing values clustered in certain dates, regions, or sources may indicate a system issue.
  • Critical fields: mark which columns are required for your purpose (e.g., invoice needs customer ID and amount; marketing email needs email address).

A common mistake is to immediately fill missing values “to make charts work.” That can bias results. For example, filling missing revenue with 0 can understate totals if the revenue is missing due to tracking gaps, not true zeros. Profiling helps you choose between leaving missing values, removing records, or filling—based on what the column means and how the dataset will be used.

Practical outcome: you will know the true extent of missingness and the specific tokens used to represent it, which prepares you to standardize missing values later (e.g., turning “-” and “N/A” into a consistent blank).
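The side table of counts described above can be sketched in a few lines of Python. The placeholder token list is an assumption to adapt per dataset:

```python
# Sketch: a missingness profile counting true blanks and placeholder
# tokens separately for one column. PLACEHOLDERS is an assumption.

from collections import Counter

PLACEHOLDERS = {"na", "n/a", "-", "null", "unknown"}

def missingness_profile(rows, column):
    counts = Counter()
    for r in rows:
        v = str(r.get(column, "")).strip()
        if v == "":
            counts["blank"] += 1
        elif v.lower() in PLACEHOLDERS:
            counts["placeholder"] += 1
        else:
            counts["present"] += 1
    return dict(counts)

rows = [{"phone": "555-0101"}, {"phone": ""}, {"phone": "N/A"},
        {"phone": "-"}, {"phone": "555-0102"}]
print(missingness_profile(rows, "phone"))
# {'present': 2, 'blank': 1, 'placeholder': 2}
```

This is the same logic as COUNTBLANK plus a few filters in a spreadsheet: counting blanks alone would report 1 missing value here, while the true number of unusable entries is 3.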

Section 2.3: Finding outliers and suspicious values


Outliers are values that do not fit the general pattern. Some outliers are legitimate (a large enterprise order), and some are data errors (an extra zero, a misplaced decimal, a negative quantity). Your job in profiling is not to delete outliers automatically—it is to identify them and decide which ones require investigation.

Use quick, non-technical checks first: sort numeric columns ascending and descending and look at the extremes. For dates, sort earliest to latest and check for impossible ranges (orders in 2099, birthdates in 1800). For text fields that should be short (state codes, country codes), sort and look for unusually long strings that might indicate notes were pasted into the wrong column.

  • Boundary checks: values below 0 where negatives are impossible; percentages above 100; ages above a plausible maximum.
  • Unit confusion: some rows in dollars, others in cents; weights in kg mixed with lbs.
  • Copy/paste artifacts: repeated values down a column from a misfilled cell; trailing spaces that create “different” categories.

Engineering judgement matters here. If you are cleaning data for a high-level trend report, you might flag a handful of extreme values and proceed. If the dataset drives financial payouts, you may need a stricter process: verify against source systems, require approvals, and document every correction.

Common mistake: deleting outliers because they “look wrong” without verifying what “wrong” means. In many domains, the unusual values are the most important signals (fraud detection, high-value customers, system failures). Profiling helps you separate “rare but real” from “rare because broken.”

Practical outcome: you will have a short list of suspicious records and columns, ready to be logged and handled with appropriate caution.

Section 2.4: Format checks: dates, numbers, currency, IDs

Inconsistent formats are a hidden cause of analysis errors because they often look fine to the eye but behave differently in calculations and filters. Profiling formats means checking whether a column is consistently stored as the type you expect: dates as true dates (not text), numbers as numbers (not strings), currency as a numeric value plus a currency indicator (not mixed symbols), and IDs as stable identifiers (often best kept as text).

Start with dates. Look for mixed date patterns such as MM/DD/YYYY and DD/MM/YYYY, or a mix of “2026-03-28” and “3/28/26”. Sorting can reveal this quickly: text dates often sort alphabetically, causing “10/…” to appear before “2/…”. Also watch for time components where you do not expect them (timestamps in a “Date” field).

  • Numbers: check for commas, spaces, and parentheses (e.g., “(1,200)” as negative). These can convert numbers into text.
  • Currency: look for “$1,200”, “1200 USD”, and “1.200,00 €” in the same column—mixed locale formats require deliberate handling.
  • IDs: beware of leading zeros (e.g., “001234”) disappearing if the column is treated as numeric.
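As a scripting analog of these format checks, the sketch below flags IDs at risk of losing leading zeros and parses spreadsheet-style amounts (commas, parentheses-as-negative). The sample values are hypothetical:

```python
# Profile format risks without changing the data: spot IDs whose leading
# zeros would vanish if the column became numeric, and parse text amounts.
ids = ["001234", "8872", "000051"]
amounts = ["1,200", "(450)", "300"]

# An ID is at risk if stripping leading zeros changes it.
leading_zero_ids = [i for i in ids if len(i) > 1 and i != i.lstrip("0")]

def parse_amount(text):
    """Parse a spreadsheet-style amount: strip commas, treat (x) as negative."""
    t = text.replace(",", "").strip()
    if t.startswith("(") and t.endswith(")"):
        return -float(t[1:-1])
    return float(t)

parsed = [parse_amount(a) for a in amounts]
print(leading_zero_ids)  # IDs that must stay stored as text
print(parsed)
```

The profiling output (which IDs are at risk, which amounts are really text) becomes the evidence for a systematic rule later, rather than cell-by-cell retyping.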

A common mistake is “fixing” format by retyping a few cells, creating inconsistent manual edits. Instead, profiling should tell you whether a column needs a systematic rule later (for example, parse all dates using one known source format, store IDs as text, strip currency symbols into a separate column).

Practical outcome: you will know which columns have type/format risks that could break merges, comparisons, or calculations—even if the values look similar on screen.

Section 2.5: Consistency checks: categories and spelling patterns

Text inconsistency is where duplicates and typos hide. Category fields (country, product line, department) should have a manageable set of values. Profiling is about discovering whether the categories are clean enough to group reliably. If “NY”, “N.Y.”, “New York”, and “newyork” appear, your pivot tables will split what should be one group into four.

Use filters to view unique values and scan for patterns: differences in casing, extra spaces, punctuation, and common misspellings. Sorting A→Z often clusters near-duplicates together, making them easier to spot. Also check for “other” buckets that have become dumping grounds (“Misc”, “Unknown”, “TBD”) and decide whether those are acceptable for your goal.

  • Casing: “HR” vs “Hr” vs “hr” can create separate groups.
  • Whitespace: trailing spaces make values look identical but compare as different.
  • Synonyms: “United States”, “USA”, “US” need a standard.
  • Typos: repeated patterns (e.g., “Califronia”) suggest a rule-based fix later.
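The same near-duplicate clustering can be sketched in Python by grouping raw values under a normalized form (lowercase, punctuation and spaces removed). The sample values are illustrative:

```python
# Surface near-duplicate categories: group raw spellings by a normalized
# key; any key with more than one raw spelling needs a standard.
import re
from collections import defaultdict

raw = ["NY", "N.Y.", "New York", "newyork ", "CA", "California"]

def normalize(value):
    # Lowercase and drop everything except letters and digits.
    return re.sub(r"[^a-z0-9]", "", value.lower())

groups = defaultdict(list)
for v in raw:
    groups[normalize(v)].append(v)

collisions = {k: vs for k, vs in groups.items() if len(vs) > 1}
print(collisions)
```

Like sorting A→Z in a spreadsheet, this clusters variants together — but it also catches casing and punctuation differences that alphabetical sorting can miss.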

This is also the right time to spot potential duplicates conceptually: two records may refer to the same entity with slightly different spelling (“Acme Inc” vs “ACME Incorporated”). Profiling does not remove duplicates yet; it identifies which fields could serve as reliable matching keys and which fields are too messy to match safely.

Practical outcome: you will know whether a “standardization pass” is required, and you will have a shortlist of the highest-impact categories to normalize first.

Section 2.6: Defining cleaning rules based on the task

Profiling ends with a decision: what does “good enough” mean for this dataset and this purpose? Data cleaning is not a moral virtue; it is risk management. You choose rules that reduce error in the output you care about while minimizing unintended side effects.

Start by writing a simple issue log. A basic table is enough: column name, issue type (missing, duplicate, typo, format), examples, estimated count, impact, and proposed action. This log prevents random, one-off edits and creates a repeatable checklist for future datasets.

  • Goal-driven thresholds: for a dashboard, you might accept 1–2% missing in non-critical fields; for billing, missing customer ID may be unacceptable.
  • Safe defaults: prefer non-destructive steps first (standardize placeholders, trim spaces) before dropping rows.
  • Document assumptions: if you decide “0 means missing for Age,” record it. Someone will ask later.

Then define cleaning rules in plain language before you implement them. Examples: “Treat ‘NA’, ‘N/A’, and ‘-’ as missing in the Notes field,” “Store CustomerID as text to preserve leading zeros,” “Standardize State to two-letter codes,” “For duplicate emails, keep the most recent signup date.” You are translating profiling discoveries into consistent actions.
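Two of those plain-language rules can be translated into a small, repeatable pass like the sketch below (the tokens and sample records are hypothetical):

```python
# Rule 1: treat "NA", "N/A", and "-" as missing in the Notes field.
# Rule 2: keep CustomerID as text (never convert to int) to preserve
#         leading zeros.
MISSING_TOKENS = {"na", "n/a", "-", ""}

def clean_notes(value):
    """Map placeholder tokens to None (missing); otherwise trim whitespace."""
    return None if value.strip().lower() in MISSING_TOKENS else value.strip()

records = [
    {"CustomerID": "00731", "Notes": "N/A"},
    {"CustomerID": "00045", "Notes": "called back "},
]
for r in records:
    r["Notes"] = clean_notes(r["Notes"])

print(records)
```

Writing the rule as one function (instead of scattered find/replace edits) is exactly the traceability the issue log is meant to support: the rule is visible, testable, and reusable on the next dataset.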

Common mistake: using one universal rule like “drop all rows with blanks.” That can wipe out valuable records and bias your analysis toward more complete sources. Instead, tie each rule to the task: analysis, reporting, outreach, or operations. Profiling gives you the evidence to choose wisely.

Practical outcome: you finish this chapter with an issue log and a set of proposed rules, ready to execute in the next cleaning steps with confidence and traceability.

Chapter milestones
  • Scan columns to find likely problem areas quickly
  • Count blanks and unusual values without advanced tools
  • Detect inconsistent formats (dates, phone numbers, casing)
  • Create a simple issue log to track what you find
  • Decide what “good enough” means for your goal
Chapter quiz

1. What is the main purpose of profiling a dataset before making any cleaning changes?

Show answer
Correct answer: To discover patterns, problems, and risks so you avoid damaging meaning and wasting effort
Profiling is a structured inspection to understand what you have and prevent harmful or unnecessary cleaning.

2. According to the chapter, which pair of failures does profiling help prevent?

Show answer
Correct answer: Silently damaging the meaning of the data and polishing columns that don’t matter for the goal
Profiling reduces the risk of changes that alter meaning and avoids spending time on low-impact columns.

3. Which action best matches the chapter’s recommended quick profiling workflow?

Show answer
Correct answer: Scan columns, count blanks/placeholders, check for outliers and format consistency, and record findings
The chapter emphasizes a simple, disciplined scan and documentation before any fixes.

4. Why does the chapter recommend creating an issue log during profiling?

Show answer
Correct answer: To bridge observation to action and keep cleaning steps explainable and repeatable
An issue log documents what you found and supports defensible, repeatable cleaning decisions.

5. How should you decide what “good enough” data quality means during profiling?

Show answer
Correct answer: Base it on the goal, since different tasks tolerate different imperfections
The acceptable level of imperfections depends on how the dataset will be used (e.g., trend reporting vs invoicing).

Chapter 3: Fixing Missing Values Without Guessing

Missing values are one of the most common reasons analyses go wrong in simple spreadsheets. A blank cell can silently change totals, hide patterns, or break a pivot table. The goal of this chapter is not to “make blanks disappear,” but to make deliberate, documented choices about what a blank means and how your dataset should behave after cleaning.

Begin with a mindset shift: missing data is information. It can signal a process problem (someone skipped a field), a data integration issue (a lookup failed), or a meaningful outcome (a question didn’t apply). If you treat every blank the same way—always deleting or always filling—you will eventually create confident-looking results that are simply untrue.

A safe workflow for beginners is: (1) identify where values are missing and how often, (2) decide what each missing field means, (3) choose one of three actions—leave, remove, or fill—based on the role of that field, then (4) validate with before/after checks so you don’t introduce new mistakes. The sections below give you practical rules you can apply immediately, plus warnings about common traps.

  • Leave missing values when the blank is meaningful or you can’t justify a replacement.
  • Remove rows only when the missingness makes the record unusable and removal won’t bias results.
  • Fill with controlled labels (for categories) or simple summaries (for numbers) only when it supports your analysis and you can explain it.
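One way to make the leave/remove/fill choice explicit is a per-column policy table, sketched below in Python. The policies shown are illustrative examples, not a recommendation for every dataset:

```python
# The leave / remove / fill decision, expressed as a per-column policy.
POLICY = {
    "customer_id": "remove_row",    # record is unusable without its key
    "segment":     "fill_unknown",  # category: label missingness explicitly
    "phone":       "leave",         # not needed for this analysis
}

def apply_policy(row):
    """Apply the policy; return None if the row should be dropped."""
    for col, action in POLICY.items():
        if row.get(col) in (None, ""):
            if action == "remove_row":
                return None
            if action == "fill_unknown":
                row[col] = "Unknown"
            # "leave": do nothing
    return row

rows = [
    {"customer_id": "A1", "segment": "", "phone": None},
    {"customer_id": "",   "segment": "SMB", "phone": "555"},
]
cleaned = [r for r in (apply_policy(dict(r)) for r in rows) if r is not None]
print(cleaned)
```

Because every action is named in one place, the policy doubles as documentation: anyone reading it can see which fields were left blank on purpose.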

Most importantly, “fixing” missing values should not be about guessing what the value was. It should be about ensuring the dataset behaves consistently and transparently for the decision you’re trying to support.

Practice note for Learn why values go missing and what it can mean: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Choose between leaving blank, removing rows, or filling in: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Fill missing text and categories using simple rules: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Handle missing numbers with safe beginner methods: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Validate results so you don’t introduce new mistakes: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Sections in this chapter
Section 3.1: Types of missingness (blank vs. unknown vs. not applicable)

Not all blanks mean the same thing. Before you touch the data, classify missing values into a few practical types. This reduces guesswork and helps you choose a safe action later.

Blank / not recorded means the value could exist, but it wasn’t captured. Example: a customer email cell is empty because the form allowed submission without email. This type often points to process issues (optional fields, training gaps, system bugs). Treat it as “missing,” not as a meaningful category.

Unknown means the data collector tried but couldn’t obtain it, or it wasn’t available at the time. Example: “Age” might be unknown for some users. In spreadsheets, unknown is often represented as blank, but it is conceptually different from “not recorded,” because it can be a legitimate outcome of collection.

Not applicable (N/A) means the field does not apply to that record. Example: “Refund reason” is not applicable when there was no refund. If you leave these blank without labeling, you may later confuse “no refund” with “refund happened but reason missing.”

Practical tip: create a short data dictionary note for each column: what does blank mean here—missing, unknown, or not applicable? If different meanings appear in the same column, that’s a red flag. You may need to standardize inputs (e.g., use “Not Applicable” for cases where the field truly doesn’t apply) to avoid mixing categories.

Section 3.2: When to keep missing values as-is

Keeping missing values is often the safest choice, especially early in a project. You should leave blanks as-is when replacing them would add assumptions you cannot defend or when the absence itself is informative.

Leave missing values untouched when: (1) the field is not required for your analysis, (2) the missingness might indicate a real-world condition you want to measure, or (3) you don’t have a trustworthy rule for replacement. For example, if you’re analyzing total sales by region, you might not need to fill missing “Customer phone number.” Filling it with anything adds noise and can create false completeness.

Another common case: time series or event logs. If a “last_login_date” is blank, that may indicate the user never logged in. Filling with today’s date would be misleading; filling with an average date would destroy the meaning entirely. In these situations, leaving blanks preserves the ability to interpret behavior correctly.

Be careful with tools that automatically coerce blanks to zeros. In many spreadsheets and BI tools, a blank numeric cell may be treated as zero in some calculations but ignored in others. If you decide to leave values missing, document how your formulas handle blanks (e.g., AVERAGE ignores blanks; SUM treats blanks as zero; COUNT vs. COUNTA behave differently). A safe practice is to explicitly calculate missing counts per column so you don’t forget they exist.
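The difference between "ignore blanks" and "treat blanks as zero" is easy to see with a small numeric sketch (using None to stand in for a blank cell):

```python
# Blank handling changes answers: AVERAGE-style (ignore blanks) vs.
# treating blanks as zero, computed on the same column.
values = [10, None, 20, None, 30]  # None stands in for a blank cell

observed = [v for v in values if v is not None]
avg_ignoring_blanks = sum(observed) / len(observed)  # like AVERAGE
avg_blanks_as_zero = sum(observed) / len(values)     # blanks counted as 0

print(avg_ignoring_blanks, avg_blanks_as_zero)
```

Two defensible-looking averages, one column — which is why documenting how blanks are handled matters more than which convention you pick.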

Outcome: keeping missing values can be a deliberate, transparent decision—not an incomplete cleanup. Your dataset remains honest about what you do and don’t know.

Section 3.3: When removing rows is acceptable (and when it’s risky)

Deleting rows feels clean and final, but it’s also the easiest way to bias results. Removal is acceptable only when the row cannot serve the purpose of the dataset and when the missingness is small and plausibly random.

Acceptable examples: you are building a contact list and rows missing the primary identifier (e.g., customer_id or email) cannot be used; or you are summarizing transactions and rows missing the transaction amount are clearly corrupted imports. In these cases, the record is effectively unusable, and removal may be better than filling with guesses.

Risky examples: removing rows where “income” is missing in a demographic survey. People with missing income may be systematically different (privacy-conscious, higher income, certain regions). If you delete them, your averages and distributions can shift in a way that looks “more complete” but is less accurate.

Practical rule: before removing anything, compute how many rows you would drop and where those rows are concentrated. Are missing rows clustered in a particular month, store, or category? If yes, deletion is likely to distort comparisons. If the missingness is under a small threshold (for beginner projects, think a few percent) and appears spread out, removal may be acceptable—still document it.

Safer workflow: instead of deleting immediately, create a filtered view or a separate “excluded_rows” sheet. Record the reason for exclusion (e.g., “missing primary key,” “amount missing”), and keep a count. This preserves traceability and makes it easy to revisit the decision if the analysis changes.
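The "excluded_rows sheet" pattern translates directly into a script: split rows into kept and excluded, attaching a recorded reason to each exclusion. The field names and reasons below are illustrative:

```python
# Don't delete in place: split rows into kept vs. excluded, with a
# recorded reason — the scripting analog of an excluded_rows sheet.
rows = [
    {"customer_id": "C1", "amount": 120.0},
    {"customer_id": "",   "amount": 75.0},   # missing primary key
    {"customer_id": "C3", "amount": None},   # missing amount
]

kept, excluded = [], []
for r in rows:
    if not r["customer_id"]:
        excluded.append({**r, "reason": "missing primary key"})
    elif r["amount"] is None:
        excluded.append({**r, "reason": "amount missing"})
    else:
        kept.append(r)

print(len(kept), len(excluded))
```

Nothing is destroyed: the excluded records and their reasons remain available if the analysis changes or a stakeholder asks why a row disappeared.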

Section 3.4: Filling missing categories with “Unknown” and controlled labels

For text fields and categorical columns (department, product type, channel), filling can be useful—but only with controlled labels that do not pretend to know the true value. The safest fill for genuine missingness is a label such as “Unknown” or “Missing”, and for non-applicable cases, a label like “Not Applicable”.

Why fill categories at all? Because many summaries (pivot tables, charts, grouping) work better when every row has a category. If blanks remain, you may unintentionally drop records from grouped views or make the “(blank)” bucket ambiguous. A controlled label makes the missingness visible and countable.

Key practice: use a limited, consistent set of labels. Don’t allow “unknown,” “Unknown,” “UNK,” and “?” to coexist. Pick one spelling and one capitalization. If you’re cleaning a spreadsheet, create a small reference list (a mini codebook) such as: Known values: {Online, Store, Partner}; Missing: Unknown; Not applicable: Not Applicable.

Also watch for “hidden missingness” in categories: cells with spaces, “-”, “N/A”, or “none” can all behave like separate categories. Standardize them intentionally. For example, treat “N/A” as “Not Applicable” only if that meaning is correct in your context; otherwise, it may be better mapped to “Unknown.”
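That standardization can be expressed as one explicit mapping from placeholder tokens to controlled labels. The mapping below is context-specific and illustrative — it assumes "N/A" really means Not Applicable in this dataset:

```python
# Standardize hidden-missingness tokens into controlled labels.
LABEL_MAP = {
    "": "Unknown",
    "-": "Unknown",
    "none": "Unknown",
    "n/a": "Not Applicable",  # only if that meaning is correct in context
}

def standardize(value):
    key = value.strip().lower()
    return LABEL_MAP.get(key, value.strip())

channel = ["Online", " ", "N/A", "none", "Store", "-"]
print([standardize(v) for v in channel])
```

Because the mapping is written down, the decision ("'-' means Unknown here") is visible and reviewable instead of buried in manual edits.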

Outcome: you gain cleaner group-by results without inventing data. Missingness becomes a measurable category you can report, not a silent gap.

Section 3.5: Filling missing numbers with simple summaries (and limits)

Filling missing numeric values is where people most often “guess.” Beginner-safe methods use simple summaries, but even these must be applied carefully and only when the analysis requires complete numbers.

Common simple fills include: median (often safer than mean when there are outliers), mean (works when values are symmetric and missingness is light), or a group-specific summary (e.g., median by product category). Another practical option is a constant that clearly signals “not provided,” such as 0 only when 0 is a valid real-world value and means “none.”

Limits you must understand: filling numbers changes distributions and reduces variability. If you replace many blanks with the same median, charts can show an artificial spike at that value. Correlations and regression results can be distorted because filled values create patterns that were not observed. This is why “fill everything” is rarely correct.

Practical guideline: fill numeric values only when (1) the numeric field is required for the computation you’re doing, (2) the missing rate is low or you can justify a group-based fill, and (3) you will clearly label the method (e.g., “price_filled_median”) or track it with a companion flag column like “price_was_missing = TRUE/FALSE.”

In spreadsheets, you can implement a cautious fill with an IF statement: if blank, use median; else use original. Keep the original column unchanged when possible. Outcome: your analysis can proceed while preserving transparency about which values were observed versus filled.
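In a script, the same cautious pattern — median fill plus a companion flag column, originals untouched — can be sketched like this (the price values are hypothetical):

```python
# Median fill with a companion flag column, keeping the original intact.
from statistics import median

price = [12.0, None, 15.0, 14.0, None, 13.0]  # None = blank cell

observed = [p for p in price if p is not None]
fill_value = median(observed)

price_was_missing = [p is None for p in price]                    # flag column
price_filled = [fill_value if p is None else p for p in price]    # new column

print(price_filled, price_was_missing)
```

The flag column preserves the observed-vs-filled distinction, so later analysis can exclude filled values or report how many there were.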

Section 3.6: Before/after checks: totals, counts, and spot checks

Any missing-value fix can introduce new errors: accidental overwrites, misapplied formulas, or shifting row alignment. Validation is how you prevent “cleaning” from becoming silent corruption. Always do before/after checks that match the risks of your chosen method.

Counts: record the number of rows before cleaning and after. If you removed rows, the difference should match exactly what you intended. Also count missing cells per column before and after (COUNTBLANK in spreadsheets). If you filled categories with “Unknown,” the blank count should drop while the “Unknown” count rises by the same amount.

Totals and summaries: for numeric columns, compare totals (SUM) and central tendencies (mean/median) before and after. If you filled missing numbers, totals will change—so compute both the original total (ignoring blanks) and the filled total, and write down why the change is acceptable for your analysis. If totals change unexpectedly when you did not intend to fill numbers, investigate formula behavior with blanks and zeros.
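The counts check has a simple reconciliation rule: blanks removed must equal controlled labels added. A minimal sketch, using illustrative category data:

```python
# Before/after check: blank count should drop by exactly the number of
# "Unknown" labels added — nothing more, nothing less.
before = ["A", "", "B", "", "C"]
after = ["A", "Unknown", "B", "Unknown", "C"]

blanks_before = before.count("")                 # like COUNTBLANK, before
blanks_after = after.count("")                   # like COUNTBLANK, after
unknown_added = after.count("Unknown") - before.count("Unknown")

# Reconciliation: blanks removed == Unknowns added.
assert blanks_before - blanks_after == unknown_added
print(blanks_before, blanks_after, unknown_added)
```

If that assertion ever fails, a fill touched cells it shouldn't have — exactly the silent corruption these checks exist to catch.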

Spot checks: manually inspect a small sample of rows that were missing before. Confirm they were handled correctly (Unknown vs Not Applicable; median fill applied only where blank). Also spot-check rows that were not missing to ensure they were not altered by a broad find/replace or a dragged formula range.

Reconciliation: if your data comes from a source system, compare key metrics (e.g., transaction count per day) to a trusted report. A simple reconciliation catches issues like an entire day becoming “missing” due to a failed import.

Outcome: you finish with data you can defend. Cleaning decisions become auditable steps—counts, totals, and samples—rather than hope that the spreadsheet “looks better.”

Chapter milestones
  • Learn why values go missing and what it can mean
  • Choose between leaving blank, removing rows, or filling in
  • Fill missing text and categories using simple rules
  • Handle missing numbers with safe beginner methods
  • Validate results so you don’t introduce new mistakes
Chapter quiz

1. Why does the chapter say missing values should be treated as “information” rather than just errors to fix?

Show answer
Correct answer: Because blanks can signal process problems, integration failures, or meaningful outcomes
Missingness can reveal real issues or meaningful non-applicability, so treating all blanks the same can create untrue results.

2. What is the main goal when fixing missing values in this chapter?

Show answer
Correct answer: Make deliberate, documented choices so the dataset behaves consistently and transparently
The focus is not “making blanks disappear,” but choosing actions you can justify and explain.

3. Which workflow best matches the chapter’s recommended beginner-safe approach to missing values?

Show answer
Correct answer: Identify missingness, decide what it means, choose leave/remove/fill, then validate with before/after checks
The chapter outlines a four-step workflow that prioritizes understanding meaning and validating changes.

4. When is it most appropriate to leave a missing value blank according to the chapter?

Show answer
Correct answer: When the blank is meaningful or you can’t justify a replacement
Leaving blanks is recommended when missingness itself carries meaning or replacement would be unjustified.

5. Which choice best describes a safe way to fill missing values without guessing, as described in the chapter?

Show answer
Correct answer: Use controlled labels for categories or simple summaries for numbers when you can explain why it supports the analysis
Filling should use controlled, explainable rules (labels for categories; simple summaries for numbers), not guesses.

Chapter 4: Finding and Removing Duplicates the Right Way

Duplicates look harmless—“it’s just the same row twice”—but they quietly change totals, averages, and decisions. A duplicated customer can inflate revenue, a duplicated incident report can trigger a false alarm trend, and a duplicated inventory line can cause you to reorder what you already have. The goal of this chapter is not only to delete repeated lines, but to do it safely: identify what “same record” means in your dataset, review groups of suspected duplicates, and keep the right row on purpose.

A reliable duplicate workflow has three parts. First, define uniqueness: which column (or combination of columns) should identify one real-world thing. Second, find duplicates: exact copies and “same person with slight differences.” Third, resolve them: choose which record to keep, remove the rest, and add simple rules so duplicates stop appearing. Throughout, the key engineering judgment is this: deleting is irreversible, so you should always isolate duplicates, review them in a controlled way, and keep an audit trail (even if it’s just a backup copy of the sheet).

One practical habit will save you repeatedly: before you remove anything, make a copy of the original tab (e.g., rename it “raw_backup_2026-03-28”). Then do duplicate work in a new tab. That small step turns a risky cleaning step into a safe, repeatable process.

Practice note for Understand duplicates: exact vs. “same person, slightly different”: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Pick the columns that define a unique record: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Identify duplicate groups and review them safely: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Remove duplicates while keeping the correct row: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Practice note for Prevent duplicates with simple input rules: document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Sections in this chapter
Section 4.1: What a duplicate is (and why it happens)

A duplicate is two or more rows that represent the same real-world record. Sometimes they are literally identical (every cell matches), and sometimes they are “the same person, slightly different.” Both types matter, and the difference changes how you detect them. Exact duplicates are usually caused by copy/paste errors, importing the same file twice, appending data without de-duplication, or accidental double form submissions. Near-duplicates usually come from human variation: “Robert” vs “Bob,” “Acme Inc.” vs “ACME, Incorporated,” or extra spaces and punctuation that make two entries look different to a computer.

It’s worth naming the real harm: duplicates bias your analysis. Counts become inflated, conversion rates change, and downstream joins can explode rows (one duplicate on each side can multiply into many rows). A common mistake is to run a “Remove duplicates” tool on the whole sheet without thinking. If one column contains a timestamp or a notes field, then two rows that are truly the same customer may not be identical, so they won’t be removed. The opposite mistake is worse: removing “duplicates” based on too few columns and accidentally merging two different people who share a name.

Practical outcome: treat duplicates as a data modeling question first (“what is one record?”) and only then a cleanup action. When you can explain what a duplicate means in one sentence, you’re ready to move on.

Section 4.2: Unique keys: IDs, composite keys, and why they matter

The safest way to define duplicates is by a unique key: a column (or set of columns) that should be unique per record. If you have a stable ID like Customer_ID, Order_ID, or Ticket_Number, use it. IDs are designed for this job: they are consistent, short, and less ambiguous than names. In many spreadsheets, though, you don’t have a single ID. That’s when you create a composite key—a combination of columns that together define uniqueness.

For example, an attendance sheet might be unique by (Student Email + Date + Class). A sales line might be unique by (Invoice Number + Line Item). A contact list might be unique by (Email) rather than (First Name + Last Name), because names collide. The key design choice is balancing false positives and false negatives: too broad (many columns) and you miss duplicates; too narrow (few columns) and you incorrectly merge distinct records.

In practice, create a helper column named something like unique_key that concatenates fields in a consistent way. For instance: =LOWER(TRIM(A2))&"|"&LOWER(TRIM(B2)) to combine email and date. Using TRIM removes extra spaces; using LOWER avoids case-based mismatches. Once you have this key, you can count and filter duplicates with far more confidence than eyeballing rows.
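The helper-column formula has a direct scripting analog: build the normalized key, then do a COUNTIF-style tally to flag duplicate groups. The sample rows are hypothetical:

```python
# A composite key like =LOWER(TRIM(A2))&"|"&LOWER(TRIM(B2)), plus a
# COUNTIF-style tally marking every row that sits in a duplicate group.
from collections import Counter

rows = [
    {"email": "Ana@x.com ", "date": "2026-03-01"},
    {"email": "ana@x.com",  "date": "2026-03-01"},  # same person, messier input
    {"email": "bo@y.com",   "date": "2026-03-01"},
]

def unique_key(row):
    # TRIM + LOWER on email, joined to the date with a separator.
    return row["email"].strip().lower() + "|" + row["date"].strip()

counts = Counter(unique_key(r) for r in rows)
flags = [counts[unique_key(r)] > 1 for r in rows]  # True = in a duplicate group
print(flags)
```

The separator ("|") matters: without one, concatenated fields can collide ("ab"+"c" vs "a"+"bc"), which is the same pitfall as a spreadsheet concatenation without a delimiter.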

Section 4.3: Exact duplicates in spreadsheets (sort, filter, highlight)

Exact duplicates are the easiest to handle, but you still want a controlled workflow. Start by sorting or filtering so duplicates appear next to each other. If you have a unique key or ID column, sort by it first, then by a timestamp (if available) so you can see which entry is newer. If you do not have a key yet, temporarily sort by the most identifying columns (like email, order number, or address) to cluster likely matches.

Next, highlight duplicates rather than deleting immediately. Most spreadsheet tools have Conditional formatting → Duplicate values for a selected column. Apply it to your key column (preferred) or to a combination approach: highlight duplicates on Email, then separately on Phone, then compare. A more explicit method is to add a count column: use COUNTIF or COUNTIFS on the key and mark any row where the count is greater than 1. Once marked, filter to show only duplicate groups. This creates a “review queue” you can work through systematically.

Only after you can see the duplicate groups should you remove them. A common mistake is to remove duplicates across the entire table, which can preserve rows that differ by an irrelevant notes field (so nothing gets removed) or delete rows that differ in a meaningful field (so you lose real information). Practical outcome: you should be able to produce a filtered view showing just the duplicate groups, with a clear reason why each group is considered a duplicate.

Section 4.4: Near-duplicates (small differences) and manual review tactics

Near-duplicates are where most “real life” de-duplication effort goes. These are rows that refer to the same entity but don’t match exactly: typos, abbreviations, swapped first/last names, missing apartment numbers, or old vs new emails. Automated “Remove duplicates” tools usually won’t catch these, so your job is to make them easier to spot and safer to review.

Start with normalization helpers: create columns that standardize the fields you plan to compare. For names and emails: LOWER + TRIM. For phone numbers: remove spaces, dashes, and parentheses (many spreadsheets can do this with repeated SUBSTITUTE). For addresses: standardize common abbreviations (St vs Street) if your dataset needs it. Then compare the normalized versions rather than the raw input.

Next, use grouping tactics. Sort by normalized last name, then first name; or sort by normalized address; or sort by the first 5–8 characters of an email. This “clusters” similar records. Filter for blanks too—near-duplicates often have missing data in one row that is present in the other. When you find a suspected pair, don’t immediately delete: add a temporary review_status column with values like “keep,” “remove,” “merge,” “unclear.” That small step prevents accidental loss and forces you to make an explicit decision.
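The normalization-plus-flagging tactic above can be sketched as follows; the data, regex, and `review_status` values are illustrative, and the point is to compare normalized companion columns while leaving the raw values untouched:

```python
import pandas as pd

df = pd.DataFrame({
    "name":  ["  Ana Pop ", "ana pop"],
    "phone": ["(072) 260-6166", "0722606166"],
})

# Normalized companion columns: compare these, keep the raw columns as-is
df["name_norm"]  = df["name"].str.strip().str.lower()
df["phone_norm"] = df["phone"].str.replace(r"[()\s-]", "", regex=True)

# Flag suspected near-duplicates for manual review instead of deleting them
df["review_status"] = ""
dupe_mask = df.duplicated(subset=["name_norm", "phone_norm"], keep=False)
df.loc[dupe_mask, "review_status"] = "unclear"
print(df)
```

Both rows normalize to the same name and phone, so both get routed into the review queue; a human still decides keep, remove, or merge.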

Common mistake: trying to perfect-match near-duplicates without context. If two rows share a name but have different emails and cities, they may be two different people. Practical outcome: you can consistently surface likely near-duplicates and route them into a manual review process with clear labels.

Section 4.5: Choosing which record to keep (latest, most complete, trusted source)

Once you have a duplicate group, the real work is deciding which row is the “winner.” This should be a rule, not a gut feeling, and the rule should match your business purpose. Three practical tie-breakers cover most simple datasets: keep the latest record, keep the most complete record, or keep the record from the most trusted source.

Latest is common for contact lists and customer profiles: the newest entry may have an updated phone number or address. To apply it, sort duplicate groups by a timestamp (Created_At, Updated_At) descending and mark the top row as keep. Most complete is useful when duplicates exist because different systems capture different fields: one row has phone, another has address. You can score completeness by counting non-blank cells across key columns, then keep the highest score. Trusted source matters when you merge exports: a CRM entry might be more reliable than a marketing signup sheet. Add a Source column if you can, then prefer the authoritative system.

Sometimes you don’t want to “keep one and delete the rest”—you want to merge information. In a spreadsheet, merging can be manual: keep one row and copy missing fields from the other duplicates into it (documenting what you changed). If merging is frequent, consider adding a rule: “keep latest, but fill blanks from older rows before deleting them.” Common mistake: deleting duplicates without checking which row contains the only non-empty value for an important column. Practical outcome: every duplicate group ends with an intentional kept row and a defensible reason.
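The "latest" and "most complete" tie-breakers can be combined in a short pandas sketch (column names and sample data are illustrative; here the latest row also happens to score highest on completeness):

```python
import pandas as pd

df = pd.DataFrame({
    "email":      ["a@x.com",    "a@x.com",    "b@x.com"],
    "updated_at": ["2026-01-05", "2026-03-01", "2026-02-10"],
    "phone":      [None,         "0722606166", None],
    "city":       ["Cluj",       "Cluj",       "Iasi"],
})
df["updated_at"] = pd.to_datetime(df["updated_at"])

# Completeness score: count of non-blank key fields per row
df["score"] = df[["phone", "city"]].notna().sum(axis=1)

# "Keep latest" rule: sort so the newest row in each group comes first,
# then drop the rest of the group
winners = (
    df.sort_values(["email", "updated_at"], ascending=[True, False])
      .drop_duplicates(subset=["email"], keep="first")
)
print(winners[["email", "updated_at", "score"]])
```

Swapping the sort column from `updated_at` to `score` turns the same code into the "most complete wins" rule, which keeps the decision explicit rather than a gut feeling.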

Section 4.6: Simple prevention: validation lists and consistent forms

The best duplicate removal is the one you don’t have to do again. Prevention in spreadsheets can be simple and high impact. First, use data validation to limit free-text variation. For fields like Country, State, Department, or Product Category, use a validation list so people choose from the same options instead of typing “CA,” “Calif,” or “California.” Consistent categories reduce near-duplicates and make grouping more reliable.

Second, make input forms consistent. If you collect data via a shared sheet, define which columns are required, where IDs come from, and how dates should be entered. Even a short “How to enter data” note at the top of the sheet can reduce accidental re-entry. If you use a form tool, configure it to prevent multiple submissions where possible (for example, limiting one response per email) and ensure it captures a stable identifier like email or employee ID.

Third, add lightweight uniqueness checks. A practical approach is conditional formatting on your unique key column to highlight duplicates immediately. You can also add a “duplicate_flag” formula: if the key appears more than once, mark it. This turns duplicates into a visible issue at entry time instead of a surprise later.

  • Use IDs whenever possible; avoid relying on names for uniqueness.
  • Standardize key fields (trim spaces, consistent case) before comparing.
  • Highlight duplicates as data is entered, not only during cleanup.

Practical outcome: your dataset becomes easier to maintain, and duplicate cleanup becomes an occasional check rather than a recurring crisis.

Chapter milestones
  • Understand duplicates: exact vs. “same person, slightly different”
  • Pick the columns that define a unique record
  • Identify duplicate groups and review them safely
  • Remove duplicates while keeping the correct row
  • Prevent duplicates with simple input rules
Chapter quiz

1. Why does Chapter 4 treat duplicates as a serious data issue rather than a harmless repeated row?

Show answer
Correct answer: They can distort totals/averages and lead to wrong decisions like inflated revenue or false trends
Duplicates quietly change metrics and can trigger incorrect business actions.

2. What is the first step in a reliable duplicate-handling workflow described in the chapter?

Show answer
Correct answer: Define which column(s) make a record unique in your dataset
You must define what “same record” means before you can safely detect or remove duplicates.

3. Which approach best matches the chapter’s guidance for removing duplicates safely?

Show answer
Correct answer: Isolate suspected duplicate groups, review them in a controlled way, and keep an audit trail/backup
Deleting is irreversible, so the chapter emphasizes controlled review and an audit trail.

4. When duplicates include “same person, slightly different,” what does the chapter suggest you do?

Show answer
Correct answer: Treat them as part of duplicate finding, not just exact copies
Duplicate detection includes both exact matches and near-duplicates representing the same real-world entity.

5. What practical habit does the chapter recommend before removing any duplicates?

Show answer
Correct answer: Make a backup copy of the original tab (e.g., "raw_backup_2026-03-28") and work in a new tab
A backup plus working in a new tab makes the process safe and repeatable.

Chapter 5: Fixing Typos and Standardizing Text

Text fields look “easy” because they are readable, but they often create the most confusion in analysis. A numeric column typically has one way to be wrong (a bad number). A text column has many ways to disagree with itself: different spellings, extra spaces, inconsistent capitalization, punctuation differences, abbreviations, and even invisible characters copied from other systems. These issues quietly corrupt grouping and reporting—your pivot tables split one category into many, charts show duplicate-looking labels, and filters miss records you expected to match.

This chapter gives you a repeatable workflow for text cleanup that is safe and auditable. The goal is not to make text “pretty”; it is to make it consistent so counts, joins, and category-based decisions are trustworthy. You will learn how to spot typo patterns, standardize casing and whitespace, fix frequent misspellings with a reference list, normalize categories, and run quality checks to confirm the cleanup worked.

A practical mindset helps: (1) decide what the “standard” should be, (2) make changes using rules you can re-run, (3) avoid over-aggressive replacements that change meaning, and (4) verify with simple checks like unique lists and frequency counts. When you treat cleaning as a small engineering task—documented steps, limited scope, and validation—you reduce the risk of introducing new errors while removing old ones.

Practice note: the same discipline applies to every skill in this chapter (spotting common typo patterns and inconsistent naming; standardizing casing, spacing, and punctuation; fixing frequent misspellings with a reference list; normalizing categories; and running quality checks to confirm the cleanup worked). For each one, document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.

Section 5.1: Why typos break grouping and reporting

Grouping depends on exact matches. Computers do not know that “New York”, “new york”, “New York” (double space), and “New York,” (trailing comma) are “the same” unless you standardize them. This is why typos and inconsistent naming break pivot tables, dashboards, and summary statistics. Your “top customers” chart can quietly split one customer into multiple rows; your “orders by region” report can create extra regions that are really misspellings.

Start by spotting common patterns rather than hunting random mistakes. Common patterns include: extra spaces at the start/end; multiple internal spaces; inconsistent abbreviations ("St" vs "Street"); punctuation differences ("Co." vs "Co"); and a few recurring misspellings ("Califronia"). Another major pattern is inconsistent category granularity, such as mixing “USA”, “United States”, and “United States of America”. These are not “typos” exactly, but they create the same reporting problem.

Use an “inspect first” workflow: sort the column A→Z, scan for near-duplicates, and generate a unique list to see all distinct values. If your tool supports it, also review frequency counts; the rare values often contain the errors. Engineering judgement matters here: sometimes rare values are legitimate edge cases (a new product line), and sometimes they are just mistakes. Your job is to decide which is which before changing data.

Section 5.2: Trimming spaces and removing hidden characters

Whitespace is the most common invisible cause of mismatches. A value that looks correct can still fail to match because it contains leading/trailing spaces, non-breaking spaces, or line breaks copied from emails or PDFs. Fixing whitespace should be one of your first text-cleaning steps because it is low risk and improves downstream checks.

In spreadsheets, apply a trim operation to remove leading and trailing spaces and collapse repeated internal spaces when appropriate. In many tools, a TRIM-like function removes regular spaces but may not remove non-breaking spaces (often introduced by web pages). For those, you may need a replace step that targets the special character (commonly CHAR(160)) and replaces it with a normal space before trimming. Also watch for hidden newlines (line breaks) that can appear inside cells; these can be removed by replacing line break characters with a space.

  • Safe default: remove leading/trailing spaces everywhere in text columns.
  • Be careful: collapsing internal spaces can change meaning in free-text fields (comments), but is usually fine for categories like city, state, product, department.
  • Tip: after trimming, re-check the unique list—many “duplicates” disappear immediately.

Common mistake: trimming only a sample. If you clean 20 rows but forget the rest, your report still breaks. Prefer repeatable column-wide operations (formulas filled down, power query steps, scripts) rather than manual edits.
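A repeatable column-wide version of the whitespace cleanup, sketched in pandas with illustrative city values; the key steps are replacing non-breaking spaces (CHAR(160) in spreadsheets, U+00A0 here) and line breaks before trimming:

```python
import pandas as pd

df = pd.DataFrame({"city": ["New York ", "New\u00a0York", "New\nYork", "New  York"]})

# Replace non-breaking spaces and line breaks with normal spaces,
# collapse repeated internal spaces, then strip leading/trailing spaces
df["city_clean"] = (
    df["city"]
      .str.replace("\u00a0", " ")
      .str.replace("\n", " ")
      .str.replace(r" +", " ", regex=True)
      .str.strip()
)
print(df["city_clean"].unique())
```

All four variants collapse to a single value, which is exactly the "re-check the unique list after trimming" effect described in the tip above.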

Section 5.3: Standard casing rules (upper/lower/title case)

Case differences create fake categories: “apple” and “Apple” group separately in many systems. Pick a casing rule and apply it consistently. The “best” rule depends on how the text is used. If the column is a join key (e.g., customer email), you may want to keep the original but also create a normalized version for matching. If the column is a category label for charts, choose a readable standard like Title Case.

Practical casing guidelines:

  • Lower case for identifiers where case is not meaningful and consistency matters (emails, usernames). This simplifies matching.
  • Upper case for short codes (state/province abbreviations, airport codes) when that is the domain convention.
  • Title case for proper names and display labels (customer names, cities), with the caution that automated title-casing can mishandle “McDonald”, “O’Neil”, or acronyms.

Engineering judgement: avoid “beautifying” data that is not meant for display. For example, product SKUs or ticket IDs should not be title-cased. Also be wary of columns where capitalization carries meaning (some programming languages, certain part numbers). When uncertain, keep the raw column and create a cleaned companion column (e.g., city_clean) so your transformations are reversible and auditable.

After applying casing rules, re-run frequency counts. You want to see categories merge (fewer unique values) without losing legitimate distinctions.
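The companion-column pattern for casing can be sketched briefly (column names are illustrative; the choice of lower for identifiers and upper for short codes follows the guidelines above):

```python
import pandas as pd

df = pd.DataFrame({
    "email": ["Ana@Example.COM", "ana@example.com"],
    "state": ["ca", "Ca"],
})

# Keep the raw columns; add normalized companions so changes stay reversible
df["email_clean"] = df["email"].str.lower()   # identifiers: lower case
df["state_clean"] = df["state"].str.upper()   # short codes: upper case

# Fewer unique values after casing rules means categories merged as intended
print(df["email_clean"].nunique(), df["state_clean"].nunique())
```

Re-running the unique count on the cleaned columns is the quick check that categories merged without losing legitimate distinctions.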

Section 5.4: Find/replace safely and avoid accidental replacements

Find/replace is powerful and dangerous. It is easy to fix hundreds of rows in seconds—and also easy to change the wrong thing everywhere. Safe replacement is about scoping, testing, and using precise patterns. Always preview matches before applying changes, and keep a copy of the original column or file so you can roll back.

Common accidental replacements happen when the match pattern is too broad. Replacing “St” with “Street” can turn “St Louis” into “Street Louis”. Replacing “CA” with “California” can break words that contain “ca” (depending on case sensitivity). The fix is to match whole words, exact values, or specific positions (e.g., only if “St” appears at the end of a street address).

  • Prefer exact-value replacements for categories: replace only when the entire cell equals a known variant (e.g., “Califronia” → “California”).
  • Use case-sensitive rules when casing conveys meaning, or normalize case first and then replace.
  • Work in passes: trim/clean whitespace, then standardize case, then fix misspellings, then normalize categories.

Practical outcome: your replacement operations should be explainable as a list of rules. If you cannot describe the rule clearly (“I changed whatever looked wrong”), it will be hard to reproduce and hard to trust.
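The exact-value rule can be demonstrated in pandas, where `Series.replace` with a dict matches whole cell values only (sample data is illustrative, including the deliberately risky "St" mapping):

```python
import pandas as pd

df = pd.DataFrame({
    "address": ["12 St Louis Ave", "St"],
    "state":   ["Califronia", "CA"],
})

# Exact-value mapping: a cell changes only when its ENTIRE value matches,
# so the "St" inside "12 St Louis Ave" is never touched
df["state"]   = df["state"].replace({"Califronia": "California"})
df["address"] = df["address"].replace({"St": "Street"})
print(df)
```

The same mapping done as a substring find/replace would have produced "12 Street Louis Ave", which is the accidental-replacement failure mode described above.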

Section 5.5: Building a simple “lookup table” for standard names

When you have repeated variants (especially across multiple files or monthly reports), a lookup table is the most reliable approach. A lookup table is a two-column reference: raw_value and standard_value. Instead of manually fixing the same typo every week, you add it once to the table and apply the mapping each time you import data.

Build the table by extracting a unique list from the dirty column, then reviewing it for variants. Start with the highest-frequency values and the obvious near-duplicates. Map all known variants to a single standard name. For example, map “NY”, “New York”, “NewYork”, and “New York ” (with trailing space) to “New York”. For business entities, decide whether punctuation matters: do you want “Acme Inc” and “Acme, Inc.” to be the same? Make the decision explicit and consistent.

Implementation options:

  • Spreadsheet: use VLOOKUP/XLOOKUP to return the standard value; if not found, return the original so you can see unmapped values.
  • SQL/Python/Power Query: join/merge the dataset to the lookup table and replace values based on the match.

Key practice: track “unmapped” values (those not found in the lookup). Each refresh, review the new unmapped set and decide whether to add mappings or keep them as legitimate new categories. This turns text cleaning into a controlled, incremental process instead of a one-time cleanup.
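As the implementation options above mention, the lookup-table approach maps cleanly to Python; a minimal sketch with illustrative values, including the "return the original if not found" fallback and an unmapped flag:

```python
import pandas as pd

data = pd.DataFrame({"state": ["NY", "NewYork", "New York ", "Texas"]})
lookup = pd.DataFrame({
    "raw_value":      ["ny", "newyork", "new york"],
    "standard_value": ["New York", "New York", "New York"],
})

# Normalize before matching so casing and stray spaces don't defeat the lookup
data["key"] = data["state"].str.strip().str.lower()
mapping = dict(zip(lookup["raw_value"], lookup["standard_value"]))
data["state_std"] = data["key"].map(mapping)

# Track unmapped values for review, then fall back to the original
# (the same behavior as an XLOOKUP with an if-not-found argument)
data["unmapped"] = data["state_std"].isna()
data["state_std"] = data["state_std"].fillna(data["state"])
print(data[["state", "state_std", "unmapped"]])
```

"Texas" survives unchanged and is flagged, so each refresh you can decide whether it needs a new mapping or is a legitimate new category.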

Section 5.6: Final text validation: unique lists and frequency counts

Text cleaning is not done until you validate it. Validation catches two problems: (1) you did not fix enough (variants remain), or (2) you fixed too much (distinct values were incorrectly merged). Two simple checks cover most cases: unique lists and frequency counts.

First, generate a unique list for the cleaned column and scan it quickly. Look for near-duplicates that survived: differences in punctuation, spacing, pluralization, or abbreviations. This is where hidden characters show up—if you still see duplicates that “look” identical, investigate for non-printing characters or mixed Unicode punctuation (different apostrophes or dashes).

Second, compute frequency counts (value → number of rows). Compare counts before and after cleaning. You should see: fewer total unique values, higher counts on the standardized categories, and fewer singletons caused by typos. Pay attention to big shifts: if one category’s count drops unexpectedly, you may have accidentally mapped it elsewhere or removed characters that mattered.

  • Sanity check joins: if the cleaned text is used to join tables, measure join success rate (matched vs unmatched) before/after.
  • Spot-check samples: filter to a standardized value and inspect a handful of underlying rows to confirm they truly belong.
  • Document rules: record trimming, casing, replacements, and lookup mappings so the same cleanup can be repeated.

The practical outcome is confidence: your charts won’t double-count, your category filters will behave, and your dataset becomes easier to maintain. By combining safe text normalization steps with a lookup table and basic validation, you create a repeatable method you can use on any spreadsheet or simple dataset.
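The before-vs-after frequency check can be expressed as a couple of lines; the category values here are illustrative, standing in for a column before and after cleanup:

```python
import pandas as pd

before = pd.Series(["CA", "Calif.", "california", "NY", "CA"])
after  = pd.Series(["CA", "CA", "CA", "NY", "CA"])

# Expected signs of a good cleanup: fewer unique values,
# higher counts on the standardized categories
print("unique before:", before.nunique(), "| unique after:", after.nunique())
print("before:", before.value_counts().to_dict())
print("after: ", after.value_counts().to_dict())

# Sanity check: text cleanup must not change the number of rows
assert len(before) == len(after)
```

If the "after" counts show a category that dropped unexpectedly, that is the signal to check whether a mapping merged things it should not have.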

Chapter milestones
  • Spot common typo patterns and inconsistent naming
  • Standardize casing, spacing, and punctuation
  • Fix frequent misspellings using a reference list
  • Normalize categories so charts and counts make sense
  • Run quality checks to confirm text cleanup worked
Chapter quiz

1. Why can text fields cause more confusion in analysis than numeric fields?

Show answer
Correct answer: Text can disagree in many subtle ways (spelling, spaces, casing, punctuation), splitting groups and counts
The chapter explains that small text inconsistencies quietly corrupt grouping, reporting, and filtering.

2. What is the main purpose of cleaning and standardizing text in this chapter?

Show answer
Correct answer: To make text consistent so counts, joins, and category-based decisions are trustworthy
The goal is consistency for reliable analysis, not aesthetics.

3. Which workflow best matches the chapter’s recommended mindset for text cleanup?

Show answer
Correct answer: Decide a standard, apply repeatable rules, avoid meaning-changing replacements, then validate with checks
The chapter emphasizes defining standards, using re-runnable rules, being cautious about meaning, and verifying results.

4. How do typos and inconsistent naming typically show up in analysis outputs?

Show answer
Correct answer: Pivot tables and charts split one category into multiple duplicate-looking labels
Inconsistent text creates fragmented categories, misleading counts, and confusing labels.

5. Which quality checks are suggested to confirm text cleanup worked?

Show answer
Correct answer: Review unique value lists and frequency counts to ensure categories are normalized
The chapter recommends simple validation like unique lists and frequency counts after applying cleanup rules.

Chapter 6: Putting It All Together: A Repeatable Cleaning Workflow

In earlier chapters you learned how to spot missing values, duplicates, typos, and inconsistent formats—and how to fix each problem safely. This chapter turns those individual techniques into a workflow you can repeat on any small dataset (especially spreadsheets). The goal is not “perfect data.” The goal is a reliable process that produces a clearly improved dataset, keeps the original untouched, and leaves an audit trail that another person (or future you) can understand.

A repeatable cleaning workflow has four practical benefits. First, it reduces risk: you avoid accidental overwrites, hidden filter mistakes, and “fixes” that change meaning. Second, it makes your work faster: you don’t reinvent steps each time. Third, it improves trust: you can show what changed and what didn’t. Finally, it makes downstream analysis more stable: when categories are standardized and duplicates removed, charts and pivot tables stop “drifting” from one refresh to the next.

To keep this chapter concrete, imagine a small customer-orders spreadsheet with columns like Order_ID, Date, Email, State, Product, Quantity, and Revenue. It has blanks in Quantity, duplicate Order_IDs, “CA / Calif. / california” as multiple state values, and a few obvious typos in Product names. You will clean it end-to-end with a full checklist, document every change, generate a cleaned output, and produce a simple before-vs-after quality report—without needing code.

The core mindset: cleaning is a series of small decisions made with engineering judgment. Every decision has tradeoffs. Removing rows might reduce bias in some contexts, but it can also throw away rare cases. Filling missing values might be reasonable for a “State” field, but dangerous for “Revenue.” This chapter teaches you to make those tradeoffs explicit, consistent, and visible.

Practice note: the same discipline applies to each milestone in this chapter (cleaning a small dataset end-to-end with the full checklist; documenting every change so it is transparent and repeatable; producing a "cleaned data" output while keeping raw data untouched; building a simple before-vs-after quality report; and planning no-code automation next steps). For each one, document your objective, define a measurable success check, and run a small experiment before scaling. Capture what changed, why it changed, and what you would test next. This discipline improves reliability and makes your learning transferable to future projects.


Section 6.1: A practical order of operations (profile → fix → validate)

A cleaning workflow works best when you follow a consistent order of operations. A practical sequence for small datasets is: profile → fix → validate. Profiling means measuring and scanning before you touch anything. Fixing means applying changes in a controlled way. Validating means checking that your changes actually improved the dataset and didn’t create new problems.

Profile first. Do a quick shape check (rows/columns), then scan for common issues: blanks per column, obvious outliers (e.g., negative revenue), inconsistent formats (dates, phone numbers), and suspicious categories (e.g., “Calif.” plus “CA”). In a spreadsheet, use filters, sort, and pivot tables to quickly understand distributions. A common mistake is to begin “correcting” typos immediately; you might standardize a value that is actually meaningful (for example, “WA” could mean “Washington” or “Warranty Add-on” depending on the column).

Fix in an order that reduces rework. Often, start with structural issues that affect identity and joining: standardize IDs and key fields, then handle duplicates, then missing values, then text standardization and formats. For example, if duplicates exist, decide the “correct record” before filling blanks—otherwise you might fill data into a row you later delete. When fixing, use consistent rules: trim extra spaces, standardize case, apply the same mapping table for categories, and make one change type at a time so you can verify its impact.

Validate last. Re-run the same profiling checks you started with: blank counts per column, duplicate counts on your key fields, and unique category lists. Compare them to the "before" numbers so every fix can be shown to move a metric in the expected direction. If a count shifts in a way you cannot explain, stop and investigate before moving on; an unexplained change usually means a rule did more than you intended.

Section 6.2: Creating a cleaning log (what changed and why)

Cleaning is only trustworthy when it’s transparent. A cleaning log is your lightweight documentation that answers: What changed? Why did we change it? How was it changed? What was the impact? For small projects, a simple table in a separate sheet (or a text document) is enough. Treat it like a lab notebook: short, factual entries that someone else can replay.

A practical cleaning log format includes: Date, Step, Column(s), Rule/Action, Reason, Before count, After count, and Notes/Exceptions. Example entries: “Trim leading/trailing spaces in Product” (Reason: duplicates caused by whitespace); “Map State values {Calif., california} → CA” (Reason: standardize categories); “Remove duplicate Order_ID keeping latest Date” (Reason: de-duplication rule to keep most recent update); “Leave Revenue blanks as blank” (Reason: do not invent financial values).

Documenting “why” is where engineering judgment shows. If you fill missing values, record the method (median, mode, or a business rule) and the assumption behind it. If you remove rows, record the criteria and how many were removed. A common mistake is logging only the action (“deleted duplicates”) without the rule that defines the correct record. Without the rule, the result is not repeatable and cannot be audited.

The log also protects you from subtle spreadsheet errors. If a filter was on when you pasted changes, you may have edited only a subset of rows. By recording before/after counts and spot-checking, you catch these issues early. The practical outcome: your cleaned dataset becomes defensible, and collaboration becomes easier because disagreements can focus on rules, not on guessing what happened.
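The log format above can also be kept as a small table built in code, which makes the before/after counts easy to review or export. A minimal sketch; the dates, counts, and rules are illustrative:

```python
import pandas as pd

# Cleaning log as a list of factual entries: one dict per rule applied
log_entries = []

def log_step(date, step, column, rule, reason, before, after):
    """Record what changed, why, how, and the row counts around the change."""
    log_entries.append({"date": date, "step": step, "column": column,
                        "rule": rule, "reason": reason,
                        "before": before, "after": after})

log_step("2026-03-28", 1, "Product", "trim leading/trailing spaces",
         "whitespace caused duplicate-looking values", before=412, after=412)
log_step("2026-03-28", 2, "Order_ID", "drop duplicates, keep latest Date",
         "de-duplication rule: keep most recent update", before=412, after=405)

log = pd.DataFrame(log_entries)  # review or export alongside the cleaned data
print(log)
```

Because each entry records the rule, not just the action, another person can replay the cleanup or challenge a specific decision.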

Section 6.3: Separating raw, working, and cleaned files

A repeatable workflow always separates raw, working, and cleaned data. This is less about bureaucracy and more about safety. The raw file is your immutable source of truth. The working file is where you experiment and apply fixes. The cleaned file is the output you share or analyze.

Start by saving the original dataset as RAW and never editing it. If you’re using spreadsheets, store raw data in a protected tab (or a separate file) and restrict edits. Then create a WORKING copy where you perform the profile → fix → validate steps. Finally, export a CLEANED version with only necessary sheets and clear column names, ready for analysis.

Use simple naming conventions: orders_RAW_2026-03-28.xlsx, orders_WORKING_2026-03-28.xlsx, orders_CLEANED_2026-03-28.xlsx. If you do multiple iterations, add a version suffix (v1, v2) and keep a short changelog in the file or folder. A common mistake is cleaning “in place” and then discovering later that you need to revisit an assumption; without raw data, you cannot recover what was changed.

When producing the cleaned output, remove intermediate helper columns unless they are useful to downstream users (for example, keep a “State_Standardized” column if stakeholders want to compare original vs standardized). If you must keep both, label clearly: State_raw and State_clean. The practical outcome is a pipeline you can trust: raw stays untouched, working remains flexible, and cleaned is stable and shareable.

Section 6.4: Basic QA metrics: blanks, duplicates, category counts

A cleaning workflow is incomplete without a basic quality report. You don’t need complex statistics; you need a few simple metrics that show “before vs. after” improvements and highlight remaining risks. Think of this as your QA snapshot: fast to compute, easy to understand, and directly tied to common data problems.

At minimum, track three families of metrics. Blanks: count missing values per column (and optionally the percentage). In spreadsheets, you can use COUNTBLANK or a pivot-like summary. Be careful: blanks can hide as spaces (“ ”) or special values like “N/A” and “null.” Part of standardization is turning those placeholders into true blanks or into a consistent “Unknown” category—depending on what your analysis needs.
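A blank count that also catches hidden placeholders can be sketched as follows. The marker list, column names, and sample rows are assumptions to adapt to your own data; the key idea is normalizing (trim, lowercase) before testing for blankness.

```python
from collections import Counter

# Placeholders that often hide as "blanks" in exported spreadsheets.
BLANK_MARKERS = {"", "n/a", "na", "null", "none", "-"}

def count_blanks(rows, columns):
    """Count blank-like values per column, treating whitespace-only
    cells and common placeholder strings as blank."""
    blanks = Counter()
    for row in rows:
        for col in columns:
            value = str(row.get(col, "") or "").strip().lower()
            if value in BLANK_MARKERS:
                blanks[col] += 1
    return blanks

# Tiny illustrative dataset (column names are hypothetical).
sample_rows = [
    {"State": "CA",   "Quantity": "3"},
    {"State": "  ",   "Quantity": "N/A"},
    {"State": "null", "Quantity": ""},
]
print(count_blanks(sample_rows, ["State", "Quantity"]))
```

Run this once before cleaning and once after; the drop per column is your "before vs. after" blanks metric.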

Duplicates: count duplicates of key fields (e.g., Order_ID, Email+Date). Report how many duplicate groups exist and how many rows are involved. After de-duplication, confirm the duplicate count is zero (or explain why some duplicates are allowed, such as multiple items per order). A common mistake is using “Remove Duplicates” without defining the key; you might delete legitimate rows that share the same customer email.
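The "defined key plus keep-rule" pattern can be sketched like this, assuming a hypothetical orders table where the keep-rule is "latest Date wins" (ISO date strings compare correctly as text):

```python
from collections import defaultdict

def dedupe_keep_latest(rows, key_fields, date_field):
    """Keep one row per key: the one with the latest date.
    Also report how many keys had duplicates, for the QA metric."""
    groups = defaultdict(list)
    for row in rows:
        key = tuple(row[f] for f in key_fields)
        groups[key].append(row)
    # ISO-format dates (YYYY-MM-DD) sort correctly as strings.
    kept = [max(g, key=lambda r: r[date_field]) for g in groups.values()]
    dup_groups = sum(1 for g in groups.values() if len(g) > 1)
    return kept, dup_groups

# Hypothetical orders; Order_ID A1 appears twice.
orders = [
    {"Order_ID": "A1", "Date": "2026-03-01", "Revenue": 10},
    {"Order_ID": "A1", "Date": "2026-03-05", "Revenue": 10},  # later copy wins
    {"Order_ID": "B2", "Date": "2026-03-02", "Revenue": 25},
]
kept, dup_groups = dedupe_keep_latest(orders, ["Order_ID"], "Date")
print(len(kept), dup_groups)  # 2 rows kept, 1 duplicate group
```

Note that the key is explicit (`["Order_ID"]`): switching it to `["Email"]` would merge legitimate rows, which is exactly the mistake the paragraph above warns about.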

Category counts: list unique values and frequencies for fields like State, Product, Channel, or Status. Before cleaning, you might have 30 state values because of typos and casing; after cleaning, you should have a smaller, expected set. Compare top categories before vs. after to ensure mappings did not accidentally merge distinct groups. Add one or two “sanity totals” where appropriate (sum of revenue, total quantity). If totals change, the report should explain why (for example, removed duplicate orders reduced revenue inflation).
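A before/after category comparison can be sketched with a frequency counter and a mapping table. The state labels and mapping below are illustrative assumptions, not a complete table:

```python
from collections import Counter

# Hypothetical mapping table from messy labels to standard codes.
STATE_MAP = {"calif.": "CA", "california": "CA", "ca": "CA",
             "n.y.": "NY", "ny": "NY"}

raw_states = ["Calif.", "CA", "california", "NY", "n.y."]

before = Counter(s.strip().lower() for s in raw_states)
after = Counter(STATE_MAP.get(s.strip().lower(), s.strip())
                for s in raw_states)

print(len(before), "distinct raw labels")   # 5 distinct raw labels
print(after)                                # collapses to CA and NY
```

Comparing `before` and `after` side by side is the check that mappings collapsed typos without merging genuinely distinct categories.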

The practical outcome is a one-page report you can attach when sharing cleaned data: it demonstrates improvement, communicates remaining limitations, and gives stakeholders confidence that the dataset is fit for its intended use.

Section 6.5: Sharing cleaned data responsibly (notes, assumptions, limits)

Cleaned data is not just a file—it’s a set of decisions. Sharing responsibly means packaging the dataset with the context needed to use it correctly. This is especially important when cleaning involved filling missing values, dropping records, or standardizing categories based on judgment.

Include a short Read Me (a worksheet tab or a separate document) that lists: purpose of the dataset, date of extraction, cleaning date, who performed the cleaning, and where the raw data lives. Then summarize key assumptions: “Duplicates defined as same Order_ID; kept the row with latest Date,” “State standardized to two-letter codes,” “Revenue blanks left blank; no imputation performed,” or “Removed 12 test records where Email ended with @example.com.” These notes prevent misinterpretation and reduce repeated questions.

Also state limits. For example: “Some missing Quantity values remain because there was no reliable source to infer them,” or “Product names were standardized using a manual mapping table; rare items may still contain typos.” A common mistake is presenting cleaned data as “final” without acknowledging these edges; downstream users may treat it as ground truth and build fragile decisions on top.

When sharing, prefer the cleaned output plus the quality report and cleaning log, not the working file full of intermediate steps. If you must share working materials, clearly label them as non-final. The practical outcome is a cleaned dataset that can be reused safely, audited later, and extended without undoing earlier decisions.

Section 6.6: Your reusable template: checklist + rules + sign-off

To make this workflow repeatable, turn it into a reusable template: a checklist, a small set of standard rules, and a sign-off step. This is the bridge from “I cleaned this once” to “I can clean any similar spreadsheet reliably.” Keep it short enough that you’ll actually use it.

A practical checklist can follow the same three phases.
  • Profile: confirm row/column counts, identify key fields, count blanks per column, list top categories for major text fields, and check for obvious invalid values.
  • Fix: standardize formats (dates, casing, whitespace), resolve duplicates using a defined key and keep-rule, address missing values using approved methods (leave/remove/fill) per column, and apply typo/category mapping consistently.
  • Validate: re-run blank/duplicate/category metrics, spot-check a sample of records, and confirm totals or pivot results match expectations.

Next, define a small set of rules you reuse across datasets: “Trim whitespace in all text columns,” “Standardize unknown markers (N/A, none, null) to blank,” “Use a mapping table for categories rather than manual find/replace,” and “Never impute financial fields without an explicit business rule.” These rules prevent common mistakes like inconsistent edits across columns or silent meaning changes.
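Several of these standard rules can be composed into a single text-cleaning function. This is a minimal sketch, assuming a hypothetical marketing-channel column and mapping table; the marker set mirrors the rules listed above.

```python
# Unknown markers that the standard rules convert to a true blank.
UNKNOWN_MARKERS = {"", "n/a", "na", "none", "null", "-"}

def clean_text(value, mapping=None):
    """Apply the standard text rules in order: trim whitespace,
    convert unknown markers to a true blank, then apply a mapping
    table (if given) instead of ad-hoc find/replace."""
    v = str(value).strip()
    if v.lower() in UNKNOWN_MARKERS:
        return ""
    if mapping:
        return mapping.get(v.lower(), v)
    return v

# Hypothetical mapping table for a Channel column.
channel_map = {"fb": "Facebook", "facebook ads": "Facebook",
               "ig": "Instagram"}

print(clean_text("  FB ", channel_map))   # Facebook
print(clean_text("N/A"))                  # "" (true blank)
print(clean_text("Email", channel_map))   # Email (no mapping entry, kept as-is)
```

Because the mapping lives in one table, extending it (for a newly spotted typo) changes every affected cell consistently, which is exactly what manual find/replace cannot guarantee.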

Finally, add a sign-off step: you (or a reviewer) confirm that the cleaning log is complete, the cleaned file has been generated, and the QA report is attached. If you’re planning next steps toward automation without code, start small: build reusable mapping tables, standard pivot-based QA summaries, and consistent file naming. Later, these same steps can be translated into repeatable tools, but the workflow itself stays the same. The practical outcome is a template you can apply in minutes, producing cleaned data that is transparent, stable, and ready for analysis.

Chapter milestones
  • Clean a small dataset from start to finish with the full checklist
  • Document every change so it’s transparent and repeatable
  • Create a “cleaned data” output and keep raw data untouched
  • Build a simple quality report (before vs. after metrics)
  • Plan next steps: automation ideas without needing code
Chapter quiz

1. What is the main goal of the repeatable cleaning workflow in Chapter 6?

Show answer
Correct answer: Use a reliable process that improves the dataset, keeps raw data untouched, and leaves an audit trail
The chapter emphasizes a repeatable, transparent process—not perfection—while preserving the original data and documenting changes.

2. Which practice best supports transparency and repeatability when cleaning a dataset?

Show answer
Correct answer: Documenting every change so another person (or future you) can understand what happened
An audit trail requires recording what changed and why, not just the final output.

3. Why does the chapter recommend creating a "cleaned data" output while keeping the raw data untouched?

Show answer
Correct answer: To reduce the risk of accidental overwrites and preserve a recoverable baseline
Keeping raw data unchanged lowers risk and lets you trace or redo steps if needed.

4. What is the purpose of a simple quality report with before-vs-after metrics?

Show answer
Correct answer: To show what improved (and what didn’t) to build trust and stability for downstream analysis
Before/after metrics make improvements visible and help others trust and reuse the cleaned dataset.

5. Which statement best reflects the chapter’s mindset about cleaning decisions?

Show answer
Correct answer: Cleaning is a series of small judgment calls where tradeoffs should be made explicit and consistent
The chapter stresses engineering judgment: decisions like removing rows or filling values can help or harm depending on context, so tradeoffs must be explicit.