Skip to content

lstcyr25/Layoffs-Data-Cleaning-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

15 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Layoffs Data Cleaning (SQL)

πŸ“Š Overview

This project focuses on cleaning and preparing a real-world layoffs dataset using SQL.

The goal was to transform raw, inconsistent data into a structured and analysis-ready format.


🎯 Objectives

  • Identify and remove duplicate records
  • Handle missing and null values
  • Standardize inconsistent data entries
  • Convert data types for accurate analysis

πŸ› οΈ Tools & Technologies

  • Microsoft SQL Server (SSMS)
  • SQL (T-SQL)

πŸ“ Dataset

  • Layoffs dataset containing company, industry, location, and workforce reduction data

🧹 Data Cleaning Process

1. Removing Duplicates

  • Identified duplicate rows using SQL queries
  • Removed duplicates to ensure data accuracy

2. Handling Missing Values

  • Addressed NULL values in key columns
  • Applied appropriate transformations where needed

3. Standardizing Data

  • Cleaned inconsistent entries (e.g., country names, industry labels)
  • Ensured uniform formatting across dataset

4. Data Type Conversion

  • Converted columns into appropriate data types (dates, integers, etc.)

πŸ“Š Result

  • Cleaned dataset ready for analysis
  • Improved data consistency and reliability
  • Reduced errors caused by duplicates and missing values

🧠 What I Learned

  • Real-world data cleaning techniques in SQL
  • Handling inconsistent and incomplete datasets
  • Writing structured and readable SQL queries
  • Preparing data for downstream analysis

About

SQL-based data cleaning project transforming raw layoffs data into a structured and analysis-ready dataset

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors