This project focuses on cleaning and preparing a real-world layoffs dataset using SQL.
The goal was to transform raw, inconsistent data into a structured and analysis-ready format.
- Identify and remove duplicate records
- Handle missing and null values
- Standardize inconsistent data entries
- Convert data types for accurate analysis
- Microsoft SQL Server (SSMS)
- SQL (T-SQL)
- Layoffs dataset containing company, industry, location, and workforce reduction data
- Identified duplicate rows using SQL queries
- Removed duplicates to ensure data accuracy
- Addressed NULL values in key columns
- Applied appropriate transformations where needed
- Cleaned inconsistent entries (e.g., country names, industry labels)
- Ensured uniform formatting across dataset
- Converted columns into appropriate data types (dates, integers, etc.)
- Cleaned dataset ready for analysis
- Improved data consistency and reliability
- Reduced errors caused by duplicates and missing values
- Real-world data cleaning techniques in SQL
- Handling inconsistent and incomplete datasets
- Writing structured and readable SQL queries
- Preparing data for downstream analysis