Data cleaning

An automated data cleaning pipelines is a robust solution designed to ingest, clean, and transform large datasets in various formats (CSV, Excel, PDF) for downstream analysis or storage. The application standardize the import process, validates data, applies a modular cleaning pipeline, and export the cleaned data as an Excel file for further analysis. The application is built using Spring Boot and thymeleaf for simple user interface. It supports uploading files, viewing cleaned data, and analyzing data quality through intuitive web interface.

The application has the following features:

Data ingestion service for supported file formats (CSV, Excel, JSON) and automatically standardize column names and handles empty values uniformly.
Modular cleaning pipeline:
- Remove special characters
- Normalize whitespace
- Handle missing values
- Remove duplicates
- Data type inference
- Outlier detection
- Categorical standardization
Exports cleaned data to Excel format for download
Data quality monitoring: Display comprehensive data quality reports to track total records, missing value count, unique value count

Build and Run the application

Build project

mvn clean install

Run the application

mvn spring-boot:run

The backend will be available at http://localhost:8080.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.mvn/wrapper		.mvn/wrapper
src		src
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
img.png		img.png
img_1.png		img_1.png
mvnw		mvnw
mvnw.cmd		mvnw.cmd
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data cleaning

The application has the following features:

Build and Run the application

Screenshots

About

Uh oh!

Releases

Packages

Uh oh!

Languages

niyiment/data-cleaning

Folders and files

Latest commit

History

Repository files navigation

Data cleaning

The application has the following features:

Build and Run the application

Screenshots

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages