Skip to content

Latest commit

 

History

History
78 lines (63 loc) · 1.79 KB

File metadata and controls

78 lines (63 loc) · 1.79 KB

Machine Learning Data Visualization Sample Evaluation

This repository showcases various approaches to visualizing machine learning data using Python. It includes examples of scatter plots, bar charts, pie charts, histograms, and advanced techniques like sunburst charts and radar plots. These visualizations help understand patterns, trends, and insights in datasets. I will further update the last version on my Kaggle account.

Features

  • Data Preparation: Includes code for loading, cleaning, and transforming datasets.
  • Visualization Libraries: Demonstrates the use of popular Python libraries:
    • Pandas
    • Seaborn
    • Matplotlib
    • Plotly
  • Charts and Plots:
    • Scatter Plots
    • Box Plots
    • Histograms
    • Sunburst Charts
    • Pie Charts
    • Heatmaps
  • Interactive Features: Integration of Plotly for dynamic and interactive visualizations.

Datasets

  • Heart Disease Dataset
  • Gapminder Dataset
  • Job Market Dataset (Malaysia)

Code Examples

Scatter Plot

import plotly.express as px

fig = px.scatter(
    df,
    x="chol",
    y="age",
    color="cp",
    size="oldpeak",
    hover_name="exang"
)
fig.update_layout(title_text="Cholesterol vs Age (colored by Chest Pain)")
fig.show()

Bar Chart

fig = px.bar(
    df,
    x="age",
    y="chol",
    color="sex",
    height=400
)
fig.show()

Getting Started

  1. Clone the repository:
    git clone https://github.com/raccomandino/machine_learning_sample.git
    cd machine_learning_sample
  2. Install the required libraries:
    pip install pandas numpy seaborn matplotlib plotly
  3. Run the Jupyter Notebook or Python scripts:
    jupyter notebook

License

This project is licensed under the MIT License - see the LICENSE file for details.