This Project is created as part of Introduction to Machine Learning course included in Data Science Studies at Warsaw University of Technology. We are using the dataset from Kaggle: • https://www.kaggle.com/datasets/spicemix/soccer-detailed-players-match-data?resource=download&select=players_away_matches.csv.
Our project goal is to find the best possible clustering model for analysing player performance data in football matches in order to provide insight to teams, coaches, analysts, scouting agencies and betting companies
Anna Ostrowska
Mateusz Iwaniuk
Paweł Florek
Mateusz Deptuch
Can be found in the folder "EDA".
Can be found in the folder "feature_validation".
Includes the final model selected by us for the realisation of the business plan. Can be found in the "final" folder.
Reports are written in Polish.
They can be found in "raport" and "raport_walidacyjny" file.
We have also report about our validator work for the other team.
The final presentation in Polish can be found in the "Ocena występów piłkarzy" file.