UFC Network Analysis Project

A network science analysis of UFC fighter interactions, rivalries, and performance metrics using graph theory and machine learning.

Project Overview

This project analyzes UFC fight data through the lens of network science, constructing and analyzing fighter networks to understand:

Fighter rivalries and competition patterns (undirected rivalry graph)
Win-loss dominance hierarchies (directed dominance graph)
Network metrics and their relationship to fighter performance
Community structure within the fighter ecosystem
Temporal evolution of the fighter network
Comparison with theoretical null models (ER, BA, WS)

Project Structure

NetworkScienceProject/
├── data
│   ├── data_raw/              # Raw UFC dataset (ufc-master.csv)
│   └── data_processed/        # Processed data, graphs, and metrics
├── src/ufcnet/           # Main Python package
│   ├── config.py         # Project paths and configuration
│   ├── utils.py          # Logging and utility functions
│   ├── data_loading.py   # Data loading and parsing
│   ├── cleaning.py       # Data cleaning and preprocessing
│   ├── build_graphs.py   # Network construction
│   ├── metrics_static.py # Network metrics and centralities
│   ├── performance_models.py  # Fighter performance analysis
│   ├── null_models.py    # Temporal and null model analysis
│   └── scripts/          # CLI entry points
│       ├── preprocess_data.py
│       ├── build_networks.py
│       ├── compute_metrics.py
│       ├── analyze_performance.py
│       └── run_null_models.py
├── tests/                # Unit tests
├── figures/              # Generated plots and visualizations
├── reports/              # Analysis reports and model results
├── slides/               # Presentation materials
└── requirements.txt      # Python dependencies

Installation

Prerequisites

Python 3.8+
pip or conda for package management

Setup

Clone the repository:

git clone <repository-url>
cd NetworkScienceProject

Create and activate a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install dependencies:

pip install -r requirements.txt

Download the UFC dataset: The dataset will be downloaded from Kaggle when you run the preprocessing script.

Usage

The project is organized into 5 main phases, each with a corresponding script:

Phase 1: Data Preprocessing

Loads raw UFC data, cleans columns, creates fighter IDs, and generates structured datasets.

python src/ufcnet/scripts/preprocess_data.py \
    --input data_raw/ufc-master.csv \
    --output-dir data_processed/ \
    --seed 42

Outputs:

data_processed/ufc_clean.csv - Cleaned full dataset
data_processed/fights.csv - Fight-level table
data_processed/fighters_lookup.csv - Fighter ID mappings

Phase 2: Network Construction

Builds rivalry (undirected) and dominance (directed) graphs from fight data.

python src/ufcnet/scripts/build_networks.py \
    --input data_processed/fights.csv \
    --output-dir data_processed/ \
    --seed 42

Outputs:

data_processed/rivalry.gexf / rivalry.gpickle - Rivalry graph
data_processed/dominance.gexf / dominance.gpickle - Dominance graph

Phase 3: Network Metrics & Communities

Computes global statistics, node centralities, and community structure.

python src/ufcnet/scripts/compute_metrics.py \
    --rivalry-graph data_processed/rivalry.gpickle \
    --dominance-graph data_processed/dominance.gpickle \
    --output-dir data_processed/ \
    --seed 42

Outputs:

data_processed/global_stats.json - Network-level statistics
data_processed/centralities.csv - Node centrality measures
data_processed/communities.csv - Community assignments

Phase 4: Performance Analysis & Modeling

Analyzes fighter performance and trains predictive models using network features.

python src/ufcnet/scripts/analyze_performance.py \
    --fights data_processed/fights.csv \
    --centralities data_processed/centralities.csv \
    --communities data_processed/communities.csv \
    --output-dir data_processed/ \
    --reports-dir reports/ \
    --seed 42

Outputs:

data_processed/fighters_performance.csv - Fighter performance metrics
data_processed/fighters_merged.csv - Combined dataset for modeling
reports/model_results.json - Regression model results

Phase 5: Temporal & Null Models

Builds temporal snapshots and compares against theoretical null models.

python src/ufcnet/scripts/run_null_models.py \
    --fights data_processed/fights.csv \
    --rivalry-graph data_processed/rivalry.gpickle \
    --output-dir data_processed/ \
    --seed 42

Outputs:

data_processed/temporal_stats.csv - Year-over-year network statistics
data_processed/null_vs_real_stats.csv - Null model comparisons

Development Phases

Phase 0: Base setup (repo structure, config, script skeletons) ✓
Phase 1: Data preprocessing (loading, cleaning, fighter IDs)
Phase 2: Network construction (rivalry & dominance graphs)
Phase 3: Network metrics & communities
Phase 4: Performance modeling
Phase 5: Temporal analysis & null models

Contributing

This is a network science course project. See individual phase documentation for task assignments.

License

MIT License (or specify your license)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
data/raw		data/raw
src/ufcnet		src/ufcnet
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

UFC Network Analysis Project

Project Overview

Project Structure

Installation

Prerequisites

Setup

Usage

Phase 1: Data Preprocessing

Phase 2: Network Construction

Phase 3: Network Metrics & Communities

Phase 4: Performance Analysis & Modeling

Phase 5: Temporal & Null Models

Development Phases

Contributing

License

About

Uh oh!

Releases

Packages

Contributors 4

Uh oh!

Languages

damianpostaru/NetworkScienceProject

Folders and files

Latest commit

History

Repository files navigation

UFC Network Analysis Project

Project Overview

Project Structure

Installation

Prerequisites

Setup

Usage

Phase 1: Data Preprocessing

Phase 2: Network Construction

Phase 3: Network Metrics & Communities

Phase 4: Performance Analysis & Modeling

Phase 5: Temporal & Null Models

Development Phases

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

Packages