This repository hosts the Graph4ASD Challenge, a graph machine learning competition focused on Autism Spectrum Disorder (ASD) classification using resting-state fMRI functional connectivity data from the ABIDE dataset.
Participants must design and train Graph Neural Network (GNN) models to classify subjects as:
- ASD
- Typical Control (TC)
based on brain connectivity graphs.
Each subject is represented as a graph where:
- Nodes → brain regions (Craddock 200 atlas)
- Edges → functional connectivity between brain regions
Leaderboard scores are automatically updated.
The evaluation metric is Macro F1-Score:
Rankings are sorted by descending score.
Each sample corresponds to one subject’s brain graph.
[ G = (A, X) ]
Where:
- A → adjacency matrix (functional connectivity)
- X → node feature matrix (identity features)
- Nodes → 200 brain regions (Craddock atlas)
Train a model that correctly predicts whether a subject:
- has Autism Spectrum Disorder (ASD)
- or is a Typical Control (TC)
using:
- Graph structure
- Node features
This is a graph-level classification task.
The dataset is derived from:
Autism Brain Imaging Data Exchange (ABIDE)
ABIDE aggregates anonymized rs-fMRI data from 17 international sites.
| Property | Value |
|---|---|
| Subjects | 1,009 |
| ASD | 516 (51.14%) |
| Controls | 493 (48.86%) |
| Brain atlas | Craddock 200 |
Each subject is stored as a graph:
- Adjacency matrix (A) → connectivity between brain regions
- Feature matrix (X) → node-level features
All graphs share:
- Same number of nodes → 200
- Same node ordering → Craddock atlas
The dataset is hosted in folder data/public.
Train your model using:
- Graphs: Adjacency matrix and node feature matrix in
data/public/adj_train.npyanddata/public/node_features.npy. Each - labels in
data/public/train_label.csv
Predict labels for every graph in:
data/public/adj_test.npyanddata/public/node_features_test.npy
Create a metadata.json that contains metadata about your submission:
{
"team": "example_team",
"run_id": "example_run_id",
"type": "human", // must be "human", "llm-only", or "human+llm"
"model": "GAT",
"notes": "Additional notes"
}Create a CSV with columns id and y_pred (same format as data/public/sample_submission.csv):
id,y_pred
1,0
2,0
3,1
...Encrypt your predictions.csv using: python extra/encrypt.py predictions.csv.
Then put predictions.csv.enc and metadata.json in the submissions/ folder.
Note: ❗ Do NOT upload the raw CSV. You need to submit an encrypted version of your predictions file to keep privacy.
Commit and push the predictions.csv.enc and metadata.json to the repository. Then, open Pull Request or push to the main branch. The automated pipeline will decrypt and score your submission to update the leaderboard.
- You are free to use any Graph Neural Network.
- You are not allowed to use external data.
- You must not try to identify each subject.
Cameron Craddock, Yassine Benhajali, Carlton Chu, Francois Chouinard, Alan Evans, András Jakab, Budhachandra Singh Khundrakpam, John David Lewis, Qingyang Li, Michael Milham, Chaogan Yan, Pierre Bellec (2013). The Neuro Bureau Preprocessing Initiative: open sharing of preprocessed neuroimaging data and derivatives. In Neuroinformatics 2013, Stockholm, Sweden.