Testing Order Flow Imbalance in Bitcoin Markets

Overview

This project stress-tests a high-frequency trading (HFT) strategy based on Order Flow Imbalance (OFI) across six major cryptocurrency exchanges.

Data Preprocessing

Raw trade data is processed through a three-step pipeline to handle fragmented liquidity:

Unit Normalization: Converts Deribit's inverse futures into standard BTC quantities.
Winsorization: Clips extreme outliers (1st/99th percentiles) to prevent block trades from distorting the signal.
Z-Score Standardization: Normalizes the signal via a rolling window to measure relative deviation from the mean.

Strategy Logic

The strategy translates order flow into trade decisions systematically:

Target Construction: Calculates the "Forward Return" over a set horizon (e.g., 1 second).
Model Training: Splits data chronologically (40% Train / 60% Test) and fits a Linear Regression to derive a predictive Beta coefficient.
Dynamic Thresholding: Applies the Beta to the out-of-sample set. Trades trigger only if the predicted return exceeds a dynamic "J-Threshold" (targeting the top 5% of opportunities).

Conclusion: Performance & Risk

The backtest reveals a high-alpha signal severely undermined by directional risk and scaling limits.

Drawdown Risk: On OKX, optimized settings generated a Net P&L of $575,712. However, the Maximum Drawdown was -$1.2M.
Missing Exit Logic: Without a formal stop-loss, the model averaged down into losing trends, ultimately suffering a catastrophic -$62.3M drawdown.
Scalability Frictions: Real-world network latency inevitably causes slippage, destroying HFT margins. Furthermore, scaling position limits (e.g., from 10 BTC to 100 BTC) hits a strict liquidity ceiling, triggering immediate adverse selection.

Data Notice

The raw tick and order book data used for this analysis are stored in massive .parquet files that exceed GitHub's repository limits. To comply with these size constraints, the underlying data files have been excluded from this repository.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Order_Flow_Imbalance.ipynb		Order_Flow_Imbalance.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Testing Order Flow Imbalance in Bitcoin Markets

Overview

Data Preprocessing

Strategy Logic

Conclusion: Performance & Risk

Data Notice

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Testing Order Flow Imbalance in Bitcoin Markets

Overview

Data Preprocessing

Strategy Logic

Conclusion: Performance & Risk

Data Notice

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages