- GitHub Repo for Hacklytics'24 - GT Hackathon!
- We are a team of 4, building 3Q4F (Quant, Quantum, and Q-Learning for Finance), a quantum algorithm-based trading platform, offering superior performance compared to Reinforcement Learning (RL) methods, revolutionizing stock trading.
- Classical trading methods often suffer from limited optimization capabilities and slower decision-making processes, leading to suboptimal results in dynamic market environments. Reinforcement learning (RL) was introduced to address these shortcomings by enabling sequential decision-making and adaptive strategies based on technical indicators.
However, RL algorithms may struggle with scalability, convergence issues, and susceptibility to noise, prompting the exploration of quantum computing (QC) for its potential to offer faster computation, improved optimization, and greater robustness in stock trading applications.
- Taking inspiration from [1] where they theoriesed about Quantum Superposition States and Circuits for the action space, we built a Quantum Reinforcement Learning based Trader as an Enhancement of an RL based trader.
- There is theoretical evidence on the quadratic speedup for convergence of the RL algorithm on employing Quantum Computing [2]
- Classical trading methods are slow and lack optimization, prompting the introduction of Reinforcement learning (RL) for adaptive strategies. But RL faces scalability and noise issues. Enter Quantum Computing (QC), offering faster computation and better optimization, potentially revolutionizing stock trading.
- Problem Statement: Providing a Reinforcement Learning Based Trader that has sound performance, faster convergence, and better balance of exploration and exploitation to cover all sorts of strategies in a dynamic market environment.
- Proposed Solution: Quantum Reinforcement Learning (QRL) offers a solution by leveraging quantum algorithms to help the Reinforcement Learning Trader to balance the exploration vs exploitation issue and provide faster convergence for optimal buy-sell decision making.
- Building for the Finance Track at Hacklytics'24!
- Understanding and Preprocessing Data: Collect historical stock price data and select relevant technical indicators, like Bollinger Bands, EMA and RSI.
- Implementing Quantum Reinforcement Learner (QRL):
- With the quantum states represent the value of the technical indicators and the actions are buy,sell and hold.
- Implementing Grover's iteration algorithm for search of optimal buy-sell signals on the state action space.
- Training and Evaluation: Training on historical price data and backtesting our strategy to assess portfolio returns, risk and performance regarding benchmark.
- Portfolio Return Assessment: Providing a framework for future training on a portfolio of stocks, providing graphs and metrics to assesss performance of our trader when compared to benchmark and a typical Q-Learning Trader.
- Trading on a portfolio of stocks, and how to diversify it for optimal conditions.
Two options seemed viable:
- Equally splitting the portfolio among the 10 stocks (example), and optimizing the trade stocks using its own QRL based trading methods.
- Performing portfolio optimization to reduce risk for higher return, and rebalnacing portfolios periodically.
- Significant fluctuations in asset allocation due to the quantum computing-based portfolio optimizer's operation.
- Execution of substantial trade volumes leading to heightened trading activity.
- Adverse market impact, especially pronounced for less liquid assets or during periods of heightened volatility.
- Implementation of transaction costs as a prudent measure to address challenges.
- Imposition of constraints on portfolio turnover to mitigate the frequency and scale of allocation adjustments.
- Mitigation of the risk of excessive trading costs.
- Promotion of stability in portfolio management practices.
- Enhancement of overall investment efficacy.
- [1] D. Dong, C. Chen, H. Li and T. -J. Tarn, "Quantum Reinforcement Learning," in IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), vol. 38, no. 5, pp. 1207-1220, Oct. 2008, doi: 10.1109/TSMCB.2008.925743
- [2] Giuseppe Davide Paparo, Vedran Dunjko, Adi Makmal, Miguel Angel Martin-Delgado, and Hans J. Briegel, Quantum Speedup for Active Learning Agents, Phys. Rev. X 4, 031002



