An Algorithm for Trading and Portfolio Management Using Q-learning and Sharpe Ratio Maximization
نویسندگان
چکیده
A trading and portfolio management system called QSR is proposed. It uses Q-learning and Sharpe ratio maximization algorithm. We use absolute proot and relative risk-adjusted proot as performance function to train the system respectively, and employ a committee of two networks to do the testing. The new proposed algorithm makes use of the advantages of both parts and can be used in a more general case. We demonstrate with experimental results that the proposed approach generates appreciable proots from trading in the foreign exchange markets.
منابع مشابه
Outperformance Testing of a Dynamic Assets Portfolio Selection Supplemented with a Continuous Paths Levy Process
This study aims at getting a better performance for optimal stock portfolios by modeling stocks prices dynamics through a continuous paths Levy process. To this end, the share prices are simulated using a multi-dimensional geometric Brownian motion model. Then, we use the results to form the optimal portfolio by maximizing the Sharpe ratio and comparing the findings with the outputs of the conv...
متن کاملArbitrage pricing theory-based Gaussian temporal factor analysis for adaptive portfolio management
Ever since the inception of Markowitz’s modern portfolio theory, static portfolio optimization techniques were gradually phased out by dynamic portfolio management due to the growth of popularity in automated trading. In view of the intensive computational needs, it is common to use machine learning approaches on Sharpe ratio maximization for implementing dynamic portfolio optimization. In the ...
متن کاملPerformance Functions and Reinforcement Learning for Trading Systems and Portfolios
We propose to train trading systems and portfolios by optimizing objective functions that directly measure trading and investment performance. Rather than basing a trading system on forecasts or training via a supervised learning algorithm using labelled trading data, we train our systems using recurrent reinforcement learning (RRL) algorithms. The performance functions that we consider for rei...
متن کاملComparison of Portfolios Formed by Use of Grid Strategy Model Based on New and Traditional Variables Performance With Sharpe and Treynor Measures (Evidence of IRAN Exchange)
In this research, performance of portfolios formed by use of grid strategy based on new variables (aggressive, indifference and defensive stocks) presented by Rahnamaye Roodposhti (1388), and traditional ones (growth, growth-value and value stocks), calculated with Sharpe and Treynor performance measures and tested by an Active portfolio management approach to identify the portfolios by perform...
متن کاملOptimal mean-variance portfolio selection using Cauchy–Schwarz maximization
Fund managers highly prioritize selecting portfolios with a high Sharpe ratio. Traditionally, this task can be achieved by revising the objective function of the Markowitz mean-variance portfolio model and then resolving quadratic programming problems to obtain the maximum Sharpe ratio portfolio. This study presents a closed-form solution for the optimal Sharpe ratio portfolio by applying Cauch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000