Optimization of Policy Evaluation and Policy Improvement Methods in Portfolio Optimization using Quasi-Monte Carlo Methods
dc.contributor.author | Orok, Gavin | |
dc.date.accessioned | 2024-05-24T17:20:54Z | |
dc.date.available | 2024-05-24T17:20:54Z | |
dc.date.issued | 2024-05-24 | |
dc.date.submitted | 2024-05-22 | |
dc.description.abstract | Machine learning involves many challenging integrals that can be estimated using numerical methods. One application of these methods that has been explored in recent work is the estimation of policy gradients for reinforcement learning. That work found that, for many standard continuous control problems, randomized quasi-Monte Carlo (RQMC) and Array-RQMC, two numerical methods that use low-discrepancy point sets, improved the efficiency of both policy evaluation and policy gradient-based policy iteration compared to standard Monte Carlo (MC). We extend this work by investigating the application of these numerical methods to model-free reinforcement learning algorithms in portfolio optimization, which are of interest because they do not rely on the complex model assumptions that pose difficulties for analytical methods. We find that RQMC significantly outperforms MC under all conditions for policy evaluation, and that Array-RQMC outperforms both MC and RQMC in policy iteration with a strategic choice of the reordering function. | en |
dc.identifier.uri | http://hdl.handle.net/10012/20596 | |
dc.language.iso | en | en |
dc.pending | false | |
dc.publisher | University of Waterloo | en |
dc.relation.uri | https://colab.research.google.com/drive/1DOA2VRYGWWR1hC713l6sIY57tK2gDx0H?usp=sharing | en |
dc.relation.uri | https://colab.research.google.com/drive/1mwA9wtUAPZoIUfWnlirZ5_TmVbZlWZi8?usp=sharing | en |
dc.subject | reinforcement learning | en |
dc.subject | numerical methods | en |
dc.subject | quasi-Monte Carlo | en |
dc.subject | portfolio optimization | en |
dc.subject | continuous control | en |
dc.title | Optimization of Policy Evaluation and Policy Improvement Methods in Portfolio Optimization using Quasi-Monte Carlo Methods | en |
dc.type | Master Thesis | en |
uws-etd.degree | Master of Quantitative Finance | en |
uws-etd.degree.department | Statistics and Actuarial Science | en |
uws-etd.degree.discipline | Quantitative Finance | en |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.embargo.terms | 0 | en |
uws.comment.hidden | The two links provide access to the Google Colab notebooks with the code used for policy iteration and evaluation. | en |
uws.contributor.advisor | Lemieux, Christiane | |
uws.contributor.affiliation1 | Faculty of Mathematics | en |
uws.peerReviewStatus | Unreviewed | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.scholarLevel | Graduate | en |
uws.typeOfResource | Text | en |