Data-driven Models for Inferring the Patient Scheduling Policies via Inverse Reinforcement Learning
dc.contributor.author | Moradi, Parham | |
dc.date.accessioned | 2024-01-23T14:21:26Z | |
dc.date.issued | 2024-01-23 | |
dc.date.submitted | 2024-01-22 | |
dc.description.abstract | In this work, we study multi-class patient scheduling with stochastic daily patient arrivals. Different classes of patients are characterized by different service times, waiting cost parameters, and rejection cost parameters. Our primary objective is to infer the policy used by the decision-makers, who schedule patients over a finite time horizon, based on their historical decisions. To achieve this, we first develop a mathematical model that captures the complexities of patient scheduling and is representative of the problem that decision-makers may consider to scheduling patients. Then, we utilize the Riccati and Hamiltonian approaches to estimate the cost parameters that have influenced the scheduling decisions made by the decision-maker. The Riccati approach begins by estimating the expert's policy, which is then used to determine the cost parameters. Conversely, the Hamiltonian approach derives the cost parameters through the optimality conditions of a path trajectory without needing to estimate the expert's policy. Using a simulation model, we demonstrate the efficiency and robustness of the proposed methods. Furthermore, we apply Riccati and Hamiltonian approaches to MRI data from two hospitals to estimate the cost parameters used in their scheduling decisions. Utilizing the estimated cost parameters, we analyze the root causes of the observed outcomes and examine the impact of these underlying factors on the scheduling process. Finally, through counterfactual analysis, we propose two alternative scheduling policies that reduce the total cost, even with the original cost parameters used by the decision-makers. | en |
dc.identifier.uri | http://hdl.handle.net/10012/20271 | |
dc.language.iso | en | en |
dc.pending | false | |
dc.publisher | University of Waterloo | en |
dc.title | Data-driven Models for Inferring the Patient Scheduling Policies via Inverse Reinforcement Learning | en |
dc.type | Master Thesis | en |
uws-etd.degree | Master of Applied Science | en |
uws-etd.degree.department | Management Sciences | en |
uws-etd.degree.discipline | Management Sciences | en |
uws-etd.degree.grantor | University of Waterloo | en |
uws-etd.embargo | 2025-01-22T14:21:26Z | |
uws-etd.embargo.terms | 1 year | en |
uws.contributor.advisor | Abouee Mehrizi, Hossein | |
uws.contributor.affiliation1 | Faculty of Engineering | en |
uws.peerReviewStatus | Unreviewed | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.scholarLevel | Graduate | en |
uws.typeOfResource | Text | en |