Deep Learning Based Approaches for Imputation of Time Series Models
dc.contributor.author | Saad, Muhammad | |
dc.date.accessioned | 2020-12-16T15:59:05Z | |
dc.date.available | 2020-12-16T15:59:05Z | |
dc.date.issued | 2020-12-16 | |
dc.date.submitted | 2020-12-07 | |
dc.description.abstract | Market price forecasting models for Fresh Produce (FP) are crucial to protect retailers and consumers from highly priced FP. However, utilizing the data for forecasting is obstructed by the occurrence of missing values. Therefore, it is imperative to develop models to determine the value for those missing instances thereby enabling effective forecasting. Usually this problem is tackled with conventional methods that introduce bias into the system which in turn results in unreliable forecasting results. Therefore, in this thesis, numerous imputation models are developed alongside a framework enabling the user to impute any time series data with the optimal models. This thesis also develops novel forecasting models which are used as a gauging mechanism for each tested imputation mode. However, those forecasting models can also be used as standalone models. The growth and success of deep learning has largely been attributed to the availability of big data and high end computational power along with the theoretical advancement . In this thesis, multiple deep learning models are built for imputing the missing values and also for forecasting. The data used in building these deep learning models comprise California weather data, California strawberry yield, California strawberry farm-gate prices, USA corn yield data, Brent oil type daily prices and a synthetic time series dataset. For imputation, mean squared error is used as an metric to gauge the performance of imputation whereas for forecasting a new aggregated error measure (AGM) is proposed in this thesis which combines mean absolute error, mean squared error and R2 which is the coefficient of determination. Different models are found to be optimal for different time series. These models are illustrated in the recommendation framework developed in the thesis. Different stacking ensemble techniques such as voting regressor and stacking ML ensemble are then utilized to have better imputation results. The experiments show that the voting regressor yields the best imputation results. To gauge the robustness of the imputation framework, different time series are assessed. The imputed data is used for forecasting and the forecasting results are compared with market deep and non-deep learning models. The results show the best imputation models recommended based on work with the synthesized datasets are in fact the best for the tested incomplete real datasets with Mean Absolute Scaled Error (MASE) <1 i.e. better than the naive forecasting model. Also, it is found that the best imputation models have higher impact on reducing the forecasting errors compared to other deep or non-deep imputation models found in literature and market. | en |
dc.identifier.uri | http://hdl.handle.net/10012/16561 | |
dc.language.iso | en | en |
dc.pending | false | |
dc.publisher | University of Waterloo | en |
dc.relation.uri | https://www.kaggle.com/ainslie/usda-wasde-monthly-corn-soybean-projections#USDAProj_Corn_2007to2008.csv | en |
dc.relation.uri | https://www.eia.gov/dnav/pet/pet_pri_spt_s1_d.html | en |
dc.relation.uri | https://cimis.water.ca.gov/ | en |
dc.relation.uri | https://www.calstrawberry.com/en-us/ | en |
dc.subject | deep learning | en |
dc.subject | imputation | en |
dc.subject | missing values | en |
dc.subject | time series | en |
dc.subject | machine learning | en |
dc.subject | forecasting | en |
dc.subject | ensemble | en |
dc.title | Deep Learning Based Approaches for Imputation of Time Series Models | en |
dc.type | Master Thesis | en |
uws-etd.degree | Master of Applied Science | en |
uws-etd.degree.department | Electrical and Computer Engineering | en |
uws-etd.degree.discipline | Electrical and Computer Engineering | en |
uws-etd.degree.grantor | University of Waterloo | en |
uws.comment.hidden | The following are the publications that are based on the thesis: - Prediction of Strawberry Yield and Farm Price Utilizing Deep Learning - Published in IEEE WCCI’20 - Deep Learning Based Approach for Fresh Produce Market Price Prediction - Published in IEEE WCCI’20 - Machine Learning Based Approaches for Imputation in Time Series Data and their Impact on Forecasting -Published in IEEE SMC’20 - Tackling Imputation Across Time Series Models Using Deep Learning and Ensemble Learning -Published in IEEE SMC’20 - Imputation Impact on Strawberry Yield and Farm Price Prediction Using Deep Learning - Published IEEE SMC’20 Currently, a journal paper is also been formulated illustrating the results of the thesis. | en |
uws.contributor.advisor | Karray, Fakhri | |
uws.contributor.advisor | Gaudet, Vincent | |
uws.contributor.affiliation1 | Faculty of Engineering | en |
uws.peerReviewStatus | Unreviewed | en |
uws.published.city | Waterloo | en |
uws.published.country | Canada | en |
uws.published.province | Ontario | en |
uws.scholarLevel | Graduate | en |
uws.typeOfResource | Text | en |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Saad_Muhammad.pdf
- Size:
- 4.38 MB
- Format:
- Adobe Portable Document Format
- Description:
- Masters Thesis on Deep Learning Based Approaches for Imputation of Time Series Models
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 6.4 KB
- Format:
- Item-specific license agreed upon to submission
- Description: