dc.contributor.author: Nadalian, Soheila
dc.date.accessioned: 2023-03-14 19:25:36 (GMT)
dc.date.available: 2023-03-14 19:25:36 (GMT)
dc.date.issued: 2023-03-14
dc.date.submitted: 2023-02-16
dc.identifier.uri: http://hdl.handle.net/10012/19203
dc.description.abstract: Inflammatory Bowel Disease (IBD) refers to a group of conditions that primarily affect the gut and cause inflammation. In contrast, Hidradenitis Suppurativa (HS) is a chronic immune-mediated condition characterized by boils in a person's underarms, groin, and/or under their breasts. In recent years, research on HS has attracted growing interest as reliable differentiation of these two diseases (IBD and HS) has become crucial in clinical settings. In this study, several machine learning and data mining algorithms, including Decision Tree, Random Forest, Naive Bayes, and k-Nearest Neighbor, are investigated to shed light on the distinction between HS and IBD. These candidate methods are used to classify IBD and HS based on multiple features such as age, illness history, and clinical observations. The thesis presents a comparative study of machine learning classification strategies for recognizing these two diseases. The methods were applied to the IBD/HS dataset collected by medical professionals at the Mayo Clinic, Rochester, MN, USA. The dataset consists of 198 records and 52 attributes; however, data cleaning was necessary before applying machine learning. During the evaluation, the approaches were compared by accuracy, a commonly used metric. The comparisons showed that the random forest approach performed best, achieving an accuracy of 93.8% on a reduced dataset containing 20 features per patient. The detailed analysis of the results is supported by several visualization techniques such as t-SNE.
In addition, the thesis attempts to determine a precise set of criteria and to identify the features that are most significant in separating these two diseases from one another. The results of this study give medical professionals the opportunity to investigate aspects that were previously assumed not to play a significant role in clinical practice. To the best of the author's knowledge, this is the first applied study to use machine learning and data mining techniques for IBD and HS classification. [en]
dc.language.iso: en [en]
dc.publisher: University of Waterloo [en]
dc.relation.uri: IBD-HS dataset collected by the medical professionals at the Mayo Clinic, Rochester, MN, USA [en]
dc.subject: machine learning [en]
dc.subject: data mining [en]
dc.subject: classification [en]
dc.subject: disease classification [en]
dc.subject: IBD [en]
dc.subject: HS [en]
dc.title: Feature Analysis and Classification of Inflammatory Bowel Disease and Hidradenitis Suppurativa Using Data Mining [en]
dc.type: Master Thesis [en]
dc.pending: false
uws-etd.degree.department: Systems Design Engineering [en]
uws-etd.degree.discipline: System Design Engineering [en]
uws-etd.degree.grantor: University of Waterloo [en]
uws-etd.degree: Master of Applied Science [en]
uws-etd.embargo.terms: 0 [en]
uws.contributor.advisor: Tizhoosh, Hamid Reza
uws.contributor.advisor: Rahnamayan, Shahryar
uws.contributor.affiliation1: Faculty of Engineering [en]
uws.published.city: Waterloo [en]
uws.published.country: Canada [en]
uws.published.province: Ontario [en]
uws.typeOfResource: Text [en]
uws.peerReviewStatus: Unreviewed [en]
uws.scholarLevel: Graduate [en]



UWSpace

University of Waterloo Library
200 University Avenue West
Waterloo, Ontario, Canada N2L 3G1
519 888 4883

All items in UWSpace are protected by copyright, with all rights reserved.
