UWSpace staff members will be away from May 5th to 9th, 2025. We will not be responding to emails during this time. If there are any urgent issues, please contact GSPA at gsrecord@uwaterloo.ca. If any login or authentication issues arise during this time, please wait until UWSpace Staff members return on May 12th for support.
 

Fault Diagnosis in Enterprise Software Systems Using Discrete Monitoring Data

dc.contributor.authorReidemeister, Thomas
dc.date.accessioned2012-05-18T23:13:14Z
dc.date.available2012-05-18T23:13:14Z
dc.date.issued2012-05-18T23:13:14Z
dc.date.submitted2012-05-18
dc.description.abstractSuccess for many businesses depends on their information software systems. Keeping these systems operational is critical, as failure in these systems is costly. Such systems are in many cases sophisticated, distributed and dynamically composed. To ensure high availability and correct operation, it is essential that failures be detected promptly, their causes diagnosed and remedial actions taken. Although automated recovery approaches exists for specific problem domains, the problem-resolution process is in many cases manual and painstaking. Computer support personnel put a great deal of effort into resolving the reported failures. The growing size and complexity of these systems creates the need to automate this process. The primary focus of our research is on automated fault diagnosis and recovery using discrete monitoring data such as log files and notifications. Our goal is to quickly pinpoint the root-cause of a failure. Our contributions are: Modelling discrete monitoring data for automated analysis, automatically leveraging common symptoms of failures from historic monitoring data using such models to pinpoint faults, and providing a model for decision-making under uncertainty such that appropriate recovery actions are chosen. Failures in such systems are caused by software defects, human error, hardware failures, environmental conditions and malicious behaviour. Our primary focus in this thesis is on software defects and misconfiguration.en
dc.identifier.urihttp://hdl.handle.net/10012/6757
dc.language.isoenen
dc.pendingfalseen
dc.publisherUniversity of Waterlooen
dc.subject.programElectrical and Computer Engineering (Software Engineering)en
dc.titleFault Diagnosis in Enterprise Software Systems Using Discrete Monitoring Dataen
dc.typeDoctoral Thesisen
uws-etd.degreeDoctor of Philosophyen
uws-etd.degree.departmentElectrical and Computer Engineeringen
uws.peerReviewStatusUnrevieweden
uws.scholarLevelGraduateen
uws.typeOfResourceTexten

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Reidemeister_Thomas.pdf
Size:
4.04 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
256 B
Format:
Item-specific license agreed upon to submission
Description: