Building interpretable machine learning models for sequential data
Abstract
Details
- Title: Subtitle
- Building interpretable machine learning models for sequential data
- Creators
- Dat Hong
- Contributors
- Tong Wang (Advisor)Alberto Maria Segre (Committee Member)Padmini Srinivasan (Committee Member)Bijaya Adhikari (Committee Member)Qihang Lin (Committee Member)
- Resource Type
- Dissertation
- Degree Awarded
- Doctor of Philosophy (PhD), University of Iowa
- Degree in
- Computer Science
- Date degree season
- Summer 2023
- Publisher
- University of Iowa
- DOI
- 10.25820/etd.006992
- Number of pages
- xiii. 102 pages
- Copyright
- Copyright 2023 Dat Hong
- Language
- English
- Date submitted
- 06/12/2023
- Description illustrations
- illustrations, tables, graphs
- Description bibliographic
- Includes bibliographical references (pages 93-102).
- Public Abstract (ETD)
- Machine learning is a branch of artificial intelligence where we teach computers how to learn. For instance, we can train a program to recognize if an image shows a dog or a cat. The trained program is called a model, and it can predict the label of a new image. While machine learning models can be highly accurate, their decision-making process is often unclear to humans. As a result, there is a growing demand for interpretable machine learning due to increased requirements for transparency in algorithms and data by governments and businesses. This field focuses on developing techniques to explain how machine learning models work and has gained attention from both researchers and industry.
Interpretable machine learning encompasses two main categories of methods: those that explain pre-trained models and those that create new models that are easier for humans to understand. In this thesis, we propose various methods that belong to both categories.
The first method explains a recurrent neural network, a specialized network used for analyzing time-series data. Our method generates a diagram to explain how the network makes predictions after training it to determine the sentiment (positive or negative) of a text. For example, the diagram may reveal that if a text begins with words like “wonderful” or “great food,” it will be classified as having a positive sentiment.
In the second method, we propose a new, self-explanatory network architecture. This network is built based on the concept of prototypes, which are representative examples in the training data that can stand for multiple other examples. Before generating a prediction, the network maps any input text sequence to a sequence of prototypes called a prototype trajectory. For instance, sentences like “The food is fantastic,” “Amazing food” and “All are delicious” can all be represented by a single prototype sentence like “Food is great.” By examining the prototypes, users can understand how the model reaches its predictions.
Finally, our third method explains predictions at the instance level using personalized recourse or counterfactual explanations. Given an input sequence with a specific label, our method generates a similar sequence, but with the label reversed. Additionally, the new sequence can be personalized based on the desired style. For example, if the original sequence is “The food is terrible” with a negative sentiment, our method can generate a new sentence that is similar but with a positive sentiment, such as “The food is fantastic,” and add a specific flair, like “The street food in the city is amazing.” In another application, if a player makes a series of incorrect decisions resulting in a loss in a game, our method can suggest a similar series of steps that would help the player win and adjust their playing style.
- Academic Unit
- Computer Science
- Record Identifier
- 9984454644102771