Working paper
Distinguishing Tax Avoider Types: An Unsupervised Machine Learning Approach
SSRN
12/2025
DOI: 10.2139/ssrn.5942755
Abstract
We utilize unsupervised machine learning to identify distinct types of tax avoiders based on 15 observable firm characteristics associated with tax avoidance mechanisms. The most common type—the “PPE/DEBT” cluster—exhibits greater amounts of PPE, capital expenditures, total debt, and mezzanine financing (47 percent), followed by the “R&D/NOL” cluster, which exhibits greater R&D expenditures and NOLs (35 percent). In contrast, only 18 percent of tax avoiders are assigned to the “income shifting” cluster, characterized by greater usage of intangible assets, foreign sales, tax havens, and stock options. We observe time-series variation in the composition of tax avoider types, and out-of-sample tests reveal that the income shifting cluster exhibits higher levels of both future IRS attention and tax settlements. Our findings on the combination of mechanisms that characterize distinct types of tax avoiders should be of direct use for policymakers and tax authorities in designing tax incentives and allocating resources, respectively.
Details
- Title: Subtitle
- Distinguishing Tax Avoider Types: An Unsupervised Machine Learning Approach
- Creators
- Sonja O. Rego - Indiana UniversityBrian Williams - Indiana UniversityRyan J. Wilson - University of IowaJunwei Xia - Texas A&M University
- Resource Type
- Working paper
- DOI
- 10.2139/ssrn.5942755
- Publisher
- SSRN
- Number of pages
- 52 pages
- Language
- English
- Date posted
- 12/2025
- Academic Unit
- Accounting
- Record Identifier
- 9985116167102771
Metrics
1 Record Views