Journal article
Authorship Attribution via Coupon-Collector-Type Indices
Journal of quantitative linguistics, Vol.27(4), pp.321-333
10/01/2020
DOI: 10.1080/09296174.2019.1577939
Abstract
Authorship attribution is the process of determining the author of a text in question by capturing an author's writing style based on selected stylistic features. In this paper, we propose a new methodology for authorship attribution based on a profile of indices related to the generalized coupon collector problem, called coupon-collector-type indices. The coupon collector problem and its generalizations are of traditional and recurrent interests. Coupons are drawn one at a time from a population containing n distinct type of coupons. The process continues until a complete set of n distinct coupons is obtained and the total number of draws,
, is recorded. We base our methodology on function words. We establish a testing procedure by constructing a confidence band of the coupon-collector-type indices using an empirical bootstrap technique. We validate our proposed methodology using several writing samples whose authorship is known. We then apply this methodology to explore the question of who wrote the fifteenth Oz book, whose authorship is disputed between Lyman Frank Baum (1856-1919) and his successor) on the Oz series, Ruth Plumly Thompson (1891-1976).
Details
- Title: Subtitle
- Authorship Attribution via Coupon-Collector-Type Indices
- Creators
- Lukun Zheng - Western Kentucky UniversityHuiqiang Zheng - Western Kentucky University
- Resource Type
- Journal article
- Publication Details
- Journal of quantitative linguistics, Vol.27(4), pp.321-333
- DOI
- 10.1080/09296174.2019.1577939
- ISSN
- 0929-6174
- eISSN
- 1744-5035
- Publisher
- Routledge
- Language
- English
- Date published
- 10/01/2020
- Academic Unit
- Asian and Slavic Languages and Literatures
- Record Identifier
- 9984398014202771
Metrics
22 Record Views