Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech

Ahmed M. Yousef; Dimitar D. Deliyski; Stephanie R.C. Zacharias; Alessandro de Alarcon; Robert F. Orlikoff; Maryam Naghibolhosseini

doi:10.1016/j.jvoice.2020.10.017

Back

Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech

Journal article

Peer reviewed

Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech

Ahmed M. Yousef, Dimitar D. Deliyski, Stephanie R.C. Zacharias, Alessandro de Alarcon, Robert F. Orlikoff and Maryam Naghibolhosseini

Journal of voice, Vol.37(1), pp.26-36

01/01/2023

DOI: 10.1016/j.jvoice.2020.10.017

PMCID: PMC8411982

PMID: 33257208

View Online

Abstract

This study proposes a new computational framework for automated spatial segmentation of the vocal fold edges in high-speed videoendoscopy (HSV) data during connected speech. This spatio-temporal analytic representation of the vocal folds enables the HSV-based measurement of the glottal area waveform and other vibratory characteristics in the context of running speech. HSV data were obtained from a vocally normal adult during production of the “Rainbow Passage.” An algorithm based on an active contour modeling approach was developed for the analysis of HSV data. The algorithm was applied on a series of HSV kymograms at different intersections of the vocal folds to detect the edges of the vibrating vocal folds across the frames. This edge detection method follows a set of deformation rules for the active contours to capture the edges of the vocal folds through an energy optimization procedure. The detected edges in the kymograms were then registered back to the HSV frames. Subsequently, the glottal area waveform was calculated based on the area of the glottis enclosed by the vocal fold edges in each frame. The developed algorithm successfully captured the edges of the vocal folds in the HSV kymograms. This method led to an automated measurement of the glottal area waveform from the HSV frames during vocalizations in connected speech. The proposed algorithm serves as an automated method for spatial segmentation of the vocal folds in HSV data in connected speech. This study is one of the initial steps toward developing HSV-based measures to study vocal fold vibratory characteristics and voice production mechanisms in norm and disorder in the context of connected speech.

Connected Speech

Glottal Area Waveform

High-Speed Videoendoscopy

Laryngeal Imaging

Spatial Segmentation

Voice Assessment

Details

Title: Subtitle: Spatial Segmentation for Laryngeal High-Speed Videoendoscopy in Connected Speech
Creators: Ahmed M. Yousef - Michigan State University
Dimitar D. Deliyski - Michigan State University
Stephanie R.C. Zacharias - Mayo Clinic in Arizona
Alessandro de Alarcon - Cincinnati Children's Hospital Medical Center
Robert F. Orlikoff - East Carolina University
Maryam Naghibolhosseini - Michigan State University
Resource Type: Journal article
Publication Details: Journal of voice, Vol.37(1), pp.26-36
DOI: 10.1016/j.jvoice.2020.10.017
PMID: 33257208
PMCID: PMC8411982
NLM abbreviation: J Voice
ISSN: 0892-1997
eISSN: 1873-4588
Publisher: Elsevier Inc
Number of pages: 11
Language: English
Date published: 01/01/2023
Academic Unit: Communication Sciences and Disorders
Record Identifier: 9984721229702771

Metrics

3 Record Views

14 Times Cited - Web of Science

See more details