Scaling out speculative execution of finite-state machines with parallel merge

Yang Xia; Peng Jiang; Gagan Agrawal

doi:10.1145/3332466.3374524

Back

Conference proceeding

Scaling out speculative execution of finite-state machines with parallel merge

Yang Xia, Peng Jiang and Gagan Agrawal

Proceedings of the 25th ACM SIGPLAN Symposium on principles and practice of parallel programming, pp.160-172

PPoPP '20

02/19/2020

DOI: 10.1145/3332466.3374524

View Online

Abstract

A finite-state machine (FSM) is a key component for many important applications, such as Huffman decoding, regular expression matching and HTML tokenization. Due to its inherent dependencies and unpredictable memory access pattern, FSM computations are considered to be extremely difficult to parallelize. As such, significant research efforts have been made to accelerate FSM computations. Although they achieve promising performance results on multi-core machines, these methods are not scalable for emerging many-core architectures such as the GPUs. Based on our experiments, we point out that the bottleneck of achieving scalability on GPUs is the sequential merge inherent to these methods. However, unlike the case for simple reduction loops, parallel merge implementations for FSM computations typically require runtime checks and re-executions, which can also impede performance. Based on these observations, we develop parallel merge techniques that select efficient runtime check implementations and avoids unnecessary re-executions. Further, based on GPU architectural features, we develop optimization techniques to improve performance. We evaluate our parallel merge implementations on a set of representative algorithms. Experimental results show that our parallel merge implementations are 2.02-6.74 times more efficient than corresponding sequential merge implementations and achieve better scalability on an Nvidia V100 GPU.

finite-state machines

GPUs

speculation

Details

Title: Subtitle: Scaling out speculative execution of finite-state machines with parallel merge
Creators: Yang Xia - The Ohio State University
Peng Jiang - University of Iowa
Gagan Agrawal - The Ohio State University
Resource Type: Conference proceeding
Publication Details: Proceedings of the 25th ACM SIGPLAN Symposium on principles and practice of parallel programming, pp.160-172
Series: PPoPP '20
DOI: 10.1145/3332466.3374524
Publisher: ACM
Language: English
Date published: 02/19/2020
Academic Unit: Computer Science
Record Identifier: 9984259430202771

Metrics

12 Record Views

6 Times Cited - Web of Science