Journal article
State of the Art in Parallel Computing with R
Journal of statistical software, Vol.31(1), pp.1-27
08/01/2009
DOI: 10.18637/jss.v031.i01
Abstract
R is a mature open-source programming language for statistical computing and graphics. Many areas of statistical research are experiencing rapid growth in the size of data sets. Methodological advances drive increased use of simulations. A common approach is to use parallel computing.
This paper presents an overview of techniques for parallel computing with R on computer clusters, on multi-core systems, and in grid computing. It reviews sixteen different packages, comparing them on their state of development, the parallel technology used, as well as on usability, acceptance, and performance.
Two packages (snow, Rmpi) stand out as particularly suited to general use on computer clusters. Packages for grid computing are still in development, with only one package currently available to the end user. For multi-core systems five different packages exist, but a number of issues pose challenges to early adopters. The paper concludes with ideas for further developments in high performance computing with R. Example code is available in the appendix.
Details
- Title: Subtitle
- State of the Art in Parallel Computing with R
- Creators
- Markus Schmidberger - Ludwig-Maximilians-Universität MünchenMartin Morgan - Fred Hutchinson Canc Res Ctr, Seattle, WA USADirk EddelbuettelHao Yu - Univ Western Ontario, London, ON N6A 3K7, CanadaLuke Tierney - Univ Iowa, Iowa City, IA 52242 USAUlrich Mansmann
- Resource Type
- Journal article
- Publication Details
- Journal of statistical software, Vol.31(1), pp.1-27
- DOI
- 10.18637/jss.v031.i01
- ISSN
- 1548-7660
- eISSN
- 1548-7660
- Publisher
- JOURNAL STATISTICAL SOFTWARE
- Number of pages
- 27
- Language
- English
- Date published
- 08/01/2009
- Academic Unit
- Statistics and Actuarial Science
- Record Identifier
- 9984257743502771
Metrics
127 Record Views