Journal article
pdfsearch: Search Tools for PDF Files
Journal of open source software, Vol.3(27), p.668
2018
DOI: 10.21105/joss.00668
Abstract
PDF files are common formats for reports, journal articles, briefs, and many other documents. PDFs are lightweight, portable, and easily viewed across operating systems. Even though PDF files are ubiquitous, extracting and finding text within a PDF can be time consuming and not easily reproducible. The pdftools R package (Ooms 2017), which uses the poppler C++ library to extract text from PDF documents, aids in the ability to import text data from PDF files to manipulate in R. The pdfsearch package (LeBeau 2018) is an R package (R Core Team 2016) that extends the text extraction of pdftools to allow for keyword searching within a single PDF or a directory of PDF files.
Details
- Title: Subtitle
- pdfsearch: Search Tools for PDF Files
- Creators
- Brandon LeBeau
- Resource Type
- Journal article
- Publication Details
- Journal of open source software, Vol.3(27), p.668
- DOI
- 10.21105/joss.00668
- ISSN
- 2475-9066
- eISSN
- 2475-9066
- Language
- English
- Date published
- 2018
- Academic Unit
- Psychological and Quantitative Foundations
- Record Identifier
- 9983993487002771
Metrics
198 Record Views