Installing PDFtotext in R
Installing PDFtotext in R
I am trying to run the PDFtotext package in R.
PDFtotext
When I run these commands:
library(tm) pdf=readPDF(control=list(text="-layout"))(elem=list(uri=uri), language="en", id="idi")
I get this error:
Error in system2("pdftotext", c(control$text, shQuote(x), "-"), stdout = TRUE) : "pdftotext" not found
In addition: Warning message: running command "pdfinfo" "C:*****NCLR AR 2005.pdf" had status 127
Does anyone know what the problem might be?
Sys.which("pdftotext")
""
pdftools
This function of the
tm library requires that pdftotext and pdfinfo are installed on your computer. You can download precombiled binaries for the most common operating systems here. These programs are not installed in or by R, as the title of your question suggests. They need to be installed as separate programs on your system.– RHertel
Apr 6 '16 at 15:52
tm
pdftotext
pdfinfo
1 Answer
1
In widows add the binary to your path. System-->Advanced -->Environment Variables -->add the directory containing the pdftotext.exe
By clicking "Post Your Answer", you acknowledge that you have read our updated terms of service, privacy policy and cookie policy, and that your continued use of the website is subject to these policies.
Sys.which("pdftotext")is""? I.e. the file is not found. Have you installed it? You may want to try the packagepdftoolsas an alternative to read pdfs.– lukeA
Apr 6 '16 at 14:38