Riley Hun
Riley Hun

Reputation: 2785

Python: Error - tabula-py cannot read PDF

I cannot execute tabula-py's read_pdf function.

It seems to be producing the following error message:

WindowsError: [Error 2] The system cannot find the file specified

With traceback:

Traceback (most recent call last):
  File "C:/Users/riley/PycharmProjects/Payroll/PayrollParsePDF.py", line 126, in <module>
    print read_pdf(r"C:\Users\riley\Desktop\Bank Statements\53591.pdf")
  File "C:\Python27\lib\site-packages\tabula\wrapper.py", line 54, in read_pdf_table
    output = subprocess.check_output(args)
  File "C:\Python27\lib\subprocess.py", line 212, in check_output
    process = Popen(stdout=PIPE, *popenargs, **kwargs)
  File "C:\Python27\lib\subprocess.py", line 390, in __init__
    errread, errwrite)
  File "C:\Python27\lib\subprocess.py", line 640, in _execute_child
    startupinfo)

Any insight as to why this is? It was working fine before.

Upvotes: 2

Views: 2564

Answers (1)

jasonharper
jasonharper

Reputation: 9621

It isn't YOUR file that is not being found, it is the java executable that isn't being found. That line in tabula\wrapper.py is trying to launch the actual Tabula code, which is written in Java, and failing.

I don't know why Java would suddenly stop being found - something messed up your PATH, perhaps?

Upvotes: 2

Related Questions