What is causing AttributeError: 'list' object has no attribute 'read' when tying to read in a pdf with Tabula?

Question

I am attempting to use Tabula to pull table information from a pdf and convert it to a pandas dataframe. I have been following the steps in this tutorial:

https://aegis4048.github.io/parse-pdf-files-while-retaining-structure-with-tabula-py

When I try to load the remote PDF into my jupyter notebook with the following code (taken directly from the tutorial):

import tabula
df2 = tabula.read_pdf("https://github.com/tabulapdf/tabula-java/raw/master/src/test/resources/technology/tabula/arabic.pdf")

I get the error:

AttributeError: 'list' object has no attribute 'read'

I have tried to read in pdfs saved locally to my machine and I get the same error. I believe I have successfully installed Java and configured the environment variable correctly, and I have the most recent version of Tabula.

Link to screenshot from my jupyter notebook:

https://www.dropbox.com/s/y44mfzuclihfdau/S_O_Capture_1.PNG?dl=0

Thanks.

What is causing AttributeError: 'list' object has no attribute 'read' when tying to read in a pdf with Tabula?

Answers (1)

Related Questions

What is causing AttributeError: &#39;list&#39; object has no attribute &#39;read&#39; when tying to read in a pdf with Tabula?

Answers (1)

Related Questions

What is causing AttributeError: 'list' object has no attribute 'read' when tying to read in a pdf with Tabula?