Extract information from financial statements from XBRL files in Python?

Question

I am trying to parse and get information from XBRL files, and it seems there are a number of open source packages that have the ability to parse XBRL files in python. However, documentation on using them seems to be lacking. The packages I have investigated are:

Arelle: https://arelle.readthedocs.io/en/2.25.2/
Py-xbrl: https://py-xbrl.readthedocs.io/en/latest/usage.html#offline
Brel*: https://brellibrary.github.io/brel/

*For reasons that I won't get into, I'm currently unable to use Brel (it's essentially due to requiring python >=3.10).

However, the other packages don't seem to allow to me to parse downloaded XBRL files offline. The structure of my dowloaded XBRL files is as follows (taking the example of Apple from the SEC filings):

aapl-20200926
 |
 +-- aapl-20200926.xsd
 +-- aapl-20200926_cal.xml
 +-- aapl-20200926_def.xml
 +-- aapl-20200926_lab.xml
 +-- aapl-20200926_pre.xml
 +-- aapl-20200926_htm.xml

I'm aware this isn't the typical purpose of stackoverflow, but does anyone know how I could start parsing XBRL files in the above structure (rather than from a link on the internet) in a python script? I expect there is some way to do this using Arelle and the Python API, or in py-xbrl, but I haven't been able to crack it so far.

Extract information from financial statements from XBRL files in Python?

Answers (1)

Related Questions