Reputation: 1
I am trying to automate scientific PDF to XML using Python. I would like to know: is there any Python library that can generate XML from text or from PDF documents?
Upvotes: 0
Views: 1257
Reputation: 115
You can use textract to read a pdf file, for details: Extracting text from a PDF file using Python once you have your text in a variable use ElementTrees to write properly formatted XML.
Upvotes: 2