Reputation: 21
We are using the IBM Watson Document Conversion service on Word documents and have noticed that text in the header (which is displayed when the doc file is viewed) is not returned by the service. Is this a known issue?
Upvotes: 0
Views: 78
Reputation: 92
The Watson Document Conversion service filters out headers and footers on purpose so that the converted output is clean, readable text.
Headers and footers typically contain repeating phrases or words (e.g. a chapter title) or numbers (e.g. a page number), which are usually not desirable in the output HTML or plain text.
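If you do need the header text, one possible workaround is to read it from the file yourself, outside the service. The sketch below is not part of the Watson API. It assumes the document is a .docx (Office Open XML) file, which is a zip archive that usually stores headers in parts named like `word/header1.xml`. It will not work on legacy binary .doc files, and the function name `read_docx_headers` is hypothetical.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by paragraph (w:p) and text (w:t) elements.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def read_docx_headers(docx_file):
    """Return the text of each header part in a .docx, keyed by part name.

    Assumes header parts follow the usual word/headerN.xml naming.
    """
    headers = {}
    with zipfile.ZipFile(docx_file) as archive:
        for name in sorted(archive.namelist()):
            if not re.fullmatch(r"word/header\d*\.xml", name):
                continue
            root = ET.fromstring(archive.read(name))
            paragraphs = []
            for p in root.iter(f"{{{W_NS}}}p"):
                # Join all text runs that belong to one paragraph.
                text = "".join(t.text or "" for t in p.iter(f"{{{W_NS}}}t"))
                if text:
                    paragraphs.append(text)
            headers[name] = "\n".join(paragraphs)
    return headers
```

Calling `read_docx_headers("report.docx")` (with a hypothetical file name) would return a dictionary mapping each header part to its text, which you could then merge with the converted output as needed.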
Upvotes: 1