Pandoc drops "unknown" HTML elements when converting to markdown

Question

Consider the following simple HTML:



Test

I want to convert that to markdown, and for the elements that don't have markdown equivalents (object, etc.) to just pass them through as HTML unchanged. However, when I run it through pandoc (v1.13.1) with the following command line:

pandoc --from=html --to=markdown --output=C:\Temp	est.md C:\Temp	est.html

...the only output I get in test.md is:

Test

I am obviously missing some parameter, or is this even possible? I would think it is given that markdown allows semi-arbitrary HTML to be embedded inline.

Note: I have already seen this question and answer, but when I try --parse-raw it simply passes through all the HTML as HTML, which is not what I want.

mb21 · Accepted Answer

The --parse-raw parameter is indeed what you're looking for. For example:

$ echo 'foo
bar ' | pandoc -f html -t markdown --parse-raw
foo
===

bar

However, it seems to choke on the tag in your example, thus leaving the outer

tag in place instead of converting it to markdown. You should probably submit a bug report.

Pandoc drops "unknown" HTML elements when converting to markdown

Answers (1)

Related Questions

Pandoc drops &quot;unknown&quot; HTML elements when converting to markdown

Answers (1)

Related Questions

Pandoc drops "unknown" HTML elements when converting to markdown