HTML to Context template using Pandoc and Django: how to avoid calling subprocess too many times?

Question

I'm generating pdfs using the TeX macro package Context. The Context source file is generated using Django templates, and the content is converted from HTML (as stored in the database) to Context's syntax using Pandoc. Since there is no native python binding for Pandoc, I have created the following template filter to convert HTML to Context:

def html2context(value):
    """
     Runs pandoc to convert from HTML to Context.

    Syntax::

    {{ value|html2context }}
    """
    cmd = 'pandoc -f html -t context'
    p1 = subprocess.Popen(cmd.split(" "), stdout=subprocess.PIPE, stdin=subprocess.PIPE)
    (stdout, stderr) = p1.communicate(input=value.encode('utf-8'))
    return mark_safe(stdout)

The problem is that to generate my pdf I have call many times the template filter, ending in a very slow conversion process. The reason I have to call many times the filter is because my template mixes content from the database and raw Context commands to structure my document: HTML doesn't cover all the possibilities I need in Context. A minimal example of my template looks like this:

{% for plugin in intro %}
    {{ plugin.text.body|html2context }}
    \page[emptyodd]
{% endfor %}

Do you have any idea how I could make the conversion process less sub-optimal?

Thanks

HTML to Context template using Pandoc and Django: how to avoid calling subprocess too many times?

Answers (1)

Related Questions