Using 'preserve_interword_spaces' in tesseract.js

Question

I am trying to use Tesseract.js for OCR, but I'm not able to get the 'preserve_interword_spaces' option to work. Here is what I am trying:

 Tesseract.recognize(
      element.files[0],
      'eng',
        { preserve_interword_spaces: 1,
          logger: progress => {
            console.log(progress);
            progressBar.querySelector("div").innerText = progress.status;
            progressBar.querySelector("progress").value = progress.progress;
        } }
    ).then( //etc )

The OCR is coming out with multiple spaces combined into one. Help?

I'd prefer to define the .recognize() this way, rather than using await(). I know preserve_interword_spaces is supported since I can see it in the documentation here and here but I'm not sure how to get it to work in my case.

Using 'preserve_interword_spaces' in tesseract.js

Answers (1)

Related Questions

Using &#39;preserve_interword_spaces&#39; in tesseract.js

Answers (1)

Related Questions

Using 'preserve_interword_spaces' in tesseract.js