Sage of Seven Paths
Sage of Seven Paths

Reputation: 105

Is Cursive Writing Character segmentation even possible using this strip wise OCR (or other Classification model) based technique?

I need to extract instances of characters from a sample of user's handwritten text. In effect, I am trying to create a database of how a user writes a specific letter from the submitted text sample. This is my attempt at the problem of cursive character segmentation.

My approach at the problem:

The idea seemed theoretically possible.

I found that a similar method had been presented in Stack Exchange before.

I ran my model implementing my idea and found that the result was pretty good, and it should improve substantially improve with a larger improved OCR model, or another model which is actually trained specifically on digit recognition.

I thought that it would work on handwritten characters as well, since TrOCR, unlike Tesseract is also supposed to work for Handwriting.

When I ran it again, I discovered the following:

The second problem could be solved by segmenting each word, and then running TrOCR. This fails spectacularly. Turns out, TrOCR is terrible in recognising words without context. When I sent a slice of the word "dog", It began returning words like "diores".

I also ran TrOCR on individual characters, where it also fails.

My questions:

  1. Can working with a CNN or Contrastive learning Based Character Classification model work better? If Yes, Where can I find a dataset of segmented Cursive Characters?
  2. Is there anyway to continue using TrOCR such that it does not care about word meanings?
  3. Can this method hope to outperform the recent segmentation techniques such as this?

Upvotes: -3

Views: 35

Answers (0)

Related Questions