Selvarani

Reputation: 31

Tesseract couldn't load any languages

The TESSDATA_PREFIX environment variable is set to the parent directory of your "tessdata" directory.
Failed loading language 'eng'
Tesseract couldn't load any languages!

I can't open the path below, so I can't set the TESSDATA_PREFIX environment variable to the parent directory of the "tessdata" directory.

/Users/syzygy01/Library/Developer/CoreSimulator/Devices/3C2CC079-D784-432D-A79A-C5336017E69C/data/Containers/Bundle/Application/61ADADE0-8CFD-4815-8F33-19B0DA676619/TesstractTest.app/tessdata/eng.traineddata

Upvotes: 3

Views: 1193

Answers (1)

Devendra Singraul

Reputation: 951

Please keep your training data file inside the tessdata folder and follow this code:

import java.io.File;

import net.sourceforge.tess4j.Tesseract;
import net.sourceforge.tess4j.TesseractException;

public class OCR_POC {

    public static void main(String[] args) throws TesseractException {
        String inputFilePath = "F:/my_documents/issues.pdf";

        Tesseract tesseract = new Tesseract();
        // Folder that contains the *.traineddata files
        tesseract.setDatapath("F:/Tesseract/tessdata/");
        // tesseract.setLanguage("chi_sim");
        // tesseract.setLanguage("eng"); // English is the default language

        String fullText = tesseract.doOCR(new File(inputFilePath));
        System.out.println("Full text : " + fullText);
    }
}
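
If the "Failed loading language" error still appears, it may help to confirm that the data path really contains the language file before handing it to Tesseract. The following is only a minimal sketch using the standard library, not part of Tess4J: the class name TessdataCheck and the methods hasLanguage and prefixFor are hypothetical helpers, and the default path is the one from the code above. It assumes the layout described in the error message, where the language files live in a folder named "tessdata" and TESSDATA_PREFIX is that folder's parent.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

public class TessdataCheck {

    // Hypothetical helper: true if <datapath>/<language>.traineddata is a regular file.
    static boolean hasLanguage(Path datapath, String language) {
        return Files.isRegularFile(datapath.resolve(language + ".traineddata"));
    }

    // Hypothetical helper: given the path to a .traineddata file inside a
    // "tessdata" folder, returns the parent directory of that folder.
    static Path prefixFor(Path trainedDataFile) {
        Path tessdataDir = trainedDataFile.getParent();
        if (tessdataDir == null
                || tessdataDir.getFileName() == null
                || !"tessdata".equals(tessdataDir.getFileName().toString())) {
            throw new IllegalArgumentException(
                    "Expected the file to be inside a 'tessdata' folder: " + trainedDataFile);
        }
        Path parent = tessdataDir.getParent();
        // A bare relative "tessdata" folder has no parent; treat it as the current directory.
        return parent != null ? parent : Paths.get("");
    }

    public static void main(String[] args) {
        // Assumed default: the data path used in the answer's example.
        Path datapath = Paths.get(args.length > 0 ? args[0] : "F:/Tesseract/tessdata/");
        String language = "eng";

        if (hasLanguage(datapath, language)) {
            Path file = datapath.resolve(language + ".traineddata");
            System.out.println("Found " + file);
            System.out.println("Parent of tessdata: " + prefixFor(file.toAbsolutePath()));
        } else {
            System.out.println("Missing " + language + ".traineddata in " + datapath);
        }
    }
}
```

Running it with the tessdata folder as the first argument prints either the located file or a message saying that eng.traineddata is missing from that folder.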

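Maven dependency for pom.xml (${tess4j} is a Maven property holding the Tess4J version; define it under <properties> or replace it with a version number):

<dependency>
    <groupId>net.sourceforge.tess4j</groupId>
    <artifactId>tess4j</artifactId>
    <version>${tess4j}</version>
</dependency>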

Upvotes: 1
