Links

Thai Document OCR <New!>

Thai Document OCR Version 2.0
iApp's Thai Documents Optical Character Recognition (OCR) can convert any printed characters document as an image or a multiple-pages PDF file to the following file output automatically as follows:
  • the Editable Text (.txt)
  • the Structured JSON (.json)
  • the Microsoft Word file! (.docx)
It will save your time and not need to re-typing the whole documents again!

Features

  • Supported PNG, JPG, JPEG, and PDF input image file. No more than 30MB.
  • System will recognize the document components automatically as follows:
    • Page boundary
    • Title
    • Paragraph
    • Image
    • Table
    • Thai characters
    • English characters
    • Special characters
  • System will generate the output into three format.
    • the Editable Text (.txt) (Endpoint = /ocr)
    • the Structured JSON (.json) (Endpoint = /layout)
    • the Microsoft Word file! (.docx) (Endpoint = /docx)

Demo