Skip to main content

Cuneiform OCR software recognizes up to 23 languages

Cognitive OpenOCR (CuneiForm) is an open source Optical Character Recognition (OCR) software that can automatically recognize texts in scanned or printed documents in not one or two but twenty-three international languages. These include English, Bulgarian, Croatian, Czech, Danish, Dutch, Estonian, French, German, Hungarian, Italian, Latvian, Lithuanian, Polish, Portuguese, Romanian, Russian, mixed Russian-English, Spanish, Swedish, Serbian, Slovenian, Turkish, and Ukrainian.

Cuneiform can recognize any printing and typing styles with the exception of decorative and manuscripts. The software has special algorithms for text recognition from dotted matrix printer, fax and photocopies of bad typing. It auto recognizes blocks of text, tables and images and preserves the layout of the page perfectly.

cuneiform

Cuneiform has a high recognition rate. Its performance is comparable to that of high-quality commercial software such as ABBYY FineReader. In fact, in my test I wasn’t able to figure out who did the better job - ABBYY FineReader or Cuneiform.

Cuneiform does everything pretty much automatically. A wizard guides you through all stages of scanning and recognition and helps you reach the goal quickly. Just keep feeding the software scanned copies of text or direct it to get the images from the scanner and it will do the rest. The software also grants you the ability to define regions for scanning, regions to ignore, define tables, columns and so on, you wish to.

After it is done recognizing the text, a curious thing happens – it opens Microsoft Word right inside the program’s window for you to carry out editing and proofreading. If your computer does not have MS Word installed, the document will open on their inbuilt text editor which is by no means inferior to standard word processors.

cuneiform2

Features:

  • Quality recognition
  • High speed
  • Recognition of texts in 23 languages
  • Rotate and invert images before recognition
  • Recognition of tables of any structure and complexity
  • Automatic saving of illustrations and tables in the received output document
  • Complete preservation of the topology of the page
  • Support batch mode scanning and recognition
  • Built-in text editor to work with the recognized text

Comments

  1. Can you make sure that whatever links you mention on your article are actually functional links?! The "Cuneiform"(http://en.openocr.org/download) is not!

    ReplyDelete
  2. The link is absolutely functional. I downloaded the software from the same link. If it's not opening, perhaps the website is temporarily offline.

    ReplyDelete
  3. try loading google cache of the page and clicking download links from there, it worked for me that way

    ReplyDelete
  4. Cannot make it work on Windows 7 :-(

    ReplyDelete
  5. Do your software can work on Indian regional languages like Marathi, Tamil, Punjabi etc ?

    ReplyDelete

Post a Comment

Popular posts from this blog

How to Schedule Changes to Your Facebook Page Cover Photo

Facebook’s current layout, the so called Timeline, features a prominent, large cover photo that some people are using in a lot of different creative ways. Timeline is also available for Facebook Pages that people can use to promote their website or business or event. Although you can change the cover photo as often as you like, it’s meant to be static – something which you design and leave it for at least a few weeks or months like a redesigned website. However, there are times when you may want to change the cover photo frequently and periodically to match event dates or some special promotion that you are running or plan to run. So, here is how you can do that.

69 alternatives to the default Facebook profile picture

If you have changed the default Facebook profile picture and uploaded your own, it’s fine. But if not, then why not replace that boring picture of the guy with a wisp of hair sticking out of his head with something different and funny?

How to remove watermark from an image or picture

A watermark is any recognizable text, logo or pattern that appears over an image to identify the owner of the image and generally used to prevent unauthorized reuse of the image. Watermarks are usually transparent and can be difficult to remove. The difficulty or ease of removal depends on the content of the image and the position, color, size etc of the watermark.