Skip to Content

Submit your nominations for the Luxist Awards' Best in Decor
AOL Tech

optical-character-recognition posts

Filed under: Text, Utilities, Web services

Online OCR extracts text from scans for free

Online OCRHave you ever received a PDF file in an email that contained information you need in text form, but the PDF doesn't let you select the text? Sometimes this is due to access restrictions on the document, but more often it's because the person creating the PDF did so by simply scanning the document. This creates an image of the document, but does not embed the text information.

To extract the text, you need to use optical character recognition (OCR) software. Most scanners come with OCR software, but if you don't have ready access to it, you might consider using the Online OCR service.

Online OCR lets you upload a PDF file and it returns plain text that can then be copied directly into the word processing tool of your choice.

Obviously there are security concerns with uploading and storing copies of your documents on a 3rd party's site, but if that's not a concern to you, Online OCR might be just the trick to solve your OCR needs.

[via Lifehacker]

Filed under: Office, Web services, Google, Web

Google Docs API now provides OCR service

Google API OCR DemoIf you're not a developer, you're probably not aware that Google Docs has an API available for various document-related services. Recently Google added a new feature that allows developers to create applications that will pass an image-based (.png, .jpg, or .gif) document to the API, and using OCR (Optical Character Recognition) technology, generate and pass back an editable text-based document. Currently the service can handle documents up to 10 MB in size.

For non-developers, this is certainly interesting, but not terribly useful. Well, a live demo is available that will allow you to test the service yourself. At the time of this writing I was successfully able to convert a single page document, though there are reports that the service has become overloaded and is slow or sometimes fails to respond. Given that this is a demo, that's probably not terribly surprising.

If Google is exposing this functionality to developers, it seems at least possible that it could become a built-in feature of Google Docs for regular users at some point in the near future. With Google's recent acquisition of reCaptcha, it seems likely that Google's document-scanning capabilities will soon be better than anyone else's. That would make for an incredibly powerful feature for Google to offer its users.

[via Google Operating System]

Featured Time Waster

The World's Hardest Game 2.0 - Time Waster

So, just how good at time waster games are you? Think you've got the stuff? Well, The World's Hardest Game 2.0 doesn't think you do. Yes, amazingly, it's possible to have a sequel to a game called "The World's Hardest Game". It doesn't seem logically possible, since if the first one was actually the world's hardest, how could another one come along and share the moniker? It made me doubt the name in the first place. That is, until I tried the game. The mechanics of the game are very simple. You are a small red square, ...

View more Time Wasters

Featured Galleries

Defective by Design, London: Protest Pictures
Microsoft Security Essentials
Chromium Pre-Alpha on CrunchBang Linux
Safari 4 Beta
10 Firefox themes that don't suck
IE8 RC1
Download Squad at the Crunchies After-Party
Download Squad at the Crunchies
WordPress 2.7
Cooking Mama: Mama Kills Animals
Windows 7 Hands On
Comodo Internet Security
Android First-look: Amazon.com MP3 Store
Android First-look: Twitroid
Google Reader Android
Android Hands-On
Twine 1.0
Photoshop Express Beta
Mozilla Birthday Cake
Palm stuff
Adobe Lightroom 1.1

 


Follow us on Twitter!

Flickr Pool

www.flickr.com

More Tech Coverage

AOL Radio