Dojo 1.0 + OCR

This week Dojo 1.0 was released. I already took a quick look at it, and it looks really nice. Big changes since the last version I used (0.4). The new grid is awesome, I really hope it will be easy to use but I’m pretty sure it will be. The guys from the Dojo community are really good programmers. I spent some timing digging through the code for a fix, I saw several things I didn’t even know were possible with Javascript 🙂
I’ll come back on Dojo once I’ve got some real first hand experience with it, but it looks promising (and also much faster than previous releases)
I already found out that if you pass in a JSON store to your grid, your identifier better be unique or it won’t work (took me some time to figure that one out 🙂 )

A couple of weeks ago We bought a All in one (scanner,fax,printer,copier). The nice thing about this device it let’s you scan 50 pages (document feeder) without having to put every page on the scanner window. That combined with the ability to scan to a networkdrive,email or ftp server makes this device the perfect solution for my administrative problem. My goal is to go to a paperless office but it will take some time until we get there.
Problem is the device scans the images straight to PDF. So text actually becomes an image, you can’t select it, search for text,… it’s not yet what I want. In my Ubuntu I searched for OCR and got some results witch I installed. Almost all of the tools let me scan for text in the PDF, when I saved the document I suddenly could search for text in the document. Still the selecting of text wasn’t possible. Philip told me that Google had a solution that let’s you convert these kind of documents to XHtml. I knew they were developing an OCR tool but didn’t know it would be able to do this kind of stuff. I tried to install it but I’m still missing some dependecies, I’ll try it later again.

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.