[aklug] Re: Anyone interested in a job importing 24k printed emails in Juneau/Anchorage into a database?

From: Mark Neyhart <Mark_Neyhart@legis.state.ak.us>
Date: Wed Jun 08 2011 - 15:41:00 AKDT

This was my first question as well...

Does anybody know of a linux OCR tool which can convert images to
text? I've found references to Tesseract, but am not sure if it is
active. I've got a bunch of pages which have been scanned to PDF, and
would like to be able to make them searchable.

Mark Neyhart

Joshua J. Kugler wrote:
> First question: WHY ON EARTH are they printed? Why can't they give them
> to you on a CD or DVD?
>
> j
>
> On Wednesday 08 June 2011, Jason McEachen elucidated thus:
>> This Friday at 9am the State of Alaska is going to have a couple
>> boxes of printed emails in Juneau for me to have, and a hand truck to
>> help carry them. We could also pick them up at the Anchorage Airport
>> at 3pm.
>>
>> What I'd like to do is somehow import them into a database and set up
>> a quick and easy web-based interface to allow searches.
>>

>> Thanks for your help,
>>
>> --Jason

---------
To unsubscribe, send email to <aklug-request@aklug.org>
with 'unsubscribe' in the message body.
Received on Wed Jun 8 15:41:08 2011

This archive was generated by hypermail 2.1.8 : Wed Jun 08 2011 - 15:41:08 AKDT