Date : Tue, 26 Jun 2001 00:04:34 +0100
From : "mike.mallett" <mike.mallett@...>
Subject: FW: Scanning of BBC Magazines
> -----Original Message-----
> From: owner-bbc-micro@... [mailto:owner-bbc-micro@...]
> On Behalf Of Isabel Cisternas & Robert Schmidt
> Sent: 25 June 2001 21:44
> To: bbc-micro@...
> Subject: Re: [BBC-Micro] Scanning of BBC Magazines
>
<etc ...>
>
> There should be one high quality, lossless format to store scans before
> somebody volunteers to OCR them, and one OCRed format (PDF, RTF, DOC and
> HTML could all be considered, but one should be chosen).
PDF can include keywords and searchable text, and there is indexing over
multiple files.
>
> I've seen some pretty amazing work done on material scanned and OCRed
> into PDF. Not sure if this can be done with Acrobat out of the box, but
> some kind of OCRed format seems essential to me.
>
The full version of Acrobat includes the Paper Capture plug in to do this.
Not to be confused with the free Reader ...