RootsWeb.com Mailing Lists
Total: 2/2
    1. Re: Gif/Jpg printer ?
    2. Ian Goddard
    3. Denis Beauregard wrote: > It is relatively easy to take a document and to print it to a PDF > file. For example, I can read a text file and print it with > LibreOffice. > > Now, I have a book scanned to a PDF file and I would like to extract > the images to enter them in an image processor so as to check each > entry in a list when I integrate the information into a database. > > I presume if I can print to a PDF file, then I can print to a JPG > file as well. AFAIK LibreOffice doesn't have a print to JPEG or whatever unless you actually have (if such a thing exists) a 3rd party application which provides a pseudo-printer. However, on a quick test I found I could open a PDF with LibreOffice, copy an image & paste it into a paint program. This is with Linux but presumably should also work on W7. However, it wouldn't be my preferred way of doing this on Linux as there are command line tools to extract images from PDFs. In fact I used them recently to split a PDF consisting of scans of maps & get a series of TIFFs of the original individual sheets. Scanned books, e.g. from archive.org are often OCRd & you can also extract the text - complete with all the usual OCR artefacts. -- Ian The Hotmail address is my spam-bin. Real mail address is iang at austonley org uk

    11/20/2012 02:29:43
    1. Re: Gif/Jpg printer ?
    2. Denis Beauregard
    3. On Tue, 20 Nov 2012 09:29:43 +0000, Ian Goddard <goddai01@hotmail.co.uk> wrote in soc.genealogy.computing: >Denis Beauregard wrote: >> It is relatively easy to take a document and to print it to a PDF >> file. For example, I can read a text file and print it with >> LibreOffice. >> >> Now, I have a book scanned to a PDF file and I would like to extract >> the images to enter them in an image processor so as to check each >> entry in a list when I integrate the information into a database. >> >> I presume if I can print to a PDF file, then I can print to a JPG >> file as well. > >AFAIK LibreOffice doesn't have a print to JPEG or whatever unless you >actually have (if such a thing exists) a 3rd party application which >provides a pseudo-printer. > >However, on a quick test I found I could open a PDF with LibreOffice, >copy an image & paste it into a paint program. This is with Linux but >presumably should also work on W7. I presume teh many LO versions have all the same features for the same version. W7 does it too ! >However, it wouldn't be my preferred way of doing this on Linux as there >are command line tools to extract images from PDFs. In fact I used them >recently to split a PDF consisting of scans of maps & get a series of >TIFFs of the original individual sheets. > >Scanned books, e.g. from archive.org are often OCRd & you can also >extract the text - complete with all the usual OCR artefacts. I tried this solution and it does more or less the job. I open the PDF file with LO, delete the pages I don't need and I copy pages one by one into the paint software. I can't find how to edit the images directly however. All I can do is to put arrows and I can do that to indicate that I already have the data on a given line. Anyway, I think I can do more or less what I want to do. Thanks for the idea. I didn't know LO could edit PDF files. Denis -- Denis Beauregard - généalogiste émérite (FQSG) Les Français d'Amérique du Nord - www.francogene.com/genealogie--quebec/ French in North America before 1722 - www.francogene.com/quebec--genealogy/ Sur cédérom à 1780 - On CD-ROM to 1780

    11/20/2012 03:24:16