Use multiple pages pdf as scanned answered sheets (Feature #66)


Added by Nicolas Pettiaux over 12 years ago. Updated about 12 years ago.


Status:Closed Start date:06/26/2012
Priority:Normal Due date:
Assignee:- % Done:

100%

Category:-
Target version:-

Description

The scanner I use produces multiple pages pdf as the results of scanning the answered sheets. Today I need to convert the pdf (how to do easily with a script ?) to good quality jpeg manually with gimp (as I co not know how to script gimp nor have found the parameters to have Imagemagick convert to it as well as gimp).


History

Updated by red sea over 12 years ago

you can used this programs:
- pdf chain
- pdf sam

regards

Updated by Alexis Bienvenüe over 12 years ago

Another suggestion: as PDF is only a container, extracting the included images can be done without modifying them using pdfimages from the poppler-utils debian package:

pdfimages scan.pdf scanroot

In some situations however, the resulting images will need 90° rotation...

Updated by Nicolas Pettiaux over 12 years ago

Alexis Bienvenüe wrote:

Another suggestion: as PDF is only a container, extracting the included images can be done without modifying them using pdfimages from the poppler-utils debian package:
[...]
In some situations however, the resulting images will need 90° rotation...

Great.

Im my case, the resulting file is in format im-001-000.pbm: Netpbm PBM "rawbits" image data. Does AMC accept this format ? or do I have to convert it to jpeg ? or another ? which is the best way and with which parameters ?

Updated by Alexis Bienvenüe over 12 years ago

Netpbm PBM "rawbits" image data. Does AMC accept this format ?

This should be OK, as this format can be read by OpenCV.

Updated by Nicolas Pettiaux over 12 years ago

good to know. Problem is solved then. Much thanks.

Updated by Alexis Bienvenüe over 12 years ago

Also note that AMC accepts multi-page PDF scans as input when calling "automatic data capture": it does convert it to bitmap single page images before processing them.

Updated by Nicolas Pettiaux over 12 years ago

Alexis Bienvenüe wrote:

Also note that AMC accepts multi-page PDF scans as input when calling "automatic data capture": it does convert it to bitmap single page images before processing them.

Thanks for the info. So, if I understand well, the situation now is : we can use with AMC a multipage pdf scan, that is created by the copy machine acting as scanner. THe rest will be automatic ?

Updated by Alexis Bienvenüe over 12 years ago

Yes.

Updated by Alexis Bienvenüe about 12 years ago

  • Status changed from New to Resolved

Updated by Alexis Bienvenüe about 12 years ago

  • % Done changed from 0 to 100
  • Status changed from Resolved to Closed

Also available in: Atom PDF