I’ve got a job coming up where I need to process approx 1 million PDF’s an index file. I need to put the PDF’s into ZIPs in 2k chunks along with a new index file. One of the index fields needed is PDF page count. Is there an easier way of getting the page count of a PDF rather than put it into a mapper?
I obviously won’t be trying to run all 1million at once and will be running them in smaller chunks but I can imagine if I load the PDF’s into mapper it’s going to take days to process