Is it possible to convert PDF files to text within the CONNECT workflow?
I don't think there is a simple way to convert a complete PDF to text.
Maybe you can use the AlambicEdit API:
https://help.objectiflune.com/files/EN/alambicedit-api/AlambicEdit.html#ca2c9ecafa5c48628db2e4b46fd88ee2
Otherwise, there are many third-party tools that can do this (e.g. Ghostscript). You can try to integrate such a tool via the command line.
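As a rough sketch of the command-line route, here is how Ghostscript's `txtwrite` output device could be called from a small Python script. The executable name `gswin64c` (the 64-bit Windows console build), the helper names `build_gs_command` and `pdf_to_text`, and the file paths are assumptions for illustration only, so adjust them to your installation. I have not tested this inside a Workflow process.

```python
import subprocess
from pathlib import Path


def build_gs_command(pdf_path, txt_path, gs_executable="gswin64c"):
    """Build the Ghostscript argument list for a PDF-to-text run.

    gs_executable is an assumption: "gswin64c" is the usual name of the
    64-bit Windows console build; on Linux or macOS it is normally "gs".
    """
    return [
        gs_executable,
        "-q",                        # suppress the startup banner
        "-dNOPAUSE",                 # do not prompt between pages
        "-dBATCH",                   # exit when the input is processed
        "-sDEVICE=txtwrite",         # Ghostscript's text extraction device
        f"-sOutputFile={txt_path}",  # where the extracted text is written
        str(pdf_path),
    ]


def pdf_to_text(pdf_path, txt_path, gs_executable="gswin64c"):
    """Run Ghostscript and return the extracted text as a string."""
    cmd = build_gs_command(pdf_path, txt_path, gs_executable)
    # check=True raises CalledProcessError if Ghostscript exits with an error.
    subprocess.run(cmd, check=True)
    return Path(txt_path).read_text(encoding="utf-8", errors="replace")


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    print(pdf_to_text("input.pdf", "output.txt"))
```

The same argument list should also work when passed directly to whatever mechanism you use to launch an external program from the workflow; the Python wrapper is only there to show the arguments in context.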
Maybe there is another way that I don't know of.
It depends on what you mean by "convert".
If you wish to retrieve the text from the PDF, then either the Connect Data Mapper or the AlambicEdit ExtractText2() method should get you going.
If you wish to produce a text representation of a PDF, with the text visually placed at the proper position in a plain text file, then printing the PDF through the Generic/Text Only printer driver that comes with Windows should work, up to a certain point.