PDF to CSV: how the converter automatically determines whether OCR is needed

For Zapier, Integromat, and other plugins, insert custom profiles into the profiles field. For API calls, pass the value as a string in the profiles parameter.

To determine whether OCR is required, the converter checks each page of the PDF document. A page is treated as needing OCR if it contains at least one raster image or vector drawing but very little text (fewer than 8 characters).
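The per-page rule above can be sketched as follows. This is only an illustration of the described heuristic, not the converter's actual code: the PageSummary type, its field names, and the choice to ignore surrounding whitespace when counting characters are all assumptions made for the example.

```python
from dataclasses import dataclass

# Threshold taken from the description above: fewer than 8 characters.
MIN_TEXT_CHARS = 8


@dataclass
class PageSummary:
    """Hypothetical summary of one PDF page (not a real PDF.co type)."""
    text: str             # text extracted from normal text objects
    raster_images: int    # number of raster images on the page
    vector_drawings: int  # number of vector drawings on the page


def page_needs_ocr(page: PageSummary) -> bool:
    """Return True if the page has graphics but very little text."""
    has_graphics = page.raster_images > 0 or page.vector_drawings > 0
    # Assumption: leading and trailing whitespace is not counted as text.
    has_little_text = len(page.text.strip()) < MIN_TEXT_CHARS
    return has_graphics and has_little_text
```

With this sketch, a scanned page (one image, no text) is flagged for OCR, while a page with an image and a full paragraph of text is not.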

OCR can be forced by setting an option of the underlying PDF Extractor engine through the profiles parameter:

{ "OCRMode": "TextFromImagesAndVectorsAndFonts" }

In the TextFromImagesAndVectorsAndFonts mode, the converter will perform OCR on image and vector objects and combine the result with the extraction of normal text objects.
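For API calls, the profile has to be sent as a string rather than as a nested object. A minimal sketch of preparing such a request body is shown here. The build_request_body helper and the "url" field name are assumptions for illustration; only the profiles parameter and the OCRMode value come from this article.

```python
import json


def build_request_body(source_url: str, force_ocr: bool) -> dict:
    """Build a request body for one of the conversion endpoints.

    Hypothetical helper: the "url" field name is an assumption.
    """
    body = {"url": source_url}
    if force_ocr:
        # The profile is serialized so that "profiles" holds a JSON string,
        # not a nested object.
        body["profiles"] = json.dumps(
            {"OCRMode": "TextFromImagesAndVectorsAndFonts"}
        )
    return body
```

Leaving out profiles keeps the automatic per-page detection, and including it forces OCR on image and vector objects.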

Applies To:

  • /pdf/convert/to/csv
  • /pdf/convert/to/xml
  • /pdf/convert/to/json
  • /pdf/convert/to/xls
  • /pdf/convert/to/xlsx