Extract text from a PDF file

This example pipeline demonstrates how to extract text from a PDF file that contains table data.

  1. Configure the File Reader Snap to read a sample PDF file that contains data in tables.
  2. Configure the Extract Snap—select the Text and Full table checkboxes to extract text from the tables in the PDF file.
    On validation, the text is extracted. You can view the extracted text from the tables in the output preview.
    Extract Snap Configuration Extract Snap Output

    Extract Snap Configuration


    Extract Snap Output

To successfully reuse pipelines:
  1. Download and import the pipeline into SnapLogic.
  2. Configure Snap accounts as applicable.
  3. Provide pipeline parameters as applicable.