- Parse-type Snap
- Works in Ultra Pipelines
This Snap extracts fields, text, images, or tables from a PDF document. A PDF account is required only when parsing a locked PDF file.
The PDF Parser Snap might encounter issues when it parses tables that are embedded images, lack borders, have a complex row-column structure, or span multiple pages.
|Examples of Upstream and Downstream Snaps
|This Snap has exactly one binary input view.
|This Snap supports exactly one binary or document output view.
Error handling is a generic way to handle errors without losing data or failing the Snap execution. To choose how the Snap handles errors it encounters while the pipeline runs, select one of the options from the When errors occur list under the Views tab. The available options are:
Learn more about Error handling in Pipelines.
- Suggestion icon: Indicates a list that is dynamically populated based on the configuration.
- Expression icon: Indicates whether the value is an expression (if enabled) or a static value (if disabled). Learn more about Using Expressions in SnapLogic.
- Add icon: Indicates that you can add fields in the field set.
- Remove icon: Indicates that you can remove fields from the field set.
|Field / Field Set
|Required. Specify a unique name for the Snap. Change the name to something more descriptive, especially if the pipeline contains more than one instance of the same Snap.
Required. Specify pages or the range of pages that you want to split into separate PDF documents. For a range of pages, use a hyphen and separate each page or range with a comma. For example:
Default value: N/A
Example: 1-3, 5-7
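The range syntax described for the Pages field (single pages or hyphenated ranges, separated by commas) can be illustrated with a short Python sketch. This is not the Snap's implementation; the function name `parse_page_spec` is hypothetical, and its choices (1-based page numbers, tolerance for an en dash in place of a hyphen, rejection of reversed ranges) are assumptions made for the example.

```python
def parse_page_spec(spec: str) -> list[int]:
    """Expand a page specification such as "1-3, 5-7" into page numbers.

    Hypothetical helper for illustration only; the Snap's own parsing
    rules may differ.
    """
    pages: list[int] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # ignore empty entries such as a trailing comma
        # Assumption: treat an en dash like a hyphen, since copied
        # examples often contain one.
        part = part.replace("\u2013", "-")
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
            if start < 1 or end < start:
                raise ValueError(f"invalid page range: {part!r}")
            pages.extend(range(start, end + 1))  # ranges are inclusive
        else:
            page = int(part)
            if page < 1:
                raise ValueError(f"invalid page number: {part!r}")
            pages.append(page)
    return pages


print(parse_page_spec("1-3, 5-7"))  # [1, 2, 3, 5, 6, 7]
```

The sketch keeps pages in the order given and does not remove duplicates; whether the Snap does either is not stated in this document.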
Select how the parser should act on the pages specified in the Pages field. The options available are:
Default value: Text extractor
Example: Table parser
|Select one of the three modes in which the Snap executes.
Available options are: