Google VertexAI RAG Corpus Add File

Overview

Use this Snap to add a file to the RAG corpus.


Prerequisites

Supported models:

  • text-embedding-005
  • text-multilingual-embedding-002
  • text-embedding-large-exp-03-07

You need to have one of the following accounts configured for your Google VertexAI Snaps:

Snap views

Type Description Examples of upstream and downstream Snaps
Input This Snap has at most one document input view.
  • File Reader
Output

This Snap has at most one document output view.

{
  "name": "projects/sldatascience/locations/us-central1/ragCorpora/6917529027641081856/ragFiles/5438241315519446597",
  "displayName": "test",
  "description": "test-desc",
  "directUploadSource": {},
  "fileId": "5438241315519446597",
  "fileLocation": "us-central1",
  "fileName": "5438241315519446597:test"
}
  • Mapper
Learn more about Error handling.
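The sample output document above carries the project, location, corpus ID, and file ID inside its name field. As a minimal sketch, assuming the name always follows the path layout shown in that sample, a downstream script could split it as follows. The function parse_rag_file_name and the pattern RAG_FILE_NAME are hypothetical helpers for illustration and are not part of the Snap.

```python
import re

# Path layout taken from the sample output document; assumed to be stable.
RAG_FILE_NAME = re.compile(
    r"^projects/(?P<project>[^/]+)"
    r"/locations/(?P<location>[^/]+)"
    r"/ragCorpora/(?P<corpus_id>[^/]+)"
    r"/ragFiles/(?P<file_id>[^/]+)$"
)


def parse_rag_file_name(name: str) -> dict:
    """Split a RAG file resource name into its path components (hypothetical helper)."""
    match = RAG_FILE_NAME.match(name)
    if match is None:
        raise ValueError(f"Unexpected RAG file name: {name!r}")
    return match.groupdict()


# Values copied from the sample output document.
sample = {
    "name": (
        "projects/sldatascience/locations/us-central1"
        "/ragCorpora/6917529027641081856/ragFiles/5438241315519446597"
    ),
    "fileId": "5438241315519446597",
    "fileLocation": "us-central1",
}

parts = parse_rag_file_name(sample["name"])
print(parts["corpus_id"], parts["file_id"])
```

In the sample, the parsed file ID and location match the fileId and fileLocation fields, so either source can be used downstream.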

Snap settings

Legend:
  • Expression icon: When enabled, lets you use JavaScript syntax to access SnapLogic Expressions and set field values dynamically. When disabled, you provide a static value.
  • SnapGPT: Generates SnapLogic Expressions from natural language.
  • Suggestion icon: Populates a list of values dynamically based on your Snap configuration. You can select only one attribute at a time using the icon. Type into the field if it supports a comma-separated list of values.
  • Upload icon: Uploads files.
Learn more about the icons in the Snap settings dialog.
Field/Field set Type Description
Label String

Required. Specify a unique name for the Snap. Change the default to something more descriptive, especially if the pipeline contains more than one instance of this Snap.

Default value: Google VertexAI RAG Corpus Add File

Example: Add document to corpus

Location String/Expression/Suggestion

The location of the corpus. By default, the account location is used.

Default value: N/A

Example: global / us-central1
Corpus name String/Expression/Suggestion

Required. The name of the corpus.

Default value: N/A

Example: 7991637538768945152:test-corpus
File display name String/Expression

Required. The display name of the file.

Default value: N/A

Example: report-document
File description String/Expression

The description of the file.

Default value: N/A

Example: The sales report for 2025.
Chunk size Integer/Expression

The size of each chunk that the file is split into when it is added to the corpus.

Default value: N/A

Example: 512
Chunk overlap Integer/Expression

The overlap between consecutive chunks of the file. This value must be less than the value of Chunk size.

Default value: N/A

Example: 128
Snap execution Dropdown list
Choose one of the three modes in which the Snap executes. Available options are:
  • Validate & Execute: Performs limited execution of the Snap and generates a data preview during pipeline validation. Subsequently, performs full execution of the Snap (unlimited records) during pipeline runtime.
  • Execute only: Performs full execution of the Snap during pipeline execution without generating preview data.
  • Disabled: Disables the Snap and all Snaps that are downstream from it.

Default value: Validate & Execute

Example: Execute only
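The rule that Chunk overlap must be less than Chunk size can be illustrated with a small sliding-window sketch. This is an illustration only: the source does not describe how the service actually splits files or what unit it counts in, so the functions validate_chunking and chunk_spans below are hypothetical and work on abstract units.

```python
def validate_chunking(chunk_size: int, chunk_overlap: int) -> None:
    """Reject settings that break the 'overlap < size' rule (hypothetical helper)."""
    if chunk_size <= 0:
        raise ValueError("Chunk size must be a positive integer")
    if chunk_overlap < 0:
        raise ValueError("Chunk overlap must not be negative")
    if chunk_overlap >= chunk_size:
        raise ValueError("Chunk overlap must be less than Chunk size")


def chunk_spans(total_units: int, chunk_size: int, chunk_overlap: int) -> list:
    """Return (start, end) spans of overlapping windows over total_units."""
    validate_chunking(chunk_size, chunk_overlap)
    # Each window starts this many units after the previous one.
    # If overlap were >= size, the step would be <= 0 and never advance.
    step = chunk_size - chunk_overlap
    spans = []
    start = 0
    while start < total_units:
        end = min(start + chunk_size, total_units)
        spans.append((start, end))
        if end == total_units:
            break
        start += step
    return spans


# Example values from the settings table: Chunk size 512, Chunk overlap 128.
print(chunk_spans(1200, 512, 128))
# -> [(0, 512), (384, 896), (768, 1200)]
```

With the example values, each chunk shares 128 units with the previous one, which is why an overlap equal to or larger than the chunk size cannot work.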

File type support

  • HTML file - 10 MB
  • JSON file - 10 MB
  • JSONL or NDJSON file - 10 MB
  • Markdown file - 10 MB
  • Microsoft PowerPoint slides (PPTX file) - 10 MB
  • Microsoft Word documents (DOCX file) - 50 MB
  • PDF file - 50 MB
  • Text file - 10 MB
  • Image files

    Note: Image files can generate lower-quality responses.

    • PNG
    • WEBP
    • HEIC
    • HEIF
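The list above can be turned into a pre-upload check. The following is a rough sketch, assuming the sizes are per-file maximums, that 1 MB means 1024 × 1024 bytes, and that the file extensions map to the listed types as shown. The extension mapping and the function is_within_limit are hypothetical and not part of the Snap. No size is listed for image files, so the sketch does not enforce one.

```python
from pathlib import Path

MB = 1024 * 1024  # assumption: the source does not say whether MB is binary or decimal

# Limits from the list above; the extension mapping is an assumption.
MAX_SIZE_MB = {
    ".html": 10,
    ".json": 10,
    ".jsonl": 10,
    ".ndjson": 10,
    ".md": 10,
    ".pptx": 10,
    ".docx": 50,
    ".pdf": 50,
    ".txt": 10,
}

# Image types are listed without a size limit.
IMAGE_EXTENSIONS = {".png", ".webp", ".heic", ".heif"}


def is_within_limit(file_name: str, size_bytes: int) -> bool:
    """Return True if the file type is listed and the size fits its limit."""
    ext = Path(file_name).suffix.lower()
    if ext in IMAGE_EXTENSIONS:
        return True  # no limit listed for images, so nothing to compare against
    if ext not in MAX_SIZE_MB:
        raise ValueError(f"File type {ext or '(none)'} is not in the supported list")
    return size_bytes <= MAX_SIZE_MB[ext] * MB


print(is_within_limit("report.pdf", 40 * MB))  # PDF limit is 50 MB
print(is_within_limit("notes.md", 11 * MB))    # Markdown limit is 10 MB
```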