Azure OpenAI Embedder

Overview

Use this Snap to generate an embedding vector for the text in each input document. Regardless of the batch size, the Snap produces one output document, containing the corresponding embedding, for each input document.



  • Transform-type Snap
  • Works in Ultra Tasks when the Batch size is set to 1.

Prerequisites

Deploy the required model in the Azure OpenAI Studio portal. Learn more about access to Azure OpenAI.

Known issues

  • The Snap fails with a connection reset error when the deployment server's timeout is shorter than the Snap's preconfigured read timeout (15 minutes).
Workarounds:
  • Reduce the Batch size value.
  • Deploy the model on better hardware for faster embedding processing.

Snap views

View | Description | Examples of upstream and downstream Snaps
Input | This Snap has at most one document input view. The Snap requires the text for which to generate the embedding vector. | Mapper
Output | This Snap has at most one document output view. The Snap provides the embedding vectors and the original input document. | Mapper
Error

Error handling is a generic way to handle errors without losing data or failing the Snap execution. To handle errors that the Snap might encounter when the pipeline runs, choose one of the following options from the When errors occur list on the Views tab:

  • Stop Pipeline Execution: Stops the current pipeline execution when the Snap encounters an error.
  • Discard Error Data and Continue: Ignores the error, discards that record, and continues with the remaining records.
  • Route Error Data to Error View: Routes the error data to an error view without stopping the Snap execution.

Learn more about Error handling in Pipelines.
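
The three options above can be pictured with a minimal Python sketch. This is an illustration of the described semantics only, not SnapLogic code: the function name `run_with_error_policy`, the policy strings, and the `to_upper` transform are hypothetical stand-ins.

```python
def run_with_error_policy(records, transform, policy):
    """Apply transform to each record under one of three error policies.

    policy is a hypothetical label: "stop", "discard", or "route".
    Returns (outputs, errors); errors is filled only for "route".
    """
    if policy not in ("stop", "discard", "route"):
        raise ValueError(f"unknown policy: {policy}")
    outputs, errors = [], []
    for record in records:
        try:
            outputs.append(transform(record))
        except Exception as exc:
            if policy == "stop":
                # Stop Pipeline Execution: propagate the error immediately.
                raise
            if policy == "route":
                # Route Error Data to Error View: keep the failing record.
                errors.append({"record": record, "error": str(exc)})
            # Discard Error Data and Continue: drop the record and move on.
    return outputs, errors


def to_upper(record):
    # Hypothetical transform that fails when the text is empty.
    if not record.get("msg"):
        raise ValueError("missing text")
    return {"msg": record["msg"].upper()}


records = [{"msg": "a"}, {"msg": ""}, {"msg": "c"}]
discard_out, discard_err = run_with_error_policy(records, to_upper, "discard")
route_out, route_err = run_with_error_policy(records, to_upper, "route")
```

With the "route" policy the failing record is preserved alongside its error message, which is the main reason to prefer it when no data may be lost.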

Snap settings

Note:
  • Suggestion icon: Indicates a list that is dynamically populated based on the configuration.
  • Expression icon: Indicates whether the value is an expression (if enabled) or a static value (if disabled). Learn more about Using Expressions in SnapLogic.
  • Add icon (Plus Icon): Indicates that you can add fields in the field set.
  • Remove icon (Minus Icon): Indicates that you can remove fields from the field set.
Field / Field set | Type | Description
Label | String

Required. Specify a unique name for the Snap. Choose a more descriptive name, especially if the pipeline contains more than one Snap of the same type.

Default value: Azure OpenAI Embedder

Example: Embedding data
Deployment ID | String/Expression/Suggestion
Required. Specify the model or deployment ID of the model deployed in the Azure OpenAI Studio portal. Learn more about how to retrieve the ID and the list of compatible models.
Note: The Suggestions list might not include every available deployment ID because of limitations in the Azure APIs.

Workaround: Manually enter the Deployment ID associated with the model you plan to use. You can find it on the Deployments page in the Azure OpenAI portal.

Default value: N/A

Example: snaplogic-gpt-4
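
For orientation, the public Azure OpenAI REST API addresses an embeddings call by deployment ID. The Python sketch below only builds the URL and JSON body and sends nothing. It is not a description of the Snap's internals: the resource name `my-resource`, the deployment name `my-embedding-deployment`, the helper `build_embeddings_request`, and the choice of API version are assumptions made for illustration.

```python
import json


def build_embeddings_request(resource_name, deployment_id, texts,
                             api_version="2023-05-15"):
    """Return (url, body) for an Azure OpenAI embeddings call; nothing is sent."""
    url = (
        f"https://{resource_name}.openai.azure.com/openai/deployments/"
        f"{deployment_id}/embeddings?api-version={api_version}"
    )
    # The embeddings endpoint accepts a list of input strings.
    body = json.dumps({"input": list(texts)})
    return url, body


# Hypothetical resource and deployment names.
url, body = build_embeddings_request(
    "my-resource", "my-embedding-deployment", ["hello world"]
)
print(url)
print(body)
```

The deployment ID appears in the URL path, which is why a manually entered ID works even when the Suggestions list does not show it.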
Batch size | Integer/Expression
Required. Specify the number of documents to batch per request. In Ultra mode, set the Batch size to 1.
Note:
  • This field does not support input values from the upstream Snap.
  • When the batch size is greater than 1, the Snap accumulates input documents until it reaches the specified batch size, then sends the accumulated documents to the endpoint in a single request.
  • The Snap sends output documents one at a time, in the order in which the input documents were received.

Maximum value: 2048

Default value: N/A

Example: 50
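
The batching behavior described in the note above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the Snap's implementation: `embed_in_batches`, `fake_embed`, and the output field name `embedding` are hypothetical, and flushing a final partial batch when the input ends is assumed rather than documented.

```python
from typing import Callable, Dict, Iterable, Iterator, List

Vector = List[float]


def _flush(pending: List[Dict], text_field: str,
           embed_batch: Callable[[List[str]], List[Vector]]) -> Iterator[Dict]:
    # One request per accumulated batch.
    vectors = embed_batch([doc[text_field] for doc in pending])
    # Emit outputs one at a time, in the order the inputs arrived.
    for doc, vector in zip(pending, vectors):
        yield {**doc, "embedding": vector}  # "embedding" is a hypothetical field name


def embed_in_batches(docs: Iterable[Dict], text_field: str, batch_size: int,
                     embed_batch: Callable[[List[str]], List[Vector]]) -> Iterator[Dict]:
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    pending: List[Dict] = []
    for doc in docs:
        pending.append(doc)
        if len(pending) == batch_size:
            yield from _flush(pending, text_field, embed_batch)
            pending = []
    if pending:
        # Assumption: a final partial batch is sent when the input ends.
        yield from _flush(pending, text_field, embed_batch)


call_sizes = []


def fake_embed(texts: List[str]) -> List[Vector]:
    # Stand-in for the embedding endpoint; records the size of each request.
    call_sizes.append(len(texts))
    return [[float(len(text))] for text in texts]


docs = [{"msg": "a"}, {"msg": "bb"}, {"msg": "ccc"}]
results = list(embed_in_batches(docs, "msg", 2, fake_embed))
print(call_sizes)
print([r["msg"] for r in results])
```

A larger batch size means fewer requests but more work per request, which is why reducing the Batch size is a workaround for the timeout issue listed under Known issues.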
Text to embed | String/Expression

Required. Specify the text to generate the embedding vector.

Default value: N/A

Example: $msg
Snap execution | Dropdown list
Select one of the following three modes in which the Snap executes:
  • Validate & Execute. Performs limited execution of the Snap and generates a data preview during pipeline validation. Subsequently, performs full execution of the Snap (unlimited records) during pipeline runtime.
  • Execute only. Performs full execution of the Snap during pipeline execution without generating preview data.
  • Disabled. Disables the Snap and all Snaps that are downstream from it.

Default value: Validate & Execute

Example: Execute only

Examples