MultimodalCompletion​Multimodal​Completion

Use Multimodal completion using the Gemini Client.

See Gemini API about multimodal input for more information.

yaml
type: "io.kestra.plugin.gemini.MultimodalCompletion"

Multimodal completion using the Gemini Client

yaml
id: gemini_multimodal_completion
namespace: company.team

inputs:
  - id: image
    type: FILE

tasks:
  - id: multimodal_completion
    type: io.kestra.plugin.gemini.MultimodalCompletion
    apiKey: "{{ secret('GEMINI_API_KEY') }}"
    model: "gemini-2.5-flash"
    contents:
      - content: Can you describe this image?
      - mimeType: image/jpeg
        content: "{{ inputs.image }}"
Properties

Gemini API Key

The chat content prompt for the model to respond to

Model

Specifies which generative model (e.g., 'gemini-1.5-flash', 'gemini-1.0-pro') to use for the completion.

Default false

Whether the response has been blocked for safety reasons

The reason the generation has finished

The response safety ratings

The generated response text

Whether the response has been blocked for safety reasons.

Safety category.

Safety rating probability.

The content itself, should be a string for text content or a Kestra internal storage URI for other content types.

If the content is not text, the mimeType property must be set.

Mime type of the content, use it only when the content is not text.

Default user

The content role, defaults to "user".