Weaviate Embedding Store

yaml
type: "io.kestra.plugin.ai.embeddings.Weaviate"

Ingest documents into a Weaviate embedding store.

yaml
id: document-ingestion
namespace: company.team

tasks:
  - id: ingest
    type: io.kestra.plugin.ai.rag.IngestDocument
    provider:
      type: io.kestra.plugin.ai.provider.GoogleGemini
      modelName: gemini-embedding-exp-03-07
      apiKey: "{{ secret('GEMINI_API_KEY') }}"
    embeddings:
      type: io.kestra.plugin.ai.embeddings.Weaviate
      apiKey: "{{ secret('WEAVIATE_API_KEY') }}"
      host: your-weaviate-host
    drop: true
    fromExternalURLs:
      - https://raw.githubusercontent.com/kestra-io/docs/refs/heads/main/content/blogs/release-0-22.md
Properties

Weaviate API key

Your Weaviate API key. Not required for local deployment.

Weaviate host

The host, e.g. "ai-4jw7ufd9.weaviate.network" of cluster URL. Find in under Details of your Weaviate cluster.

Weaviate avoid dups

If true (default), then WeaviateEmbeddingStore will generate a hashed ID based on provided text segment, which avoids duplicated entries in DB. If false, then random ID will be generated.

Possible Values
ONEQUORUMALL

Weaviate consistency level

Consistency level: ONE, QUORUM (default) or ALL.

gRPC port if used

Weaviate metadata field name

The name of the metadata field to store. If not provided, will default to "_metadata".

SubType string

Weaviate metadata keys

The list of metadata keys to store. If not provided, will default to an empty list.

Weaviate object class

The object class you want to store, e.g. "MyGreatClass". Must start from an uppercase letter. If not provided, will default to "Default".

Weaviate port

The port, e.g. 8080. This parameter is optional.

Weaviate scheme

The scheme, e.g. "https" of cluster URL. Find in under Details of your Weaviate cluster.

The gRPC connection is secured

Use gRPC for inserts

Use GRPC instead of HTTP for batch inserts only. You still need HTTP configured for search.