How is this different from the Argo CD page?

Argo CD reconciles desired state from Git: you commit a manifest, the controller applies it. This plugin acts on the cluster directly: PodCreate runs a workload now, kubectl.Apply applies a manifest now (driven by a non-Git event), kubectl.Restart rolls a Deployment now. Use Argo CD when Git is the source of truth for what should be deployed. Use this plugin when the trigger source is upstream events (Vault rotation, a flow's prior step output, a schedule) and the cluster needs to react. Most teams run both.

What does PodCreate actually do?

PodCreate spawns a Pod from a manifest spec in a chosen namespace, streams container logs back into the Kestra execution view, supports init containers for input data (download a dataset before the main container starts) and sidecar containers for output data (upload artifacts after the main container ends), and by default deletes the Pod after completion. The Pod runs on the cluster's own nodes, so a GPU job runs on a GPU node, a high-memory job on a high-memory node, no extra infra on Kestra's side.

Can I run a Spark job, a Python script, or an arbitrary container?

Yes, anything that runs as a Pod. Spark-on-Kubernetes operators expose a SparkApplication CRD that kubectl.Apply can submit. A plain Python script runs as a Pod with a Python image and args. A long-running batch job runs as a Pod with restartPolicy Never. The plugin does not impose a runtime; the Pod spec is yours. Pair with init/sidecar containers for input/output handling.

How does Kestra authenticate to the Kubernetes API?

Three options. In-cluster: when Kestra runs inside the cluster, the default service account works automatically. Kubeconfig: each task accepts a kubeConfig property containing a path or inline kubeconfig YAML, read from Kestra secrets. Token-based: pair with auth.EksToken from the AWS plugin to generate a short-lived k8s-aws-v1 token for EKS, then pass via kubeConfig. Same patterns for GKE and AKS via their respective auth tasks.

How does a rolling restart work in this plugin?

kubectl.Restart targets a Deployment or StatefulSet by name in a namespace, and triggers the same rollout that kubectl rollout restart performs: the controller annotates the Pod template with a timestamp, which forces the Pods to be recreated according to the rollout strategy. waitUntilReady is ignored for this task because the rollout itself is the wait, but pair it with kubectl.Get to block downstream tasks until the rollout completes.

Can the same flow apply to multiple clusters in one execution?

Yes, that is the standard pattern for multi-tenant SaaS, multi-region deployments, and DR. A ForEach over a list of (cluster, kubeconfig_secret_name) tuples runs the same task definition per cluster, reading the matching kubeconfig from Kestra secrets per iteration. Concurrency controls cap blast radius. Per-cluster errors route to an errors branch that rolls back only the affected cluster.

Is the Kubernetes plugin Enterprise-only?

No. The io.kestra.plugin.kubernetes plugin ships in the open-source edition with PodCreate, kubectl.Apply, kubectl.Get, kubectl.Patch, kubectl.Delete, and kubectl.Restart. Kestra Enterprise adds Apps (typed self-service forms so developers trigger a Pod run through a UI), namespace-scoped RBAC, audit logs, worker isolation, and the Kubernetes task runner that runs every flow task itself as a Kubernetes Pod.

Can Kestra orchestrate Kubernetes in an air-gapped cluster?

Yes. Kestra ships self-hosted on Docker or Kubernetes, often deployed into the same cluster it orchestrates. In-cluster authentication uses the service account by default. The OSS edition runs air-gapped with no external dependency. See Kestra for infrastructure automation for the broader picture across Kubernetes, Argo CD, Terraform, and the rest of the platform.

Run Pods inside Kubernetes as flow steps.

Schedule a Pod with GPU as a task and stream container logs back to the Kestra execution view. Server-side apply manifests on event, restart Deployments after Secret rotation, patch ConfigMaps when an upstream value changes, and run the same flow across many clusters from one definition.

Book a Demo Get Started

Blueprints for Kubernetes orchestration.

Connect Kubernetes to your platform with a workflow engine that runs Pods as flow steps, applies manifests on non-Git events, restarts Deployments after Secret rotation, and chains kubectl with Argo CD, Vault, and GitHub in one audited flow. Drop the duplicate GPU hardware bill; span every tenant cluster from one flow via per-task kubeConfig.

PodCreate with GPU request, stream logs, save the model artifactOpen blueprint

id: kubernetes-gpu-pod-artifact
namespace: company.team
description: |
  Run a GPU workload as a Kubernetes pod and collect its artifact. The pod
  requests a GPU, runs the job, writes a result file that Kestra pulls back into
  internal storage, and Slack reports completion.

tasks:
  - id: run_gpu_pod
    type: io.kestra.plugin.kubernetes.core.PodCreate
    description: Schedule a GPU pod, run the job, and collect its output file.
    namespace: gpu-jobs
    waitUntilRunning: PT10M
    spec:
      containers:
        - name: train
          image: nvidia/cuda:12.4.0-runtime-ubuntu22.04
          command:
            - bash
            - -c
            - 'nvidia-smi > {{ workingDir }}/result.txt 2>&1 || echo "no gpu" >
              {{ workingDir }}/result.txt'
          resources:
            limits:
              nvidia.com/gpu: 1
      restartPolicy: Never
    outputFiles:
      - result.txt

  - id: notify
    type: io.kestra.plugin.slack.notifications.SlackIncomingWebhook
    description: Report that the GPU job finished and its artifact was collected.
    url: "{{ secret('SLACK_WEBHOOK_URL') }}"
    payload: |
      {
        "text": "GPU pod finished; artifact stored at {{ outputs.run_gpu_pod.outputFiles['result.txt'] }}."
      }

pluginDefaults:
  - type: io.kestra.plugin.kubernetes.core.PodCreate
    values:
      connection:
        masterUrl: "{{ secret('K8S_MASTER_URL') }}"
        oauthToken: "{{ secret('K8S_TOKEN') }}"

triggers:
  - id: nightly_job
    type: io.kestra.plugin.core.trigger.Schedule
    description: Run the GPU job nightly. Adjust or disable as needed.
    cron: "0 2 * * *"
    disabled: true

Rotate a Vault secret, restart every Deployment that reads itOpen blueprint

id: kubernetes-vault-rotation-restart
namespace: company.team
description: |
  Rotate a secret from Vault into Kubernetes and roll the workloads that use it.
  Read the fresh secret from Vault, patch the Kubernetes Secret, trigger a rolling
  restart of the Deployment, and confirm the rollout.

inputs:
  - id: namespace
    type: STRING
    defaults: production
    description: Kubernetes namespace holding the secret and deployment.
  - id: secret_name
    type: STRING
    defaults: app-credentials
    description: Kubernetes Secret to patch with the rotated value.
  - id: deployment_name
    type: STRING
    defaults: my-api
    description: Deployment to roll after the secret is rotated.
  - id: vault_path
    type: STRING
    defaults: https://vault.example.com/v1/secret/data/app
    description: Vault API URL that returns the current secret.

tasks:
  - id: read_from_vault
    type: io.kestra.plugin.core.http.Request
    description: Read the current secret value from Vault.
    uri: "{{ inputs.vault_path }}"
    method: GET
    headers:
      X-Vault-Token: "{{ secret('VAULT_TOKEN') }}"

  - id: patch_secret
    type: io.kestra.plugin.kubernetes.kubectl.Patch
    description: Patch the Kubernetes Secret with the rotated, base64-encoded value.
    namespace: "{{ inputs.namespace }}"
    resourceType: secret
    resourceName: "{{ inputs.secret_name }}"
    patchStrategy: JSON_MERGE
    patch: |
      {
        "data": {
          "password": "{{ outputs.read_from_vault.body | jq('.data.data.password') | first | base64encode }}"
        }
      }

  - id: restart_deployment
    type: io.kestra.plugin.kubernetes.kubectl.Restart
    description: Trigger a rolling restart so pods pick up the new secret.
    namespace: "{{ inputs.namespace }}"
    resourceType: Deployment
    resourcesNames:
      - "{{ inputs.deployment_name }}"

  - id: verify_rollout
    type: io.kestra.plugin.kubernetes.kubectl.Get
    description: Read the deployment back to confirm the rollout.
    namespace: "{{ inputs.namespace }}"
    resourceType: deployments
    resourcesNames:
      - "{{ inputs.deployment_name }}"
    fetchType: FETCH

pluginDefaults:
  - type: io.kestra.plugin.kubernetes.kubectl.Patch
    values:
      connection:
        masterUrl: "{{ secret('K8S_MASTER_URL') }}"
        oauthToken: "{{ secret('K8S_TOKEN') }}"
  - type: io.kestra.plugin.kubernetes.kubectl.Restart
    values:
      connection:
        masterUrl: "{{ secret('K8S_MASTER_URL') }}"
        oauthToken: "{{ secret('K8S_TOKEN') }}"
  - type: io.kestra.plugin.kubernetes.kubectl.Get
    values:
      connection:
        masterUrl: "{{ secret('K8S_MASTER_URL') }}"
        oauthToken: "{{ secret('K8S_TOKEN') }}"

triggers:
  - id: weekly_rotation
    type: io.kestra.plugin.core.trigger.Schedule
    description: Rotate weekly. Adjust or disable as needed.
    cron: "0 3 * * 0"
    disabled: true

ForEach tenant: apply the same manifest, wait for Ready, report per tenantOpen blueprint

id: kubernetes-multicluster-apply
namespace: company.team
description: |
  Apply the same manifest across many tenant clusters. Iterate over a list of
  cluster connections, server-side apply the manifest to each, and read the
  resource back to confirm it landed.

inputs:
  - id: clusters
    type: ARRAY
    itemType: STRING
    description: Cluster names used to resolve per-cluster connection secrets
      (K8S_MASTER_URL_<name>, K8S_TOKEN_<name>).
  - id: target_namespace
    type: STRING
    defaults: platform
    description: Namespace the manifest is applied into on every cluster.

tasks:
  - id: apply_across_clusters
    type: io.kestra.plugin.core.flow.ForEach
    description: Apply and verify the manifest on each tenant cluster.
    values: "{{ inputs.clusters }}"
    concurrencyLimit: 3
    tasks:
      - id: apply_manifest
        type: io.kestra.plugin.kubernetes.kubectl.Apply
        description: Server-side apply the shared manifest to this cluster.
        connection:
          masterUrl: "{{ secret('K8S_MASTER_URL_' ~ taskrun.value) }}"
          oauthToken: "{{ secret('K8S_TOKEN_' ~ taskrun.value) }}"
        namespace: "{{ inputs.target_namespace }}"
        spec: |
          apiVersion: v1
          kind: ConfigMap
          metadata:
            name: platform-baseline
          data:
            managed-by: kestra
            policy-version: "2024.1"

      - id: verify_apply
        type: io.kestra.plugin.kubernetes.kubectl.Get
        description: Read the applied resource back to confirm it exists.
        connection:
          masterUrl: "{{ secret('K8S_MASTER_URL_' ~ taskrun.value) }}"
          oauthToken: "{{ secret('K8S_TOKEN_' ~ taskrun.value) }}"
        namespace: "{{ inputs.target_namespace }}"
        resourceType: configmaps
        resourcesNames:
          - platform-baseline
        fetchType: FETCH

triggers:
  - id: rollout_on_schedule
    type: io.kestra.plugin.core.trigger.Schedule
    description: Roll the baseline out on a schedule. Adjust or disable as needed.
    cron: "0 8 * * 1"
    disabled: true

Browse all 23 Kubernetes blueprints

Direct cluster API, not GitOps reconciliation.

Argo CD reconciles what should be deployed from Git. This plugin acts on the cluster directly: spawns Pods as flow steps, patches Secrets, rolls Deployments, reads live state into flow inputs. Two complementary patterns, two different sources of truth, both first-class in Kestra.

PodCreate runs a workload inside the cluster, no shell-out

PodCreate spawns a Pod from a manifest spec, streams container logs back into the Kestra execution view, handles file upload via init containers and download via sidecars, and deletes the Pod by default after completion. Run a 50GB Python training job, a Spark batch on EMR-on-EKS, or a CUDA workload using the cluster's own GPU nodes. No kubectl in a Shell task, no separate Kestra runner with GPUs.

Server-side apply when the source of truth isn't Git

kubectl.Apply runs server-side apply on YAML or JSON manifests with optional waitUntilReady. Drive the apply from any upstream event: a Vault secret rotation, a config value updated in a non-Git source, an approval gate that just passed. Argo CD reconciles from Git; this task applies directly when the source of truth is upstream events.

Rolling restart on event, not on annotation-poke

kubectl.Restart triggers a rolling restart on a Deployment or StatefulSet by name. Chain it after Secret rotation, ConfigMap update, or external dependency change. The kubectl rollout restart annotation pattern, but driven by an event chain rather than a human running a command. waitUntilReady is ignored here because the rollout itself is the wait.

Patch and Get for surgical changes and state reads

kubectl.Patch applies Strategic Merge (default), JSON Merge, or JSON Patch operations to a namespaced resource. waitUntilReady ensures reconciliation before downstream tasks fire. kubectl.Get reads resources into the flow context, so a ConfigMap value, a Deployment status, or a list of Pods matching a label all become flow inputs without parsing kubectl output.

Delete by kind and name, scoped to the namespace

kubectl.Delete removes named resources of a given kind from a namespace. Use to tear down test environments, clean up ephemeral preview deployments tied to closed PRs, or remove a tenant's resources when an offboarding flow fires. Namespaced resources only, with explicit apiGroup and apiVersion.

Same flow, different clusters via kubeconfig

Every task accepts a kubeconfig path or inline config from Kestra secrets. Run the same flow definition against staging, prod, or a per-customer tenant cluster by parameterizing the kubeconfig per execution. EKS, GKE, AKS, OpenShift, on-prem K8s all addressable from one flow. Pair with the EksToken task from the AWS plugin for short-lived EKS authentication.

How platform teams use Kubernetes and Kestra

Patterns engineering teams run in production today. Each one shows the flow end to end, with the real plugin classes in play.

Compute

Train an ML model on the cluster's GPU nodes from a flow

PodCreate spawns a Pod with nvidia.com/gpu: 1 and a node selector for the GPU pool. An init container mounts the training dataset from S3. The main container runs the training script. A sidecar uploads the model artifact and metrics back to Kestra internal storage. Logs stream into the execution view in real time, so a 4-hour training run is debuggable as it happens, not after it finishes.

Cluster's GPUs, not a separate Kestra worker

The expensive node pool the cluster already runs. No duplicate hardware on the orchestrator side.

Init + sidecar handle data movement

Input dataset mounted before the main container starts; output artifact uploaded after it ends. Same Pod lifecycle.

Logs streamed live to the execution view

Long-running jobs surface their progress in Kestra without external log shipping.

Pod is cleaned up after completion

Default behavior. Set a different lifecycle if you need to keep the Pod around for forensics.

scheduled trigger

or webhook

create gpu pod

PodCreate

stream logs

container output

collect artifact

via sidecar

external registry

notify

Slack with metrics

Secret rotation

Rotate a Vault secret, restart the Deployments that read it

A scheduled flow rotates the DB credential in Vault. Pods read the secret via the Vault sidecar, but they need a restart to pick up the new value if the sidecar caches. kubectl.Patch annotates the affected Deployments with a rollout timestamp; kubectl.Restart triggers a rolling restart; kubectl.Get polls Available replicas until the rollout completes. One flow, one execution ID across Vault and Kubernetes.

Rotation is one flow, not two systems

Vault, the Patch, the Restart, the wait, the notify. One execution to audit, one place to debug a failure.

Per-Deployment scope

Patch and Restart take resource names. List the Deployments that consume the rotated secret; the flow handles each.

Wait gated on real readiness

kubectl.Get with waitUntilReady polls until Available replicas equal desired. No fixed sleep, no flaky timing.

weekly schedule

cron

rotate in vault

new credential

patch deployments

annotate rollout

rolling restart

all reading pods

wait for ready

Available = desired

notify

Slack confirm

State-driven

Promote staging to prod only if Pods are Ready

After kubectl.Apply ships the new manifest to staging, kubectl.Get fetches the Deployment status. An If task branches on the Ready replica count vs desired. If matched, the flow applies the same manifest to prod. If not, the flow opens a GitHub issue with the failing namespace and Pod statuses, then stops. Promotion gated on cluster reality, never on a 'should be ready by now' sleep.

Gate on actual cluster state

Available replicas, Ready conditions, Pod phases. Not on a 'wait 5 minutes and hope' pattern.

Two environments, one manifest

Same YAML applied to staging and to prod. The branch only decides whether to promote, never what to apply.

Failure produces context, not silence

An errors branch opens a GitHub issue with the namespace, the failing Deployment, and the Pod statuses captured by Get.

release trigger

manual or schedule

apply staging

server-side apply

read status

kubectl.Get

ready?

branch on replicas

apply prod

same manifest

open issue on fail

errors branch

Multi-cluster

Apply the same manifest to every tenant cluster in parallel

A SaaS platform team runs one K8s cluster per customer tenant. A release flow iterates a list of (tenant_name, kubeconfig_secret) tuples via ForEach. Each iteration runs kubectl.Apply against the tenant cluster, waits for Ready, posts per-tenant status to Slack. The same flow definition spans all tenants. No per-cluster pipeline, no duplicated YAML, no separate CI per customer.

Kubeconfig per tenant, in Kestra secrets

One secret per cluster. The flow reads the right kubeconfig per iteration. No long-lived static creds in the flow YAML.

Parallel apply across tenants

ForEach with concurrency runs all tenant applies at once. A blast-radius limit caps concurrency for safety.

Per-tenant rollback path

If a tenant apply fails, the errors branch kubectl.Apply's the previous manifest revision for that tenant only.

release trigger

manual

fetch tenant list

from inventory

ForEach tenant

parallel

apply manifest

per-cluster kubeconfig

wait ready