Create a Databricks job and run it.

Set waitForCompletion to the desired maximum duration if you want the task to wait for the job completion (e.g., PT1H to wait up to one hour).

yaml
type: "io.kestra.plugin.databricks.job.CreateJob"

Create a Databricks job, run it, and wait for completion for five minutes.

yaml
id: databricks_job_create
namespace: company.team

tasks:
  - id: create_job
    type: io.kestra.plugin.databricks.job.CreateJob
    authentication:
      token: "{{ secret('DATABRICKS_TOKEN') }}"
    host: "{{ secret('DATABRICKS_HOST') }}"
    jobTasks:
      - existingClusterId: <your-cluster>
        taskKey: taskKey
        sparkPythonTask:
          pythonFile: /Shared/hello.py
          sparkPythonTaskSource: WORKSPACE
    waitForCompletion: PT5M
Properties
Min items 1

The job tasks, if multiple tasks are defined you must set dependsOn on each task.

Databricks account identifier.

Databricks authentication configuration.

This property allows to configure the authentication to Databricks, different properties should be set depending on the type of authentication and the cloud provider. All configuration options can also be set using the standard Databricks environment variables. Check the Databricks authentication guide for more information.

Databricks configuration file, use this if you don't want to configure each Databricks account properties one by one.

Databricks host.

The name of the job.

Format duration

If set, the task will wait for the job run completion for up to the waitForCompletion duration before timing out.

The job identifier.

Format uri

The job URI on the Databricks console.

The run identifier.

Format uri

The run URI on the Databricks console.

SubType string

Map of task parameters.

Can be a map of string/string or a variable that binds to a JSON object.

SubType string

Map of task base parameters.

Can be a map of string/string or a variable that binds to a JSON object.

Possible Values
GITWORKSPACE

DBT task settings.

SubType string

Task dependencies, set this if multiple tasks are defined on the job.

Task description.

The identifier of the cluster.

Task libraries.

Notebook task settings.

Pipeline task settings.

Python Wheel task settings.

Run job task settings.

Spark JAR task settings.

Spark Python task settings.

Spark Submit task settings.

SQL task settings.

Task key.

Task timeout in seconds.

SubType string

Map of task named parameters.

Can be a map of string/string or a variable that binds to a JSON object.

List of task parameters.

Can be a list of strings or a variable that binds to a JSON array of strings.

List of task parameters.

Can be a list of strings or a variable that binds to a JSON array of strings.

Possible Values
GITWORKSPACE

List of task parameters.

Can be a list of strings or a variable that binds to a JSON array of strings.

SubType string

List of task parameters.

Can be a list of strings or a variable that binds to a JSON array of strings.

SubType string