CreateJob
Create a Databricks job and run it.
Set waitForCompletion
to the desired maximum duration if you want the task to wait for the job completion (e.g., PT1H
to wait up to one hour).
type: "io.kestra.plugin.databricks.job.CreateJob"
Examples
Create a Databricks job, run it, and wait for completion for five minutes.
id: databricks_job_create
namespace: company.team
tasks:
- id: create_job
type: io.kestra.plugin.databricks.job.CreateJob
authentication:
token: "{{ secret('DATABRICKS_TOKEN') }}"
host: "{{ secret('DATABRICKS_HOST') }}"
jobTasks:
- existingClusterId: <your-cluster>
taskKey: taskKey
sparkPythonTask:
pythonFile: /Shared/hello.py
sparkPythonTaskSource: WORKSPACE
waitForCompletion: PT5M
Properties
jobTasks *Requiredarray
accountId string
Databricks account identifier.
authentication AbstractTask-AuthenticationConfig
Databricks authentication configuration.
This property allows to configure the authentication to Databricks, different properties should be set depending on the type of authentication and the cloud provider. All configuration options can also be set using the standard Databricks environment variables. Check the Databricks authentication guide for more information.
configFile string
Databricks configuration file, use this if you don't want to configure each Databricks account properties one by one.
host string
Databricks host.
jobName string
The name of the job.
waitForCompletion string
duration
If set, the task will wait for the job run completion for up to the waitForCompletion
duration before timing out.
Outputs
jobId integer
The job identifier.
jobURI string
uri
The job URI on the Databricks console.
runId integer
The run identifier.
runURI string
uri
The run URI on the Databricks console.
Definitions
io.kestra.plugin.databricks.job.task.SqlTaskSetting
parameters stringobject
Map of task parameters.
Can be a map of string/string or a variable that binds to a JSON object.
queryId string
warehouseId string
io.kestra.plugin.databricks.job.task.NotebookTaskSetting
baseParameters stringobject
Map of task base parameters.
Can be a map of string/string or a variable that binds to a JSON object.
notebookPath string
source string
GIT
WORKSPACE
io.kestra.plugin.databricks.job.CreateJob-JobTaskSetting
dbtTask DbtTaskSetting
DBT task settings.
dependsOn array
Task dependencies, set this if multiple tasks are defined on the job.
description string
Task description.
existingClusterId string
The identifier of the cluster.
notebookTask NotebookTaskSetting
Notebook task settings.
pipelineTask PipelineTaskSetting
Pipeline task settings.
pythonWheelTask PythonWheelTaskSetting
Python Wheel task settings.
runJobTask RunJobTaskSetting
Run job task settings.
sparkJarTask SparkJarTaskSetting
Spark JAR task settings.
sparkPythonTask SparkPythonTaskSetting
Spark Python task settings.
sparkSubmitTask SparkSubmitTaskSetting
Spark Submit task settings.
sqlTask SqlTaskSetting
SQL task settings.
taskKey string
Task key.
timeoutSeconds integerstring
Task timeout in seconds.
io.kestra.plugin.databricks.job.task.PythonWheelTaskSetting
entryPoint string
namedParameters stringobject
Map of task named parameters.
Can be a map of string/string or a variable that binds to a JSON object.
packageName string
parameters stringarray
List of task parameters.
Can be a list of strings or a variable that binds to a JSON array of strings.
io.kestra.plugin.databricks.job.task.LibrarySetting-CranSetting
_package string
repo string
io.kestra.plugin.databricks.job.task.SparkSubmitTaskSetting
parameters stringarray
List of task parameters.
Can be a list of strings or a variable that binds to a JSON array of strings.
io.kestra.plugin.databricks.AbstractTask-AuthenticationConfig
authType string
azureClientId string
azureClientSecret string
azureTenantId string
clientId string
clientSecret string
googleCredentials string
googleServiceAccount string
password string
token string
username string
io.kestra.plugin.databricks.job.task.RunJobTaskSetting
jobId string
jobParameters object
io.kestra.plugin.databricks.job.task.SparkPythonTaskSetting
pythonFile *Requiredstring
sparkPythonTaskSource *Requiredstring
GIT
WORKSPACE
parameters stringarray
List of task parameters.
Can be a list of strings or a variable that binds to a JSON array of strings.
io.kestra.plugin.databricks.job.task.PipelineTaskSetting
fullRefresh booleanstring
pipelineId string
io.kestra.plugin.databricks.job.task.LibrarySetting
egg string
jar string
whl string
io.kestra.plugin.databricks.job.task.DbtTaskSetting
catalog string
commands array
schema string
warehouseId string
io.kestra.plugin.databricks.job.task.LibrarySetting-PypiSetting
_package string
repo string
io.kestra.plugin.databricks.job.task.SparkJarTaskSetting
jarUri string
mainClassName string
parameters stringarray
List of task parameters.
Can be a list of strings or a variable that binds to a JSON array of strings.