RSparkSubmit
Certified

Submit an Apache SparkR batch workload to a Google Cloud Dataproc cluster.

For more details, check out the Apache SparkR documentation.

```yaml
type: "io.kestra.plugin.gcp.dataproc.batches.RSparkSubmit"
```

Examples

```yaml
id: gcp_dataproc_r_spark_submit
namespace: company.team
tasks:
  - id: r_spark_submit
    type: io.kestra.plugin.gcp.dataproc.batches.RSparkSubmit
    mainRFileUri: 'gs://spark-jobs-kestra/dataframe.r'
    name: test-rspark
    region: europe-west3
```

Properties

mainRFileUri
Type: string

The HCFS URI of the main R file to use as the driver. Must be a .R or .r file.

name
Type: string

The batch name.

region
Type: string

The Dataproc region in which to run the batch workload.

archiveUris
Type: array
SubType: string

HCFS URIs of archives to be extracted into the working directory of each executor. Supported file types: .jar, .tar, .tar.gz, .tgz, and .zip.

args
Type: array
SubType: string

The arguments to pass to the driver.

Do not include arguments that can be set as batch properties, such as --conf, since a collision can occur and cause an incorrect batch submission.
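For illustration, a minimal sketch of passing driver arguments, assuming the property is named args as above; the argument values are hypothetical:

```yaml
- id: r_spark_submit
  type: io.kestra.plugin.gcp.dataproc.batches.RSparkSubmit
  mainRFileUri: 'gs://spark-jobs-kestra/dataframe.r'
  name: test-rspark
  region: europe-west3
  args:
    - "--input=gs://my-bucket/input.csv" # hypothetical application flag
    - "--mode=daily"                     # hypothetical application flag
```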

executionConfiguration
Type: object

Execution configuration for a workload.

Definitions
kmsKey
Type: string

The Cloud KMS key to use for encryption.

networkTags
Type: array
SubType: string

Tags used for network traffic control.

networkUri
Type: string

Network URI to connect workload to.

serviceAccountEmail
Type: string

Service account used to execute workload.

subnetworkUri
Type: string

Subnetwork URI to connect workload to.
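A minimal sketch of an executionConfiguration block, assuming the fields above map directly to nested YAML keys; the project, subnetwork, and KMS key names are hypothetical:

```yaml
executionConfiguration:
  serviceAccountEmail: batch-runner@my-project.iam.gserviceaccount.com
  subnetworkUri: projects/my-project/regions/europe-west3/subnetworks/default
  networkTags:
    - dataproc-batch
  kmsKey: projects/my-project/locations/europe-west3/keyRings/my-ring/cryptoKeys/my-key
```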

fileUris
Type: array
SubType: string

HCFS URIs of files to be placed in the working directory of each executor.

impersonatedServiceAccount
Type: string

The GCP service account to impersonate.

jarFileUris
Type: array
SubType: string

HCFS URIs of jar files to add to the classpath of the Spark driver and tasks.

Hadoop Compatible File System (HCFS) URIs should be accessible from the cluster. They can be GCS files with the gs:// prefix, HDFS files on the cluster with the hdfs:// prefix, or local files on the cluster with the file:// prefix.
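To illustrate the three URI schemes, a hedged sketch; all paths are hypothetical:

```yaml
jarFileUris:
  - 'gs://my-bucket/libs/helper.jar' # object in Google Cloud Storage
fileUris:
  - 'hdfs:///data/lookup.csv'        # file on the cluster's HDFS
  - 'file:///opt/config/app.conf'    # local file on the cluster
```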

peripheralsConfiguration
Type: object

Peripherals configuration for a workload.

Definitions
metastoreService
Type: string

Resource name of an existing Dataproc Metastore service.

Example: projects/[project_id]/locations/[region]/services/[service_id]

sparkHistoryServer
Type: object

Spark History Server configuration for the workload.

dataprocCluster
Type: string

Resource name of an existing Dataproc Cluster to act as a Spark History Server for the workload.

Example: projects/[project_id]/regions/[region]/clusters/[cluster_name]
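A minimal sketch of a peripheralsConfiguration block, assuming the same nesting as the definitions above; the resource names are hypothetical:

```yaml
peripheralsConfiguration:
  metastoreService: projects/my-project/locations/europe-west3/services/my-metastore
  sparkHistoryServer:
    dataprocCluster: projects/my-project/regions/europe-west3/clusters/my-history-cluster
```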

projectId
Type: string

The GCP project ID.

runtimeConfiguration
Type: object

Runtime configuration for a workload.

Definitions
containerImage
Type: string

Optional custom container image for the job runtime environment.

If not specified, a default container image will be used.

properties
Type: object
SubType: string

Properties used to configure the workload execution (a map of key/value pairs).

version
Type: string

Version of the batch runtime.
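A minimal sketch of a runtimeConfiguration block; the runtime version, image name, and Spark properties are illustrative, not defaults:

```yaml
runtimeConfiguration:
  version: "2.1"                                   # hypothetical batch runtime version
  containerImage: gcr.io/my-project/spark-r:latest # hypothetical custom image
  properties:
    spark.executor.instances: "4"
    spark.driver.memory: "4g"
```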

scopes
Type: array
SubType: string
Default: ["https://www.googleapis.com/auth/cloud-platform"]

The GCP scopes to be used.

serviceAccount
Type: string

The GCP service account.
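A hedged sketch of the authentication-related properties, assuming serviceAccount takes a service account key JSON, here read from a Kestra secret; the secret name and project ID are hypothetical:

```yaml
- id: r_spark_submit
  type: io.kestra.plugin.gcp.dataproc.batches.RSparkSubmit
  mainRFileUri: 'gs://spark-jobs-kestra/dataframe.r'
  name: test-rspark
  region: europe-west3
  projectId: my-project
  serviceAccount: "{{ secret('GCP_SERVICE_ACCOUNT_JSON') }}" # assumption: key JSON supplied via a Kestra secret
```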

Outputs

state
Type: string
Possible values: STATE_UNSPECIFIED, PENDING, RUNNING, CANCELLING, CANCELLED, SUCCEEDED, FAILED, UNRECOGNIZED

The state of the batch.
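To show how the batch state could be consumed downstream, a sketch assuming the output is exposed as state (the output name is an assumption):

```yaml
tasks:
  - id: r_spark_submit
    type: io.kestra.plugin.gcp.dataproc.batches.RSparkSubmit
    mainRFileUri: 'gs://spark-jobs-kestra/dataframe.r'
    name: test-rspark
    region: europe-west3
  - id: log_state
    type: io.kestra.plugin.core.log.Log
    message: "Batch finished with state {{ outputs.r_spark_submit.state }}" # 'state' output name assumed
```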