IonToAvro | Kestra

IonToAvro

Convert an ION file into Avro.

yaml
type: "io.kestra.plugin.serdes.avro.IonToAvro"

Examples

yaml
id: divvy_tripdata
namespace: company.team

variables:
  file_id: "{{ execution.startDate | dateAdd(-3, 'MONTHS') | date('yyyyMM') }}"

tasks:
  - id: get_zipfile
    type: io.kestra.plugin.core.http.Download
    uri: "https://divvy-tripdata.s3.amazonaws.com/{{ render(vars.file_id) }}-divvy-tripdata.zip"

  - id: unzip
    type: io.kestra.plugin.compress.ArchiveDecompress
    algorithm: ZIP
    from: "{{ outputs.get_zipfile.uri }}"

  - id: convert
    type: io.kestra.plugin.serdes.csv.CsvToIon
    from: "{{ outputs.unzip.files[render(vars.file_id) ~ '-divvy-tripdata.csv'] }}"

  - id: to_avro
    type: io.kestra.plugin.serdes.avro.IonToAvro
    from: "{{ outputs.convert.uri }}"
    datetimeFormat: "yyyy-MM-dd' 'HH:mm:ss"
    schema: |
      {
        "type": "record",
        "name": "Ride",
        "namespace": "com.example.bikeshare",
        "fields": [
          {"name": "ride_id", "type": "string"},
          {"name": "rideable_type", "type": "string"},
          {"name": "started_at", "type": {"type": "long", "logicalType": "timestamp-millis"}},
          {"name": "ended_at", "type": {"type": "long", "logicalType": "timestamp-millis"}},
          {"name": "start_station_name", "type": "string"},
          {"name": "start_station_id", "type": "string"},
          {"name": "end_station_name", "type": "string"},
          {"name": "end_station_id", "type": "string"},
          {"name": "start_lat", "type": "double"},
          {"name": "start_lng", "type": "double"},
          {
            "name": "end_lat",
            "type": ["null", "double"],
            "default": null
          },
          {
            "name": "end_lng",
            "type": ["null", "double"],
            "default": null
          },
          {"name": "member_casual", "type": "string"}
        ]
      }

Properties

from *string

dateFormat string

Defaultyyyy-MM-dd[XXX]

datetimeFormat string

Defaultyyyy-MM-dd'T'HH:mm[:ss][.SSSSSS][XXX]

decimalSeparator string

Default.

falseValues array

SubTypestring

Default["f","false","disabled","0","off","no",""]

inferAllFields booleanstring

Defaultfalse

nullValues array

SubTypestring

Default["","#N/A","#N/A N/A","#NA","-1.#IND","-1.#QNAN","-NaN","1.#IND","1.#QNAN","NA","n/a","nan","null"]

numberOfRowsToScan integerstring

Default100

onBadLines string

DefaultERROR

Possible Values

ERRORWARNSKIP

schema string

strictSchema booleanstring

Defaultfalse

timeFormat string

DefaultHH:mm[:ss][.SSSSSS][XXX]

timeZoneId string

DefaultEtc/UTC

trueValues array

SubTypestring

Default["t","true","enabled","1","on","yes"]

Outputs

uri string

Formaturi

Metrics

records counter

Stream kestra audit logs from a Kafka topic to BigQuery for analytics and troubleshooting

Download a ZIP file, unzip it, and convert it from CSV to Parquet

Core Plugins and tasks

Tasks that provide Kestra's built-in orchestration, I/O, and observability capabilities.

Core

MongoDB

Tasks that query and manipulate MongoDB collections.

Data

Weaviate

Tasks that manage and query vectors in Weaviate.

DataAI

IonToAvro

Convert an ION file into Avro.

Create automations with Serdes Avro IonToAvro

Stream kestra audit logs from a Kafka topic to BigQuery for analytics and troubleshooting

Download a ZIP file, unzip it, and convert it from CSV to Parquet

More Plugins in this Category

Core Plugins and tasks

MongoDB

Weaviate

1.4.1

IonToAvro Convert an ION file into Avro.

Create automations with Serdes Avro IonToAvro

Stream kestra audit logs from a Kafka topic to BigQuery for analytics and troubleshooting

Download a ZIP file, unzip it, and convert it from CSV to Parquet

More Plugins in this Category

Core Plugins and tasks

MongoDB

Weaviate

1.4.1

IonToAvro

Convert an ION file into Avro.