Skip to content

S3

Write events to a S3 bucket.

FieldTypeRequiredDescription
retryRetryHow to retry failed requests.
batchBatchBatching input events together.
input-fieldevent-field (string)Use the specified field as the content for the object.
bucket-namestringBucket Name.
object-names3_output:object-nameName of object, either as text or from a field.
modeModePut or delete the object? Default is Put.
preprocessorsPreprocessorsPreprocessors (process data before making it available for upload) these processors will be run in the order they are specified.
track-schemaboolean (bool)Check the schema of the written data and update __SCHEMA_NUMBER (written data must be JSON).
scubaScubaIf this Job is sending data to Scuba Lite, optionally have the Job configure the Scuba Lite instance.
Authentication
FieldTypeRequiredDescription
access-keystringAccess Key ID.
secret-keystringSecret Key ID.
security-tokenstringSecurity Token.
session-tokenstringSession Token.
role-arnstringA Role ARN for assuming role using above credentials.
Location
FieldTypeRequiredDescription
regionstringS3 Region.
endpointstringS3 Endpoint, if needed.
Object Properties
FieldTypeRequiredDescription
disable-object-name-guidboolean (bool)Disable the GUID prefix if you want object name to be treated literally (off for deletes).
guid-prefixstringGUID Prefix, will be prepended to the GUID, the default value is ”/”.
guid-suffixstringGUID Suffix, will be appended to the GUID if specified.
FieldTypeRequiredDescription
countintegerHow to retry? Either forever or for a limited number of times.
pausestringHow long to pause before re-trying.
FieldTypeRequiredDescription
fixed-sizeintegermaximum number of events in an output batch.
modesymbolIf ‘document’ send on end of document generated by input. If ‘fixed’, use fixed_size.
timeoutstringinterval after which the batch is sent, to keep throughput going (default 100ms).
headerstringput a header line before the batch.
footerstringput a header line after the last line of the batch.
use-document-markerboolEnrich the job metadata with a document marker (for document handling in batch mode).
wrap-as-jsonboolFormat the output batch as a JSON array.
FieldTypeRequiredDescription
hoststringThe Scuba Lite instance (for example: https://scuba-lite.example.com).
insecureboolAllow self-signed certificates for Scuba Lite connections.
tokenstringThe Scuba Lite API token.
table-namestringThe Scuba Lite table name.
file-patternstringThe Scuba Lite file search pattern.
columnsobjectThe columns specified by name and type.
ValueNameDescription
putputPut Objects
deletedeleteDelete Objects
ValueNameDescription
gzipgzipGzip the output data
parquetparquetExtract the received data as JSON rows from a parquet file
base64base64Decode base64 as binary