Interface ParquetOutputFormatProps
- All Superinterfaces:
software.amazon.jsii.JsiiSerializable
- All Known Implementing Classes:
ParquetOutputFormatProps.Jsii$Proxy
Example:
ParquetOutputFormat outputFormat = ParquetOutputFormat.Builder.create()
    .blockSize(Size.mebibytes(512))
    .compression(ParquetCompression.UNCOMPRESSED)
    .enableDictionaryCompression(true)
    .maxPadding(Size.bytes(10))
    .pageSize(Size.mebibytes(2))
    .writerVersion(ParquetWriterVersion.V2)
    .build();
-
Nested Class Summary
Nested Classes
Modifier and Type | Interface | Description
static final class | ParquetOutputFormatProps.Builder | A builder for ParquetOutputFormatProps
static final class | ParquetOutputFormatProps.Jsii$Proxy | An implementation for ParquetOutputFormatProps
-
Method Summary
Modifier and Type | Method | Description
static ParquetOutputFormatProps.Builder | builder() |
default Size | getBlockSize() | The Hadoop Distributed File System (HDFS) block size.
default ParquetCompression | getCompression() | The compression code to use over data blocks.
default Boolean | getEnableDictionaryCompression() | Indicates whether to enable dictionary compression.
default Size | getMaxPadding() | The maximum amount of padding to apply.
default Size | getPageSize() | The Parquet page size.
default ParquetWriterVersion | getWriterVersion() | Indicates the version of Parquet to output.
Methods inherited from interface software.amazon.jsii.JsiiSerializable
$jsii$toJson
-
Method Details
-
getBlockSize
The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. Firehose uses this value for padding calculations.
Default: `Size.mebibytes(256)`
-
getCompression
The compression code to use over data blocks. The possible values are UNCOMPRESSED, SNAPPY, and GZIP. Use SNAPPY for higher decompression speed. Use GZIP if the compression ratio is more important than speed.
Default: `SNAPPY`
-
getEnableDictionaryCompression
Indicates whether to enable dictionary compression.
Default: `false`
-
getMaxPadding
The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying.
Default: no padding is applied
-
getPageSize
The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.
Default: `Size.mebibytes(1)`
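The page size limits above can be sketched as a standalone pre-check. This is an illustration only: PageSizeCheck and effectivePageSizeBytes are hypothetical names that are not part of the CDK API, and this page does not say whether the library itself rejects values below the minimum.

```java
public final class PageSizeCheck {
    // Minimum stated in the getPageSize description: 64 KiB.
    public static final long MIN_PAGE_SIZE_BYTES = 64L * 1024;
    // Default stated in the getPageSize description: 1 MiB.
    public static final long DEFAULT_PAGE_SIZE_BYTES = 1024L * 1024;

    /**
     * Returns the page size that would take effect, in bytes.
     * A null request falls back to the documented default;
     * a request below the documented minimum is rejected.
     */
    public static long effectivePageSizeBytes(Long requestedBytes) {
        if (requestedBytes == null) {
            return DEFAULT_PAGE_SIZE_BYTES;
        }
        if (requestedBytes < MIN_PAGE_SIZE_BYTES) {
            throw new IllegalArgumentException(
                "Page size must be at least " + MIN_PAGE_SIZE_BYTES
                    + " bytes, got " + requestedBytes);
        }
        return requestedBytes;
    }

    public static void main(String[] args) {
        System.out.println(effectivePageSizeBytes(null));             // 1048576
        System.out.println(effectivePageSizeBytes(2L * 1024 * 1024)); // 2097152
    }
}
```

Working in raw bytes keeps the sketch independent of the Size class; in real CDK code the value would be passed as a Size, as in the example at the top of this page.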
-
getWriterVersion
Indicates the version of Parquet to output. The possible values are V1 and V2.
Default: `V1`
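Because every getter on this interface is optional, a property that is not set falls back to its documented default. The following is a minimal sketch of that resolution using hypothetical stand-in types (ParquetDefaultsSketch, Props, Compression, WriterVersion) instead of the real CDK classes, with sizes in raw bytes. It assumes an unset getter returns null. getMaxPadding is left out because its documented default is simply that no padding is applied.

```java
import java.util.Optional;

public final class ParquetDefaultsSketch {
    // Hypothetical stand-ins for ParquetCompression and ParquetWriterVersion.
    public enum Compression { UNCOMPRESSED, SNAPPY, GZIP }
    public enum WriterVersion { V1, V2 }

    // Hypothetical mirror of the props interface: null means "not set".
    public interface Props {
        default Long getBlockSizeBytes() { return null; }
        default Compression getCompression() { return null; }
        default Boolean getEnableDictionaryCompression() { return null; }
        default Long getPageSizeBytes() { return null; }
        default WriterVersion getWriterVersion() { return null; }
    }

    /** Resolves each unset property to the default documented on this page. */
    public static String describe(Props p) {
        long blockSize = Optional.ofNullable(p.getBlockSizeBytes()).orElse(256L * 1024 * 1024);
        Compression compression = Optional.ofNullable(p.getCompression()).orElse(Compression.SNAPPY);
        boolean dictionary = Optional.ofNullable(p.getEnableDictionaryCompression()).orElse(false);
        long pageSize = Optional.ofNullable(p.getPageSizeBytes()).orElse(1024L * 1024);
        WriterVersion version = Optional.ofNullable(p.getWriterVersion()).orElse(WriterVersion.V1);
        return "blockSize=" + blockSize
            + " compression=" + compression
            + " dictionary=" + dictionary
            + " pageSize=" + pageSize
            + " writerVersion=" + version;
    }

    public static void main(String[] args) {
        // Nothing set: every property resolves to its documented default.
        System.out.println(describe(new Props() {}));
        // Two properties overridden; the rest still use defaults.
        System.out.println(describe(new Props() {
            @Override public Compression getCompression() { return Compression.GZIP; }
            @Override public WriterVersion getWriterVersion() { return WriterVersion.V2; }
        }));
    }
}
```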
-
builder
- Returns:
- a ParquetOutputFormatProps.Builder of ParquetOutputFormatProps
-