AWS::DMS::Endpoint S3Settings
Provides information that defines an Amazon S3 endpoint. This information includes the output format of records applied to the endpoint and details of transaction and control table data information. For more information about the available settings, see Extra connection attributes when using Amazon S3 as a source for AWS DMS and Extra connection attributes when using Amazon S3 as a target for AWS DMS in the AWS Database Migration Service User Guide.
Syntax
To declare this entity in your AWS CloudFormation template, use the following syntax:
JSON
{
  "AddColumnName" : Boolean,
  "AddTrailingPaddingCharacter" : Boolean,
  "BucketFolder" : String,
  "BucketName" : String,
  "CannedAclForObjects" : String,
  "CdcInsertsAndUpdates" : Boolean,
  "CdcInsertsOnly" : Boolean,
  "CdcMaxBatchInterval" : Integer,
  "CdcMinFileSize" : Integer,
  "CdcPath" : String,
  "CompressionType" : String,
  "CsvDelimiter" : String,
  "CsvNoSupValue" : String,
  "CsvNullValue" : String,
  "CsvRowDelimiter" : String,
  "DataFormat" : String,
  "DataPageSize" : Integer,
  "DatePartitionDelimiter" : String,
  "DatePartitionEnabled" : Boolean,
  "DatePartitionSequence" : String,
  "DatePartitionTimezone" : String,
  "DictPageSizeLimit" : Integer,
  "EnableStatistics" : Boolean,
  "EncodingType" : String,
  "EncryptionMode" : String,
  "ExpectedBucketOwner" : String,
  "ExternalTableDefinition" : String,
  "GlueCatalogGeneration" : Boolean,
  "IgnoreHeaderRows" : Integer,
  "IncludeOpForFullLoad" : Boolean,
  "MaxFileSize" : Integer,
  "ParquetTimestampInMillisecond" : Boolean,
  "ParquetVersion" : String,
  "PreserveTransactions" : Boolean,
  "Rfc4180" : Boolean,
  "RowGroupLength" : Integer,
  "ServerSideEncryptionKmsKeyId" : String,
  "ServiceAccessRoleArn" : String,
  "TimestampColumnName" : String,
  "UseCsvNoSupValue" : Boolean,
  "UseTaskStartTimeForFullLoadTimestamp" : Boolean
}
YAML
AddColumnName: Boolean
AddTrailingPaddingCharacter: Boolean
BucketFolder: String
BucketName: String
CannedAclForObjects: String
CdcInsertsAndUpdates: Boolean
CdcInsertsOnly: Boolean
CdcMaxBatchInterval: Integer
CdcMinFileSize: Integer
CdcPath: String
CompressionType: String
CsvDelimiter: String
CsvNoSupValue: String
CsvNullValue: String
CsvRowDelimiter: String
DataFormat: String
DataPageSize: Integer
DatePartitionDelimiter: String
DatePartitionEnabled: Boolean
DatePartitionSequence: String
DatePartitionTimezone: String
DictPageSizeLimit: Integer
EnableStatistics: Boolean
EncodingType: String
EncryptionMode: String
ExpectedBucketOwner: String
ExternalTableDefinition: String
GlueCatalogGeneration: Boolean
IgnoreHeaderRows: Integer
IncludeOpForFullLoad: Boolean
MaxFileSize: Integer
ParquetTimestampInMillisecond: Boolean
ParquetVersion: String
PreserveTransactions: Boolean
Rfc4180: Boolean
RowGroupLength: Integer
ServerSideEncryptionKmsKeyId: String
ServiceAccessRoleArn: String
TimestampColumnName: String
UseCsvNoSupValue: Boolean
UseTaskStartTimeForFullLoadTimestamp: Boolean
Properties
- AddColumnName
- 
                    An optional parameter that, when set to true or y, adds column name information to the .csv output file. The default value is false. Valid values are true, false, y, and n. Required: No Type: Boolean Update requires: No interruption
- AddTrailingPaddingCharacter
- 
                    Use the S3 target endpoint setting AddTrailingPaddingCharacter to add padding on string data. The default value is false. Required: No Type: Boolean Update requires: No interruption
- BucketFolder
- 
                    An optional parameter to set a folder name in the S3 bucket. If provided, tables are created in the path bucketFolder/schema_name/table_name/. If this parameter isn't specified, the path used is schema_name/table_name/. Required: No Type: String Update requires: No interruption
- BucketName
- 
                    The name of the S3 bucket. Required: No Type: String Update requires: No interruption 
- CannedAclForObjects
- 
                    A value that enables AWS DMS to specify a predefined (canned) access control list (ACL) for objects created in an Amazon S3 bucket as .csv or .parquet files. For more information about Amazon S3 canned ACLs, see Canned ACL in the Amazon S3 Developer Guide. The default value is NONE. Valid values include NONE, PRIVATE, PUBLIC_READ, PUBLIC_READ_WRITE, AUTHENTICATED_READ, AWS_EXEC_READ, BUCKET_OWNER_READ, and BUCKET_OWNER_FULL_CONTROL. Required: No Type: String Allowed values: none | private | public-read | public-read-write | authenticated-read | aws-exec-read | bucket-owner-read | bucket-owner-full-control Update requires: No interruption
- CdcInsertsAndUpdates
- 
                    A value that enables a change data capture (CDC) load to write INSERT and UPDATE operations to .csv or .parquet (columnar storage) output files. The default setting is false, but when CdcInsertsAndUpdates is set to true or y, only INSERTs and UPDATEs from the source database are migrated to the .csv or .parquet file. For .csv file format only, how these INSERTs and UPDATEs are recorded depends on the value of the IncludeOpForFullLoad parameter. If IncludeOpForFullLoad is set to true, the first field of every CDC record is set to either I or U to indicate INSERT and UPDATE operations at the source. But if IncludeOpForFullLoad is set to false, CDC records are written without an indication of INSERT or UPDATE operations at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. Note: AWS DMS supports the use of the CdcInsertsAndUpdates parameter in versions 3.3.1 and later. CdcInsertsOnly and CdcInsertsAndUpdates can't both be set to true for the same endpoint; set one or the other to true, but not both. Required: No Type: Boolean Update requires: No interruption
- CdcInsertsOnly
- 
                    A value that enables a change data capture (CDC) load to write only INSERT operations to .csv or columnar storage (.parquet) output files. By default (the false setting), the first field in a .csv or .parquet record contains the letter I (INSERT), U (UPDATE), or D (DELETE). These values indicate whether the row was inserted, updated, or deleted at the source database for a CDC load to the target. If CdcInsertsOnly is set to true or y, only INSERTs from the source database are migrated to the .csv or .parquet file. For .csv format only, how these INSERTs are recorded depends on the value of IncludeOpForFullLoad. If IncludeOpForFullLoad is set to true, the first field of every CDC record is set to I to indicate the INSERT operation at the source. If IncludeOpForFullLoad is set to false, every CDC record is written without a first field to indicate the INSERT operation at the source. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. Note: AWS DMS supports the interaction described above between the CdcInsertsOnly and IncludeOpForFullLoad parameters in versions 3.1.4 and later. CdcInsertsOnly and CdcInsertsAndUpdates can't both be set to true for the same endpoint; set one or the other to true, but not both. Required: No Type: Boolean Update requires: No interruption
- CdcMaxBatchInterval
- 
                    Maximum length of the interval, defined in seconds, after which to output a file to Amazon S3. When CdcMaxBatchInterval and CdcMinFileSize are both specified, the file write is triggered by whichever parameter condition is met first within an AWS DMS CloudFormation template. The default value is 60 seconds. Required: No Type: Integer Update requires: No interruption
- CdcMinFileSize
- 
                    Minimum file size, defined in kilobytes, to reach for a file output to Amazon S3. When CdcMinFileSize and CdcMaxBatchInterval are both specified, the file write is triggered by whichever parameter condition is met first within an AWS DMS CloudFormation template. The default value is 32 MB. Required: No Type: Integer Update requires: No interruption
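As a rough illustration of how the two CDC batching settings combine, the following S3Settings fragment sets both. The bucket name and the numeric values are placeholders chosen for this sketch, not recommendations.

```yaml
# Hypothetical fragment of an AWS::DMS::Endpoint resource (S3 target).
S3Settings:
  BucketName: amzn-s3-demo-bucket   # placeholder bucket name
  CdcMaxBatchInterval: 120          # seconds
  CdcMinFileSize: 64000             # kilobytes (roughly 64 MB)
  # With both set, a file is written as soon as either condition is met:
  # 120 seconds have elapsed, or the pending data reaches the minimum size.
```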
- CdcPath
- 
                    Specifies the folder path of CDC files. For an S3 source, this setting is required if a task captures change data; otherwise, it's optional. If CdcPath is set, AWS DMS reads CDC files from this path and replicates the data changes to the target endpoint. For an S3 target, if you set PreserveTransactions to true, AWS DMS verifies that you have set this parameter to a folder path on your S3 target where AWS DMS can save the transaction order for the CDC load. AWS DMS creates this CDC folder path in either your S3 target working directory or the S3 target location specified by BucketFolder and BucketName. For example, if you specify CdcPath as MyChangedData, and you specify BucketName as MyTargetBucket but do not specify BucketFolder, AWS DMS creates the following CDC folder path: MyTargetBucket/MyChangedData. If you specify the same CdcPath, and you specify BucketName as MyTargetBucket and BucketFolder as MyTargetData, AWS DMS creates the following CDC folder path: MyTargetBucket/MyTargetData/MyChangedData. For more information on CDC including transaction order on an S3 target, see Capturing data changes (CDC) including transaction order on the S3 target. Note: This setting is supported in AWS DMS versions 3.4.2 and later. Required: No Type: String Update requires: No interruption
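The second CdcPath example described above might look like this in a template. This is only a sketch of the S3Settings fragment, reusing the names from the description; the rest of the endpoint resource is omitted.

```yaml
# Hypothetical fragment of an AWS::DMS::Endpoint resource (S3 target).
S3Settings:
  BucketName: MyTargetBucket
  BucketFolder: MyTargetData
  CdcPath: MyChangedData
  PreserveTransactions: true
  # Per the description above, AWS DMS creates the CDC folder path:
  #   MyTargetBucket/MyTargetData/MyChangedData
```

If BucketFolder is left out, the description above gives MyTargetBucket/MyChangedData as the resulting path.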
- CompressionType
- 
                    An optional parameter. When set to GZIP, it enables the service to compress the target files. To allow the service to write the target files uncompressed, either set this parameter to NONE (the default) or don't specify the parameter at all. This parameter applies to both .csv and .parquet file formats. Required: No Type: String Allowed values: none | gzip Update requires: No interruption
- CsvDelimiter
- 
                    The delimiter used to separate columns in the .csv file for both source and target. The default is a comma. Required: No Type: String Update requires: No interruption 
- CsvNoSupValue
- 
                    This setting only applies if your Amazon S3 output files during a change data capture (CDC) load are written in .csv format. If UseCsvNoSupValue is set to true, specify a string value that you want AWS DMS to use for all columns not included in the supplemental log. If you do not specify a string value, AWS DMS uses the null value for these columns regardless of the UseCsvNoSupValue setting. Note: This setting is supported in AWS DMS versions 3.4.1 and later. Required: No Type: String Update requires: No interruption
- CsvNullValue
- 
                    An optional parameter that specifies how AWS DMS treats null values. While handling the null value, you can use this parameter to pass a user-defined string as null when writing to the target. For example, when target columns are not nullable, you can use this option to differentiate between the empty string value and the null value. So, if you set this parameter value to the empty string ("" or ''), AWS DMS treats the empty string as the null value instead of NULL. The default value is NULL. Valid values include any valid string. Required: No Type: String Update requires: No interruption
- CsvRowDelimiter
- 
                    The delimiter used to separate rows in the .csv file for both source and target. The default is a newline (\n). Required: No Type: String Update requires: No interruption
- DataFormat
- 
                    The format of the data that you want to use for output. You can choose one of the following:
                    - csv: A row-based file format with comma-separated values (.csv).
                    - parquet: Apache Parquet (.parquet), a columnar storage file format that features efficient compression and provides faster query response.
                    Required: No Type: String Allowed values: csv | parquet Update requires: No interruption
- DataPageSize
- 
                    The size of one data page in bytes. This parameter defaults to 1024 * 1024 bytes (1 MiB). This number is used for .parquet file format only. Required: No Type: Integer Update requires: No interruption 
- DatePartitionDelimiter
- 
                    Specifies a date-separating delimiter to use during folder partitioning. The default value is SLASH. Use this parameter when DatePartitionEnabled is set to true. Required: No Type: String Allowed values: SLASH | UNDERSCORE | DASH | NONE Update requires: No interruption
- DatePartitionEnabled
- 
                    When set to true, this parameter partitions S3 bucket folders based on transaction commit dates. The default value is false. For more information about date-based folder partitioning, see Using date-based folder partitioning. Required: No Type: Boolean Update requires: No interruption
- DatePartitionSequence
- 
                    Identifies the sequence of the date format to use during folder partitioning. The default value is YYYYMMDD. Use this parameter when DatePartitionEnabled is set to true. Required: No Type: String Allowed values: YYYYMMDD | YYYYMMDDHH | YYYYMM | MMYYYYDD | DDMMYYYY Update requires: No interruption
- DatePartitionTimezone
- 
                    When creating an S3 target endpoint, set DatePartitionTimezone to convert the current UTC time into a specified time zone. The conversion occurs when a date partition folder is created and a change data capture (CDC) file name is generated. The time zone format is Area/Location. Use this parameter when DatePartitionEnabled is set to true, as shown in the following example. s3-settings='{"DatePartitionEnabled": true, "DatePartitionSequence": "YYYYMMDDHH", "DatePartitionDelimiter": "SLASH", "DatePartitionTimezone":"Asia/Seoul", "BucketName": "dms-nattarat-test"}' Required: No Type: String Update requires: No interruption
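The s3-settings example above could be expressed in a CloudFormation template roughly as follows. This sketch reuses the values from that example and shows only the S3Settings fragment; the surrounding AWS::DMS::Endpoint resource is omitted.

```yaml
# Hypothetical fragment of an AWS::DMS::Endpoint resource (S3 target).
S3Settings:
  BucketName: dms-nattarat-test
  DatePartitionEnabled: true           # the three settings below apply only when this is true
  DatePartitionSequence: YYYYMMDDHH
  DatePartitionDelimiter: SLASH
  DatePartitionTimezone: Asia/Seoul    # Area/Location format
```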
- DictPageSizeLimit
- 
                    The maximum size of an encoded dictionary page of a column. If the dictionary page exceeds this size, the column is stored using an encoding type of PLAIN. This parameter defaults to 1024 * 1024 bytes (1 MiB). This size is used for .parquet file format only. Required: No Type: Integer Update requires: No interruption
- EnableStatistics
- 
                    A value that enables statistics for Parquet pages and row groups. Choose true to enable statistics, false to disable. Statistics include NULL, DISTINCT, MAX, and MIN values. This parameter defaults to true. This value is used for .parquet file format only. Required: No Type: Boolean Update requires: No interruption
- EncodingType
- 
                    The type of encoding that you're using:
                    - RLE_DICTIONARY uses a combination of bit-packing and run-length encoding to store repeated values more efficiently. This is the default.
                    - PLAIN doesn't use encoding at all. Values are stored as they are.
                    - PLAIN_DICTIONARY builds a dictionary of the values encountered in a given column. The dictionary is stored in a dictionary page for each column chunk.
                    Required: No Type: String Allowed values: plain | plain-dictionary | rle-dictionary Update requires: No interruption
- EncryptionMode
- 
                    The type of server-side encryption that you want to use for your data. This encryption type is part of the endpoint settings or the extra connections attributes for Amazon S3. You can choose either SSE_S3 (the default) or SSE_KMS. Note: For the ModifyEndpoint operation, you can change the existing value of the EncryptionMode parameter from SSE_KMS to SSE_S3. But you can't change the existing value from SSE_S3 to SSE_KMS. To use SSE_S3, you need an IAM role with permission to allow "arn:aws:s3:::dms-*" to use the following actions:
                    - s3:CreateBucket
                    - s3:ListBucket
                    - s3:DeleteBucket
                    - s3:GetBucketLocation
                    - s3:GetObject
                    - s3:PutObject
                    - s3:DeleteObject
                    - s3:GetObjectVersion
                    - s3:GetBucketPolicy
                    - s3:PutBucketPolicy
                    - s3:DeleteBucketPolicy
                    Required: No Type: String Allowed values: sse-s3 | sse-kms Update requires: No interruption
- ExpectedBucketOwner
- 
                    To specify a bucket owner and prevent sniping, you can use the ExpectedBucketOwner endpoint setting. Example: --s3-settings='{"ExpectedBucketOwner": "AWS_Account_ID"}' When you make a request to test a connection or perform a migration, S3 checks the account ID of the bucket owner against the specified parameter. Required: No Type: String Update requires: No interruption
- ExternalTableDefinition
- 
                    The external table definition. Conditional: If S3 is used as a source, then ExternalTableDefinition is required. Required: Conditional Type: String Update requires: No interruption
- GlueCatalogGeneration
- 
                    When true, allows AWS Glue to catalog your S3 bucket. Creating an AWS Glue catalog lets you use Athena to query your data. Required: No Type: Boolean Update requires: No interruption 
- IgnoreHeaderRows
- 
                    When this value is set to 1, AWS DMS ignores the first row header in a .csv file. A value of 1 turns on the feature; a value of 0 turns off the feature. The default is 0. Required: No Type: Integer Update requires: No interruption 
- IncludeOpForFullLoad
- 
                    A value that enables a full load to write INSERT operations to the comma-separated value (.csv) output files only to indicate how the rows were added to the source database. Note: AWS DMS supports the IncludeOpForFullLoad parameter in versions 3.1.4 and later. For full load, records can only be inserted. By default (the false setting), no information is recorded in these output files for a full load to indicate that the rows were inserted at the source database. If IncludeOpForFullLoad is set to true or y, the INSERT is recorded as an I annotation in the first field of the .csv file. This allows the format of your target records from a full load to be consistent with the target records from a CDC load. Note: This setting works together with the CdcInsertsOnly and the CdcInsertsAndUpdates parameters for output to .csv files only. For more information about how these settings work together, see Indicating Source DB Operations in Migrated S3 Data in the AWS Database Migration Service User Guide. Required: No Type: Boolean Update requires: No interruption
- MaxFileSize
- 
                    A value that specifies the maximum size (in KB) of any .csv file to be created while migrating to an S3 target during full load. The default value is 1,048,576 KB (1 GB). Valid values include 1 to 1,048,576. Required: No Type: Integer Update requires: No interruption 
- ParquetTimestampInMillisecond
- 
                    A value that specifies the precision of any TIMESTAMP column values that are written to an Amazon S3 object file in .parquet format. Note: AWS DMS supports the ParquetTimestampInMillisecond parameter in versions 3.1.4 and later. When ParquetTimestampInMillisecond is set to true or y, AWS DMS writes all TIMESTAMP columns in a .parquet formatted file with millisecond precision. Otherwise, DMS writes them with microsecond precision. Currently, Amazon Athena and AWS Glue can handle only millisecond precision for TIMESTAMP values. Set this parameter to true for S3 endpoint object files that are .parquet formatted only if you plan to query or process the data with Athena or AWS Glue. Note: AWS DMS writes any TIMESTAMP column values written to an S3 file in .csv format with microsecond precision. Setting ParquetTimestampInMillisecond has no effect on the string format of the timestamp column value that is inserted by setting the TimestampColumnName parameter. Required: No Type: Boolean Update requires: No interruption
- ParquetVersion
- 
                    The version of the Apache Parquet format that you want to use: parquet_1_0 (the default) or parquet_2_0. Required: No Type: String Allowed values: parquet-1-0 | parquet-2-0 Update requires: No interruption
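To show how the Parquet-related properties fit together, here is a sketch of an S3Settings fragment for Parquet output. It uses the lowercase, hyphenated forms listed under Allowed values on this page; the bucket name is a placeholder, and the numeric values simply restate the documented defaults.

```yaml
# Hypothetical fragment of an AWS::DMS::Endpoint resource (S3 target).
S3Settings:
  BucketName: amzn-s3-demo-bucket        # placeholder bucket name
  DataFormat: parquet
  ParquetVersion: parquet-2-0
  EncodingType: rle-dictionary
  CompressionType: gzip
  EnableStatistics: true
  DataPageSize: 1048576                  # 1024 * 1024 bytes (documented default)
  DictPageSizeLimit: 1048576             # 1024 * 1024 bytes (documented default)
  RowGroupLength: 10000                  # rows per row group (documented default)
  ParquetTimestampInMillisecond: true    # only if Athena or AWS Glue will read the files
```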
- PreserveTransactions
- 
                    If this setting is set to true, AWS DMS saves the transaction order for a change data capture (CDC) load on the Amazon S3 target specified by CdcPath. For more information, see Capturing data changes (CDC) including transaction order on the S3 target. Note: This setting is supported in AWS DMS versions 3.4.2 and later. Required: No Type: Boolean Update requires: No interruption
- Rfc4180
- 
                    For an S3 source, when this value is set to true or y, each leading double quotation mark has to be followed by an ending double quotation mark. This formatting complies with RFC 4180. When this value is set to false or n, string literals are copied to the target as is. In this case, a delimiter (row or column) signals the end of the field. Thus, you can't use a delimiter as part of the string, because it signals the end of the value. For an S3 target, this is an optional parameter used to set behavior to comply with RFC 4180 for data migrated to Amazon S3 using .csv file format only. When this value is set to true or y using Amazon S3 as a target, if the data has quotation marks or newline characters in it, AWS DMS encloses the entire column with an additional pair of double quotation marks ("). Every quotation mark within the data is repeated twice. The default value is true. Valid values include true, false, y, and n. Required: No Type: Boolean Update requires: No interruption
- RowGroupLength
- 
                    The number of rows in a row group. A smaller row group size provides faster reads, but as the number of row groups grows, writes become slower. This parameter defaults to 10,000 rows. This number is used for .parquet file format only. If you choose a value larger than the maximum, RowGroupLength is set to the max row group length in bytes (64 * 1024 * 1024). Required: No Type: Integer Update requires: No interruption
- ServerSideEncryptionKmsKeyId
- 
                    If you are using SSE_KMS for the EncryptionMode, provide the AWS KMS key ID. The key that you use needs an attached policy that enables IAM user permissions and allows use of the key. Here is a CLI example: aws dms create-endpoint --endpoint-identifier value --endpoint-type target --engine-name s3 --s3-settings ServiceAccessRoleArn=value,BucketFolder=value,BucketName=value,EncryptionMode=SSE_KMS,ServerSideEncryptionKmsKeyId=value Required: No Type: String Update requires: No interruption
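A template counterpart to the CLI example above might look like the following sketch. The account ID, role name, folder, bucket name, and key ID are all placeholders invented for illustration, and EncryptionMode uses the lowercase form listed under Allowed values on this page.

```yaml
# Hypothetical fragment of an AWS::DMS::Endpoint resource (S3 target).
S3Settings:
  ServiceAccessRoleArn: arn:aws:iam::123456789012:role/dms-s3-access   # placeholder role ARN
  BucketName: amzn-s3-demo-bucket                                      # placeholder bucket name
  BucketFolder: dms-output                                             # placeholder folder
  EncryptionMode: sse-kms
  ServerSideEncryptionKmsKeyId: 1234abcd-12ab-34cd-56ef-1234567890ab   # placeholder KMS key ID
```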
- ServiceAccessRoleArn
- 
                    A required parameter that specifies the Amazon Resource Name (ARN) used by the service to access the IAM role. The role must allow the iam:PassRole action. It enables AWS DMS to read and write objects from an S3 bucket. Required: No Type: String Update requires: No interruption
- TimestampColumnName
- 
                    A value that, when nonblank, causes AWS DMS to add a column with timestamp information to the endpoint data for an Amazon S3 target. Note: AWS DMS supports the TimestampColumnName parameter in versions 3.1.4 and later. AWS DMS includes an additional STRING column in the .csv or .parquet object files of your migrated data when you set TimestampColumnName to a nonblank value. For a full load, each row of this timestamp column contains a timestamp for when the data was transferred from the source to the target by DMS. For a change data capture (CDC) load, each row of the timestamp column contains the timestamp for the commit of that row in the source database. The string format for this timestamp column value is yyyy-MM-dd HH:mm:ss.SSSSSS. By default, the precision of this value is in microseconds. For a CDC load, the rounding of the precision depends on the commit timestamp supported by DMS for the source database. When the AddColumnName parameter is set to true, DMS also includes a name for the timestamp column that you set with TimestampColumnName. Required: No Type: String Update requires: No interruption
- UseCsvNoSupValue
- 
                    This setting applies if the S3 output files during a change data capture (CDC) load are written in .csv format. If this setting is set to true, AWS DMS uses the value specified by CsvNoSupValue for columns not included in the supplemental log. If this setting isn't set or is set to false, AWS DMS uses the null value for these columns. Note: This setting is supported in AWS DMS versions 3.4.1 and later. Required: No Type: Boolean Update requires: No interruption
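As a small sketch of how UseCsvNoSupValue and CsvNoSupValue pair up, the fragment below enables the setting and supplies a marker string. The marker value is an arbitrary placeholder chosen for this example.

```yaml
# Hypothetical fragment of an AWS::DMS::Endpoint resource (S3 target, .csv output).
S3Settings:
  DataFormat: csv
  UseCsvNoSupValue: true
  CsvNoSupValue: NOT_IN_SUPPLEMENTAL_LOG   # placeholder string written for columns missing from the supplemental log
```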
- UseTaskStartTimeForFullLoadTimestamp
- 
                    When set to true, this parameter uses the task start time as the timestamp column value instead of the time data is written to target. For full load, when UseTaskStartTimeForFullLoadTimestamp is set to true, each row of the timestamp column contains the task start time. For CDC loads, each row of the timestamp column contains the transaction commit time. When UseTaskStartTimeForFullLoadTimestamp is set to false, the full load timestamp in the timestamp column increments with the time data arrives at the target. Required: No Type: Boolean Update requires: No interruption
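Putting several of the properties above together, a minimal sketch of a complete S3 target endpoint with .csv output might look like this. The logical ID, bucket, folder, role ARN, and timestamp column name are placeholders invented for illustration; EndpointType and EngineName belong to the parent AWS::DMS::Endpoint resource rather than to S3Settings.

```yaml
Resources:
  S3TargetEndpoint:                      # hypothetical logical ID
    Type: AWS::DMS::Endpoint
    Properties:
      EndpointType: target
      EngineName: s3
      S3Settings:
        BucketName: amzn-s3-demo-bucket  # placeholder bucket name
        BucketFolder: dms-output         # placeholder folder
        ServiceAccessRoleArn: arn:aws:iam::123456789012:role/dms-s3-access  # placeholder role ARN
        DataFormat: csv
        CsvDelimiter: ","
        AddColumnName: true              # write column names to the .csv output
        IncludeOpForFullLoad: true       # record I in the first field for full-load rows
        CdcInsertsOnly: true             # leave CdcInsertsAndUpdates unset; both can't be true
        TimestampColumnName: dms_timestamp   # placeholder column name
```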