# UpdateCrawler Updates a crawler. If a crawler is running, you must stop it using `StopCrawler` before updating it. ## Request Syntax ``` { "Classifiers": [ "string" ], "Configuration": "string", "CrawlerSecurityConfiguration": "string", "DatabaseName": "string", "Description": "string", "LakeFormationConfiguration": { "AccountId": "string", "UseLakeFormationCredentials": boolean }, "LineageConfiguration": { "CrawlerLineageSettings": "string" }, "Name": "string", "RecrawlPolicy": { "RecrawlBehavior": "string" }, "Role": "string", "Schedule": "string", "SchemaChangePolicy": { "DeleteBehavior": "string", "UpdateBehavior": "string" }, "TablePrefix": "string", "Targets": { "CatalogTargets": [ { "ConnectionName": "string", "DatabaseName": "string", "DlqEventQueueArn": "string", "EventQueueArn": "string", "Tables": [ "string" ] } ], "DeltaTargets": [ { "ConnectionName": "string", "CreateNativeDeltaTable": boolean, "DeltaTables": [ "string" ], "WriteManifest": boolean } ], "DynamoDBTargets": [ { "Path": "string", "scanAll": boolean, "scanRate": number } ], "HudiTargets": [ { "ConnectionName": "string", "Exclusions": [ "string" ], "MaximumTraversalDepth": number, "Paths": [ "string" ] } ], "IcebergTargets": [ { "ConnectionName": "string", "Exclusions": [ "string" ], "MaximumTraversalDepth": number, "Paths": [ "string" ] } ], "JdbcTargets": [ { "ConnectionName": "string", "EnableAdditionalMetadata": [ "string" ], "Exclusions": [ "string" ], "Path": "string" } ], "MongoDBTargets": [ { "ConnectionName": "string", "Path": "string", "ScanAll": boolean } ], "S3Targets": [ { "ConnectionName": "string", "DlqEventQueueArn": "string", "EventQueueArn": "string", "Exclusions": [ "string" ], "Path": "string", "SampleSize": number } ] } } ``` ## Request Parameters For information about the parameters that are common to all actions, see [Common Parameters](CommonParameters.md). The request accepts the following data in JSON format. ** [Classifiers](#API_UpdateCrawler_RequestSyntax) ** A list of custom classifiers that the user has registered. By default, all built-in classifiers are included in a crawl, but these custom classifiers always override the default classifiers for a given classification. Type: Array of strings Length Constraints: Minimum length of 1. Maximum length of 255. Pattern: `[\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*` Required: No ** [Configuration](#API_UpdateCrawler_RequestSyntax) ** Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see [Setting crawler configuration options](https://docs.aws.amazon.com/glue/latest/dg/crawler-configuration.html). Type: String Required: No ** [CrawlerSecurityConfiguration](#API_UpdateCrawler_RequestSyntax) ** The name of the `SecurityConfiguration` structure to be used by this crawler. Type: String Length Constraints: Minimum length of 0. Maximum length of 128. Required: No ** [DatabaseName](#API_UpdateCrawler_RequestSyntax) ** The AWS Glue database where results are stored, such as: `arn:aws:daylight:us-east-1::database/sometable/*`. Type: String Required: No ** [Description](#API_UpdateCrawler_RequestSyntax) ** A description of the new crawler. Type: String Length Constraints: Minimum length of 0. Maximum length of 2048. Pattern: `[\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\r\n\t]*` Required: No ** [LakeFormationConfiguration](#API_UpdateCrawler_RequestSyntax) ** Specifies AWS Lake Formation configuration settings for the crawler. Type: [LakeFormationConfiguration](API_LakeFormationConfiguration.md) object Required: No ** [LineageConfiguration](#API_UpdateCrawler_RequestSyntax) ** Specifies data lineage configuration settings for the crawler. Type: [LineageConfiguration](API_LineageConfiguration.md) object Required: No ** [Name](#API_UpdateCrawler_RequestSyntax) ** Name of the new crawler. Type: String Length Constraints: Minimum length of 1. Maximum length of 255. Pattern: `[\u0020-\uD7FF\uE000-\uFFFD\uD800\uDC00-\uDBFF\uDFFF\t]*` Required: Yes ** [RecrawlPolicy](#API_UpdateCrawler_RequestSyntax) ** A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. Type: [RecrawlPolicy](API_RecrawlPolicy.md) object Required: No ** [Role](#API_UpdateCrawler_RequestSyntax) ** The IAM role or Amazon Resource Name (ARN) of an IAM role that is used by the new crawler to access customer resources. Type: String Required: No ** [Schedule](#API_UpdateCrawler_RequestSyntax) ** A `cron` expression used to specify the schedule (see [Time-Based Schedules for Jobs and Crawlers](https://docs.aws.amazon.com/glue/latest/dg/monitor-data-warehouse-schedule.html). For example, to run something every day at 12:15 UTC, you would specify: `cron(15 12 * * ? *)`. Type: String Required: No ** [SchemaChangePolicy](#API_UpdateCrawler_RequestSyntax) ** The policy for the crawler's update and deletion behavior. Type: [SchemaChangePolicy](API_SchemaChangePolicy.md) object Required: No ** [TablePrefix](#API_UpdateCrawler_RequestSyntax) ** The table prefix used for catalog tables that are created. Type: String Length Constraints: Minimum length of 0. Maximum length of 128. Required: No ** [Targets](#API_UpdateCrawler_RequestSyntax) ** A list of targets to crawl. Type: [CrawlerTargets](API_CrawlerTargets.md) object Required: No ## Response Elements If the action is successful, the service sends back an HTTP 200 response with an empty HTTP body. ## Errors For information about the errors that are common to all actions, see [Common Error Types](CommonErrors.md). ** CrawlerRunningException ** The operation cannot be performed because the crawler is already running. ** Message ** A message describing the problem. HTTP Status Code: 400 ** EntityNotFoundException ** A specified entity does not exist ** FromFederationSource ** Indicates whether or not the exception relates to a federated source. ** Message ** A message describing the problem. HTTP Status Code: 400 ** InvalidInputException ** The input provided was not valid. ** FromFederationSource ** Indicates whether or not the exception relates to a federated source. ** Message ** A message describing the problem. HTTP Status Code: 400 ** OperationTimeoutException ** The operation timed out. ** Message ** A message describing the problem. HTTP Status Code: 400 ** VersionMismatchException ** There was a version conflict. ** Message ** A message describing the problem. HTTP Status Code: 400 ## See Also For more information about using this API in one of the language-specific AWS SDKs, see the following: + [AWS Command Line Interface V2](https://docs.aws.amazon.com/goto/cli2/glue-2017-03-31/UpdateCrawler) + [AWS SDK for .NET V4](https://docs.aws.amazon.com/goto/DotNetSDKV4/glue-2017-03-31/UpdateCrawler) + [AWS SDK for C\$1\$1](https://docs.aws.amazon.com/goto/SdkForCpp/glue-2017-03-31/UpdateCrawler) + [AWS SDK for Go v2](https://docs.aws.amazon.com/goto/SdkForGoV2/glue-2017-03-31/UpdateCrawler) + [AWS SDK for Java V2](https://docs.aws.amazon.com/goto/SdkForJavaV2/glue-2017-03-31/UpdateCrawler) + [AWS SDK for JavaScript V3](https://docs.aws.amazon.com/goto/SdkForJavaScriptV3/glue-2017-03-31/UpdateCrawler) + [AWS SDK for Kotlin](https://docs.aws.amazon.com/goto/SdkForKotlin/glue-2017-03-31/UpdateCrawler) + [AWS SDK for PHP V3](https://docs.aws.amazon.com/goto/SdkForPHPV3/glue-2017-03-31/UpdateCrawler) + [AWS SDK for Python](https://docs.aws.amazon.com/goto/boto3/glue-2017-03-31/UpdateCrawler) + [AWS SDK for Ruby V3](https://docs.aws.amazon.com/goto/SdkForRubyV3/glue-2017-03-31/UpdateCrawler)