Crawler scheduler API
The Crawler scheduler API describes AWS Glue crawler data types, along with the API for creating, deleting, updating, and listing crawlers.
Data types
Schedule structure
A scheduling object using a cron statement to schedule
an event.
Fields
-
ScheduleExpression– UTF-8 string.A
cronexpression used to specify the schedule (see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify:cron(15 12 * * ? *). -
State– UTF-8 string (valid values:SCHEDULED|NOT_SCHEDULED|TRANSITIONING).The state of the schedule.
Operations
UpdateCrawlerSchedule action (Python: update_crawler_schedule)
Updates the schedule of a crawler using a cron expression.
Request
-
CrawlerName– Required: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.The name of the crawler whose schedule to update.
-
Schedule– UTF-8 string.The updated
cronexpression used to specify the schedule (see Time-Based Schedules for Jobs and Crawlers. For example, to run something every day at 12:15 UTC, you would specify:cron(15 12 * * ? *).
Response
No Response parameters.
Errors
EntityNotFoundExceptionInvalidInputExceptionVersionMismatchExceptionSchedulerTransitioningExceptionOperationTimeoutException
StartCrawlerSchedule action (Python: start_crawler_schedule)
Changes the schedule state of the specified crawler to SCHEDULED,
unless the crawler is already running or the schedule state is already SCHEDULED.
Request
-
CrawlerName– Required: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.Name of the crawler to schedule.
Response
No Response parameters.
Errors
EntityNotFoundExceptionSchedulerRunningExceptionSchedulerTransitioningExceptionNoScheduleExceptionOperationTimeoutException
StopCrawlerSchedule action (Python: stop_crawler_schedule)
Sets the schedule state of the specified crawler to NOT_SCHEDULED,
but does not stop the crawler if it is already running.
Request
-
CrawlerName– Required: UTF-8 string, not less than 1 or more than 255 bytes long, matching the Single-line string pattern.Name of the crawler whose schedule state to set.
Response
No Response parameters.
Errors
EntityNotFoundExceptionSchedulerNotRunningExceptionSchedulerTransitioningExceptionOperationTimeoutException