Labeling images with an Amazon SageMaker AI Ground Truth job
With Amazon SageMaker AI Ground Truth, you can use workers from either Amazon Mechanical Turk, a vendor company that you choose, or an internal, private workforce along with machine learning that allows you to create a labeled set of images. Amazon Rekognition Custom Labels imports SageMaker AI Ground Truth manifest files from an Amazon S3 bucket that you specify.
Amazon Rekognition Custom Labels supports the following SageMaker AI Ground Truth tasks.
The files you import are the images and a manifest file. The manifest file contains label and bounding box information for the images you import.
Amazon Rekognition needs permissions to access the Amazon S3 bucket where your images are stored. If you are using the console bucket set up for you by Amazon Rekognition Custom Labels, the required permissions are already set up. If you are not using the console bucket, see Accessing external Amazon S3 Buckets.
Creating a manifest file with a SageMaker AI Ground Truth job (Console)
The following procedure shows you how to create a dataset by using images labeled by a SageMaker AI Ground Truth job. The job output files are stored in your Amazon Rekognition Custom Labels console bucket.
To create a dataset using images labeled by a SageMaker AI Ground Truth job (console)
Sign in to the AWS Management Console and open the Amazon S3 console at https://console.aws.amazon.com/s3/
. -
In the console bucket, create a folder to hold your training images.
Note
The console bucket is created when you first open the Amazon Rekognition Custom Labels console in an AWS Region. For more information, see Managing an Amazon Rekognition Custom Labels project.
-
Upload your images to the folder that you just created.
-
In the console bucket, create a folder to hold the output of the Ground Truth job.
-
Open the SageMaker AI console at https://console.aws.amazon.com/sagemaker/
. -
Create a Ground Truth labeling job. You'll need the Amazon S3 URLs for the folders you created in step 2 and step 4. For more information, see Use Amazon SageMaker Ground Truth for Data Labeling.
-
Note the location of the
output.manifest
file in the folder you created in step 4. It should be in the sub-folder
.Ground-Truth-Job-Name
/manifests/output -
Follow the instructions at Creating a dataset with a SageMaker AI Ground Truth manifest file (Console) to create a dataset with the uploaded manifest file. For step 8, in .manifest file location, enter the Amazon S3 URL for the location you noted in the previous step. If you are using the AWS SDK, do Creating a dataset with a SageMaker AI Ground Truth manifest file (SDK).
-
Repeat steps 1 - 6 to create SageMaker AI Ground Truth job for your test dataset.