View the results of a batch inference job

After a batch inference job reaches the Completed status, you can extract its results from the files in the Amazon S3 bucket that you specified when you created the job. To learn how to download an S3 object, see Downloading objects. The S3 bucket contains the following files:

Amazon Bedrock generates an output JSONL file for each input JSONL file. Each line of an output file contains the model output for one input record, in the following format. On any line where inference failed, an error object replaces the modelOutput field. The format of the modelOutput JSON object depends on the model invocation type. For InvokeModel jobs, the format matches the body field in the InvokeModel response (see Inference request parameters and response fields for foundation models). For Converse jobs, the format matches the response body of the Converse API.
```
{ "recordId" : "string", "modelInput": {JSON body}, "modelOutput": {JSON body} }
```
The following example shows a possible output file.
```
{ "recordId" : "3223593EFGH", "modelInput" : {"inputText": "Roses are red, violets are"}, "modelOutput" : {"inputTextTokenCount": 8, "results": [{"tokenCount": 3, "outputText": "blue\n", "completionReason": "FINISH"}]}}
{ "recordId" : "1223213ABCD", "modelInput" : {"inputText": "Hello world"}, "error" : {"errorCode" : 400, "errorMessage" : "bad request" }}
```
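As an illustration of the record format above, the following minimal Python sketch separates successful records from failed ones after you have downloaded an output file. The function name `split_records` and the constant `SAMPLE_LINES` are hypothetical and are not part of any Amazon Bedrock API. The sketch assumes that a failed record carries an error field in place of modelOutput, as described above.

```python
import json

# Two sample lines copied from the example output file above.
SAMPLE_LINES = [
    r'{ "recordId" : "3223593EFGH", "modelInput" : {"inputText": "Roses are red, violets are"}, "modelOutput" : {"inputTextTokenCount": 8, "results": [{"tokenCount": 3, "outputText": "blue\n", "completionReason": "FINISH"}]}}',
    r'{ "recordId" : "1223213ABCD", "modelInput" : {"inputText": "Hello world"}, "error" : {"errorCode" : 400, "errorMessage" : "bad request" }}',
]


def split_records(lines):
    """Parse JSONL lines and separate successful records from failed ones.

    Hypothetical helper: a record counts as failed when it contains an
    "error" field instead of "modelOutput".
    """
    successes, errors = [], []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines, such as a trailing newline
        record = json.loads(line)
        if "error" in record:
            errors.append(record)
        else:
            successes.append(record)
    return successes, errors


if __name__ == "__main__":
    ok, failed = split_records(SAMPLE_LINES)
    for record in ok:
        print(record["recordId"], "succeeded")
    for record in failed:
        print(record["recordId"], "failed:", record["error"]["errorMessage"])
```

To process a downloaded file instead of the sample lines, you can pass an open file object to `split_records`, because iterating over a text file yields one line at a time.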
The bucket also contains a manifest.json.out file that summarizes the batch inference job.
```
{
    "totalRecordCount" : number, 
    "processedRecordCount" : number,
    "successRecordCount": number,
    "errorRecordCount": number,
    "inputTokenCount": number,
    "outputTokenCount" : number
}
```
The fields are described below:
- totalRecordCount – The total number of records submitted to the batch inference job.
- processedRecordCount – The number of records processed, which includes both successes and errors.
- successRecordCount – The number of records successfully processed.
- errorRecordCount – The number of records that caused errors during processing.
- inputTokenCount – The total number of input tokens submitted to the batch inference job.
- outputTokenCount – The total number of output tokens generated by the batch inference job.
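The relationships between these fields can be checked with a short Python sketch. The function name `summarize_manifest` and the sample counts are hypothetical and chosen for illustration only. The sketch relies on the description above: processedRecordCount includes both successes and errors.

```python
import json

# Hypothetical manifest contents; the numbers are illustrative only.
SAMPLE_MANIFEST = """
{
    "totalRecordCount": 100,
    "processedRecordCount": 98,
    "successRecordCount": 95,
    "errorRecordCount": 3,
    "inputTokenCount": 1200,
    "outputTokenCount": 3400
}
"""


def summarize_manifest(manifest_text):
    """Derive a few summary values from the text of manifest.json.out."""
    manifest = json.loads(manifest_text)
    processed = manifest["processedRecordCount"]
    succeeded = manifest["successRecordCount"]
    failed = manifest["errorRecordCount"]
    return {
        # Processed records should equal successes plus errors.
        "consistent": processed == succeeded + failed,
        # Records that were submitted but not processed.
        "unprocessed": manifest["totalRecordCount"] - processed,
        # Share of processed records that ended in an error.
        "error_rate": failed / processed if processed else 0.0,
    }


if __name__ == "__main__":
    print(summarize_manifest(SAMPLE_MANIFEST))
```

A nonzero unprocessed count or a high error rate suggests that the per-record error objects in the output JSONL files are worth inspecting.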
