File paths in Amazon S3 - Media2Cloud on AWS

File paths in Amazon S3

When a customer deploys Media2Cloud on AWS, the solution creates four different Amazon Simple Storage Service (Amazon S3) buckets to store assets:

  • A web bucket that stores the static HTML, CSS, and JavaScript files for the web interface.

  • An ingestion bucket that stores your original source files.

  • A proxy bucket that stores all the files and assets generated by the solution, including:

    • Video proxies and thumbnail images generated by AWS Elemental MediaConvert

    • MediaInfo XML output generated by MediaInfo

    • JSON documents generated by EXIFTool

    • Machine learning metadata generated by AWS AI services

    • Additional WebVTT tracks and analysis JSON documents created by the solution

  • A logs bucket that stores all access logs for the web bucket, the ingestion bucket, the proxy bucket, and Amazon CloudFront standard logs.

Table 1: File types and Amazon S3 file paths

File type File path
Web static assets S3://<web-bucket>/
Uploaded file S3://<ingest-bucket>/<file-basename>/<filename>
Technical metadata such as mediainfo and EXIF results S3://<proxy-bucket>/<uuid>/<filename>/mediainfo/

S3://<proxy-bucket>/<uuid>/<filename>/imageinfo/

Proxy files, thumbnails generated by MediaConvert S3://<proxy-bucket>/<uuid>/<filename>/transcode/proxy S3://<proxy-bucket>/<uuid>/<filename>/transcode/aiml

S3://<proxy-bucket>/<uuid>/<filename>/transcode/frameCapture

All AI/ML analysis results S3://<proxy-bucket>/<uuid>/<filename>/raw/
Raw AI/ML analysis results from Amazon AI services S3://<proxy-bucket>/<uuid>/<filename>/raw/<date-time>/comprehend/

S3://<proxy-bucket>/<uuid>/<filename>/raw/<date-time>/rekognition/

S3://<proxy-bucket>/<uuid>/<filename>/raw/<date-time>/transcribe/

S3://<proxy-bucket>/<uuid>/<filename>/raw/<date-time>/textract/

WebVTT tracks generated by analysis state machine

S3://<proxy-bucket>/<uuid>/<filename>/vtt/celeb/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/vtt/face/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/vtt/moderation/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/vtt/person/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/vtt/label/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/vtt/segment/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/vtt/transcribe/<name>.vtt

Metadata JSON documents generated by analysis state machine

S3://<proxy-bucket>/<uuid>/<filename>/analysis/vtt/celeb/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/analysis/vtt/entity/<name>.vtt

S3://<proxy-bucket>/<uuid>/<filename>/analysis/vtt/keyphrase/<name>.vtt

Access logs for the web bucket, ingest bucket, proxy bucket, and CloudFront OIA

S3://<logs-bucket>/access_logs_cloudfront/uuid.<date>.uuid.gz

S3://<logs-bucket>/access_logs_ingest_bucket/<date-time>-uuid

S3://<logs-bucket>/access_proxy_bucket/<date-time>-uuid

S3://<logs-bucket>/access_logs_web_bucket/<date-time>-uuid