AWS runtime for Apache Spark (emr-spark-8.0.0) on EKS
This page describes the new and updated functionality for Amazon EMR that is specific to the Amazon EMR on EKS deployment. For details about Amazon EMR running on Amazon EC2 and about the Amazon EMR Spark 8.0.0 release in general, see AWS runtime for Apache Spark (emr-spark-8.0.0) in the Amazon EMR Release Guide.
AWS runtime for Apache Spark (emr-spark-8.0.0) on EKS
The following emr-spark-8.0.0 releases are available for AWS runtime for Apache Spark on EKS.
spark/emr-spark-8.0.0-latest
spark/emr-spark-8.0.0-20260421
notebook-spark/emr-spark-8.0.0-latest
notebook-spark/emr-spark-8.0.0-20260421
notebook-python/emr-spark-8.0.0-latest
notebook-python/emr-spark-8.0.0-20260421
livy/emr-spark-8.0.0-latest
livy/emr-spark-8.0.0-20260421
Release notes
Release notes for AWS runtime for Apache Spark (emr-spark-8.0.0) on EKS:
-
Supported applications ‐ AWS SDK for Java 2.41.32, Apache Spark 4.0.2-amzn-0, Apache Hudi 1.1.0-amzn-0, Apache Iceberg 1.10.1-amzn-0, Delta Lake 4.0.0-amzn-1-spark
-
Supported components ‐
emr-ddb,emr-goodies,hadoop-client,hudi,hudi-spark,iceberg,spark-kubernetes. -
Supported configuration classifications
For use with StartJobRun and CreateManagedEndpoint APIs:
Classifications Descriptions core-siteChange values in the
core-site.xmlHadoop file.spark-metricsChange values in the
metrics.propertiesSpark file.spark-defaultsChange values in the
spark-defaults.confSpark file.spark-envChange values in the Spark environment.
spark-hive-siteChange values in the
hive-site.xmlSpark file.spark-log4j2Change values in the
log4j2.propertiesSpark file.emr-job-submitterConfiguration for job submitter pod.
For use specifically with CreateManagedEndpoint APIs:
Classifications Descriptions jeg-configChange values in Jupyter Enterprise Gateway
jupyter_enterprise_gateway_config.pyfile.jupyter-kernel-overridesChange value for the Kernel Image in Jupyter Kernel Spec file.
Configuration classifications allow you to customize applications. These often correspond to a configuration XML file for the application, such as
spark-hive-site.xml. For more information, see Configure Applications.
Changes and features
The following features are included with the emr-spark-8.0.0 release of AWS runtime for Apache Spark on EKS:
-
Apache Spark 4.0.2 GA – First production-ready release of Spark 4.x on Amazon EMR on EKS, featuring ANSI SQL mode, SQL PIPE syntax, VARIANT data type, SQL scripting, and streaming enhancements.
-
Python 3.11 default – Python 3.11 is the default for PySpark and Spark workloads. Python 3.12 and 3.13 are also available.