SageMaker Data Processing analyzes, prepares, integrates, and orchestrates your data with processing capabilities from Amazon Athena, Amazon EMR, AWS Glue, and Amazon Managed Workflows for Apache Airflow (Amazon MWAA). You can use open source data-processing frameworks such as Apache Spark, analyze data at scale with Trino, and build real-time analytics with Apache Flink and Apache Spark.
SageMaker Data Processing brings together Amazon EMR, Athena, AWS Glue, and Amazon MWAA.
SageMaker Data Processing helps you explore data, build data-transformation jobs, and orchestrate and deploy data pipelines at scale. It offers cost-effective versions of Apache Spark, Apache Airflow, Apache Flink, Trino, and more that are API-compatible with their open source counterparts and improve performance, delivering faster insights than traditional open source systems. SageMaker Data Processing provides access to your data sources in Amazon SageMaker Lakehouse through zero-ETL integrations, federated querying capabilities, and connectors.
No, you do not need to migrate to SageMaker. You can continue to use Amazon EMR, Athena, AWS Glue, and Amazon MWAA as you do today. However, we recommend that you get started with SageMaker to use unified tooling, built-in data governance, and simplified SageMaker Lakehouse architectures.
There is no impact on the code, queries, jobs, and other resources that you’ve created and used with Amazon EMR, Athena, or AWS Glue. You can continue to use these services for new workloads, if you prefer. Resources created in these services, such as Amazon EMR on Amazon Elastic Compute Cloud (Amazon EC2) clusters, are visible in SageMaker to simplify the development of analytics and AI applications. The existing development experiences built into Amazon EMR, AWS Glue, and Athena will continue to exist alongside a new development experience within SageMaker.
The latest version of AWS Glue, AWS Glue 5.0, is available in SageMaker. AWS Glue 5.0 accelerates data-processing workloads and delivers the latest performance-optimized Apache Spark 3.5.2 runtime so you can develop, run, and scale for faster insights. To learn more, visit AWS Glue.
Each AWS service that you use through SageMaker is subject to its own pricing. For details, consult the AWS pricing pages for Athena, Amazon EMR, AWS Glue, and Amazon MWAA.