View a markdown version of this page

FAQ - AWS Prescriptive Guidance

FAQ

What's the difference between a data pipeline and an ML pipeline?

A data pipeline is a data engineering pipeline that typically ingests, cleans, and processes data to make it compatible or optimized for machine learning (ML) or other analytical and visualization processes. An ML pipeline typically automates the creation of an ML model.

What's the difference between horizontal and vertical scaling?

Horizontal scaling is the addition of hardware to increase processing power and enable the use of clusters (for example, by using Amazon EMR or AWS Glue). Vertical scaling is the increase in processing power of existing hardware (for example, increasing the RAM capacity of an EC2 instance).