| Data Warehouse | BigQuery | Amazon Redshift | Azure Synapse Analytics | Analytical SQL warehouse for large-scale queries. |
| Data Lake Storage | Cloud Storage (GCS) | Amazon S3 | Azure Data Lake Storage (ADLS) Gen2 / Blob Storage | Object storage for raw, semi-structured, and structured data. |
| Batch/Stream Processing | Dataflow (Apache Beam) | Kinesis Data Analytics / AWS Glue Streaming ETL | Azure Stream Analytics / Data Factory Mapping Data Flows | Serverless batch and streaming ETL. |
| Managed Hadoop/Spark | Dataproc | Amazon EMR | Azure HDInsight / Synapse Spark Pools | Managed Hadoop/Spark/Hive/Presto clusters. |
| Data Orchestration | Cloud Composer (Airflow) | AWS MWAA / Step Functions | Azure Data Factory | Workflow orchestration and scheduling. |
| Real-time Messaging | Pub/Sub | Kinesis Data Streams / SNS | Event Hubs / Service Bus | Pub/sub messaging for real-time ingestion. |
| ETL/ELT Service | Dataprep (Trifacta) | AWS Glue DataBrew | ADF Wrangling Data Flows | No-code data preparation for analytics. |
| Database Migration | Database Migration Service (DMS) | AWS DMS | Azure Database Migration Service | Managed migration of databases to the cloud. |
| NoSQL Wide-Column Store | Cloud Bigtable | Amazon DynamoDB | Cosmos DB (Cassandra API) | Low-latency, high-throughput NoSQL store. |
| Machine Learning | Vertex AI | SageMaker | Azure Machine Learning | Managed ML platform for training, deployment, and MLOps. |