Cloud computing eliminates the capital expenditure of building and maintaining data centers, enabling businesses to access and pay for only the resources they use. Its scalable nature allows for quick adjustment to changing business needs. Mirroring data simplifies data recovery and business continuity. By providing access to resources from anywhere, cloud computing also supports remote work and collaboration.
The big three public clouds - Amazon Web Services, Microsoft Azure, and Google Cloud Platform - have hundreds of services, and it can be hard to determine what you need for any given project.
This cheat sheet provides a comparison of the main services needed for data and AI-related work, from data engineering to data analysis and data science, to creating data applications.
Storage

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Object storage | For storing any files you regularly use | | | |
| Archive storage | Low cost (but slower) storage for rarely used files | S3 Glacier Instant, Glacier Flexible, Glacier Deep Archive tiers | | |
| File storage | For storing files needing hierarchical organization | | | |
| Block storage | Disk volumes attached to virtual machines | | | |
| Hybrid storage | Move files between on-prem & cloud | | | |
| Edge/offline storage | Move offline data to the cloud | | | |
| Backup | Prevent data loss | | | |

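
The archive storage trade-off (lower storage cost, slower and often costlier access) comes down to simple arithmetic. The sketch below compares two tiers for a given usage pattern. The tier names and all prices are hypothetical placeholders chosen for illustration, not any provider's actual rates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageTier:
    name: str
    storage_price_per_gb: float    # hypothetical monthly price per GB stored
    retrieval_price_per_gb: float  # hypothetical price per GB read back


def monthly_cost(tier: StorageTier, stored_gb: float, retrieved_gb: float) -> float:
    """Monthly cost = storage charge + retrieval charge."""
    return (tier.storage_price_per_gb * stored_gb
            + tier.retrieval_price_per_gb * retrieved_gb)


def cheapest_tier(tiers, stored_gb, retrieved_gb):
    """Return the tier with the lowest monthly cost for this usage pattern."""
    return min(tiers, key=lambda t: monthly_cost(t, stored_gb, retrieved_gb))


# Hypothetical tiers; real prices vary by provider, region, and tier.
OBJECT = StorageTier("object", storage_price_per_gb=0.020, retrieval_price_per_gb=0.00)
ARCHIVE = StorageTier("archive", storage_price_per_gb=0.004, retrieval_price_per_gb=0.03)

# 1 TB stored, but only 10 GB read back per month: archive is cheaper.
rarely_read = cheapest_tier([OBJECT, ARCHIVE], stored_gb=1000, retrieved_gb=10)

# 1 TB stored and all of it read back every month: object storage is cheaper.
often_read = cheapest_tier([OBJECT, ARCHIVE], stored_gb=1000, retrieved_gb=1000)
```

With these made-up numbers, archive storage only pays off when retrieval is rare, which is why it suits rarely used files.
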
Database

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Relational DB management | Standard SQL DB (PostgreSQL, MySQL, SQL Server, etc.) | | | |
| NoSQL: Key-value | Redis-like DBs for semi-structured data | | | |
| NoSQL: Document | MongoDB/CouchDB-like DBs for hierarchical JSON data | | | |
| NoSQL: Column store | Cassandra/HBase-like DBs for structured hierarchical data | | | |
| NoSQL: Graph | Neo4j-like DBs for connected data | | N/A | N/A |
| Caching | Redis/Memcached-like in-memory stores for fast access to data | | | |
| Time series DB | DB tuned for time series data | | | |
| Blockchain | Dogecoin, etc. | | Blockchain Service, Blockchain Workbench, Confidential Ledger | N/A |

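
To make the database categories more concrete, here is a minimal sketch of the same kind of data in four shapes: relational, key-value, document, and graph. It uses only the Python standard library (`sqlite3` and plain dictionaries stand in for managed services), and the user names and fields are invented for illustration.

```python
import json
import sqlite3

# Relational: fixed columns, queried with SQL.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.execute("INSERT INTO users VALUES (1, 'Ada', 'London')")
relational_row = conn.execute("SELECT name, city FROM users WHERE id = 1").fetchone()
conn.close()

# Key-value: an opaque value looked up by a single key.
key_value_store = {"user:1": json.dumps({"name": "Ada", "city": "London"})}
cached_value = json.loads(key_value_store["user:1"])

# Document: nested JSON-like records, filtered by field.
documents = [
    {"_id": 1, "name": "Ada", "address": {"city": "London"}, "tags": ["admin"]},
    {"_id": 2, "name": "Grace", "address": {"city": "New York"}, "tags": []},
]
londoners = [doc["name"] for doc in documents if doc["address"]["city"] == "London"]

# Graph: nodes plus edges, queried by following connections.
follows = {"Ada": {"Grace"}, "Grace": {"Ada", "Linus"}, "Linus": set()}
friends_of_friends = {
    fof for friend in follows["Ada"] for fof in follows[friend]
} - {"Ada"}
```

The shape of the query you run most often (joins, single-key lookups, nested fields, or relationship traversals) is usually what points to one category over another.
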
Compute

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Virtual machines | Software-emulated computers | | | |
| Spot virtual machines | Cheaper VMs that the provider can reclaim at short notice | | | |
| Autoscaling | Adjust resources to match demand | | | |
| Functions as a service (Serverless computing) | Execute code chunks without worrying about infrastructure | | | |
| Platform as a service | Manage applications without worrying about infrastructure | | App Service, Cloud Services, Spring Cloud, Red Hat OpenShift | |
| Batch scheduling | Run code at specified times | | | |
| Isolated servers | VMs on dedicated physical servers, for high security | | | |
| On-premises/Edge devices | Cloud services on your own hardware | | | N/A |
| Quantum computing | Determine if cat is alive or dead | | | N/A |

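
Autoscaling, "adjust resources to match demand", can be sketched as a proportional rule: scale the instance count by the ratio of the observed metric to its target, then clamp the result. This is an illustrative simplification, not any provider's actual algorithm. The function name, parameters, and limits are assumptions made for the example.

```python
import math


def desired_instances(current: int, metric: float, target: float,
                      min_instances: int = 1, max_instances: int = 10) -> int:
    """Scale the instance count in proportion to metric / target.

    `metric` and `target` share a unit, e.g. average CPU utilisation in percent.
    """
    if current < 1 or target <= 0:
        raise ValueError("current must be >= 1 and target must be > 0")
    proposed = math.ceil(current * metric / target)
    # Clamp so that the group never scales to zero or grows without bound.
    return max(min_instances, min(max_instances, proposed))


# 4 instances at 90% CPU with a 60% target: scale out.
scale_out = desired_instances(current=4, metric=90, target=60)

# 4 instances at 30% CPU with a 60% target: scale in.
scale_in = desired_instances(current=4, metric=30, target=60)
```

Real autoscalers typically add cooldown periods and smoothing so that short spikes do not cause the instance count to flap.
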
Analytics

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Data warehouse | Centralized platform for all your data | | | |
| Big data platform | Run Spark, Hadoop, Hive, Presto, etc. | | | |
| Business analytics | Dashboards and visualization | | | |
| Real-time analytics | Streaming data analytics | Kinesis Data Analytics, Kinesis Data Streams, Managed Streaming for Kafka | | |
| Extract-Transform-Load (ETL) | Preprocessing and importing data | | | |
| Workflow orchestration | Build data and model pipelines | | | |
| Data lake creation | Import data into a lake | | | |
| Managed search | Enterprise search | | | |
| Data catalog | Metadata management | | | |

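
Workflow orchestration boils down to running tasks in an order that respects their dependencies. The sketch below models a small pipeline as a directed acyclic graph and runs it with the standard library's `graphlib.TopologicalSorter` (Python 3.9+). The task names are hypothetical, and the tasks themselves are stand-ins that only record that they ran.

```python
from graphlib import TopologicalSorter

# Each task maps to the set of tasks that must finish before it starts.
pipeline = {
    "extract": set(),
    "transform": {"extract"},
    "load": {"transform"},
    "train_model": {"load"},
    "build_dashboard": {"load"},
}


def run_pipeline(dag, tasks):
    """Run every task once, in an order that respects the dependencies."""
    completed = []
    for name in TopologicalSorter(dag).static_order():
        tasks[name]()  # a real orchestrator would add retries, logging, scheduling
        completed.append(name)
    return completed


# Stand-in tasks: each one appends its own name to a shared log.
log = []
tasks = {name: (lambda n=name: log.append(n)) for name in pipeline}
order = run_pipeline(pipeline, tasks)
```

The relative order of `train_model` and `build_dashboard` is not fixed, because neither depends on the other. Orchestration services exploit exactly that freedom to run independent steps in parallel.
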
ML & AI

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Machine learning | Train, validate, and deploy ML models | | | |
| Jupyter notebooks | Write data analyses and reports | | | |
| Data science/machine learning VM | Virtual machines tailored to data work | | | |
| AutoML | Automatically build ML models | | | |
| Natural language processing AI | Analyze text data | | | |
| Recommendation AI | Product recommendation engine | | | |
| Document capture | Extract text from printed text & handwriting | | | |
| Computer vision | Image classification, object detection & other AI with image data | | | |
| Speech to text | Speech transcription | | Cognitive Services for Speech to Text, Cognitive Services for Speaker Recognition | |
| Text to speech | Speech generation | | | |
| Translation AI | Convert text between human languages | | | |
| Video intelligence | Video indexing and asset search | | | |
| AI agents | Virtual assistants and chatbots | | Bot Service, Cognitive Services for Conversational Language Understanding | |
| Human-in-the-loop | Human-based quality control for AI | | N/A | |

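
The "train, validate, deploy" loop that managed machine learning services wrap can be shown at toy scale. The sketch below fits a straight line by ordinary least squares on a training split and measures the error on held-out points. It uses pure Python rather than any cloud SDK, and the data is made up to roughly follow y = 2x + 1.

```python
from statistics import mean


def fit_line(xs, ys):
    """Ordinary least squares fit of y = slope * x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    covariance = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    variance = sum((x - x_bar) ** 2 for x in xs)
    slope = covariance / variance
    return slope, y_bar - slope * x_bar


def mean_absolute_error(model, xs, ys):
    """Average absolute gap between predictions and observed values."""
    slope, intercept = model
    return mean(abs(slope * x + intercept - y) for x, y in zip(xs, ys))


# Made-up data that roughly follows y = 2x + 1.
xs = [0, 1, 2, 3, 4, 5, 6, 7]
ys = [1.1, 2.9, 5.2, 7.0, 8.8, 11.1, 13.0, 15.1]

# Train on the first six points; hold out the last two for validation.
train_x, valid_x = xs[:6], xs[6:]
train_y, valid_y = ys[:6], ys[6:]

model = fit_line(train_x, train_y)
validation_error = mean_absolute_error(model, valid_x, valid_y)
```

Measuring error on data the model never saw during fitting is the step that AutoML and managed training services automate at much larger scale.
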
Networking & Content Delivery

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Content delivery network | Serve content to users | | | |
| Application Programming Interface (API) management | Build and deploy APIs | | | |
| Domain Name System (DNS) | Route end users to applications | | | |
| Load balancing | Distribute work evenly across machines | | | |

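
One simple way to "distribute work evenly across machines" is round-robin: each new request goes to the next server in a fixed rotation. The sketch below is a minimal illustration of that policy. The class and server names are hypothetical, and managed load balancers also offer other policies, such as least-connections routing, and health checks.

```python
from collections import Counter
from itertools import cycle


class RoundRobinBalancer:
    """Hand each incoming request to the next server in a fixed rotation."""

    def __init__(self, servers):
        servers = list(servers)
        if not servers:
            raise ValueError("at least one server is required")
        self._rotation = cycle(servers)

    def route(self, request):
        """Return the (server, request) pair chosen for this request."""
        return next(self._rotation), request


# Hypothetical server names, for illustration only.
balancer = RoundRobinBalancer(["vm-a", "vm-b", "vm-c"])
assignments = [balancer.route(f"req-{i}")[0] for i in range(9)]
load = Counter(assignments)
```

Nine requests over three servers leave each server with three requests, which is the even spread the table describes.
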
Containers

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Managed containers | Run and deploy containers | | | |
| Container registry | Manage container images | | | |

Management & Security, Identity

| Service type | Description | AWS | Azure | GCP |
| --- | --- | --- | --- | --- |
| Access management | User permissions and authentication | | | |
| Activity tracking | Track user activity | | Monitor Activity Log | |
| Security | Protect your data, network and applications | | | |
| Monitoring | Monitor network traffic and detect anomalies | | | |
| Automation | Perform processes automatically | | | |
| Cost optimization | Reduce your cloud spend | | | |
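
Access management is, at its core, a lookup from users to roles to permissions, with everything denied unless explicitly granted. The sketch below shows that idea with plain dictionaries. The roles, users, and permission strings are hypothetical and far simpler than a real cloud IAM policy.

```python
# Hypothetical roles and permissions; real IAM policies are far richer.
ROLE_PERMISSIONS = {
    "viewer": {"storage:read"},
    "analyst": {"storage:read", "warehouse:query"},
    "admin": {"storage:read", "storage:write", "warehouse:query", "users:manage"},
}

# Hypothetical user-to-role assignments.
USER_ROLES = {"ada": {"analyst"}, "grace": {"viewer"}}


def is_allowed(user: str, permission: str) -> bool:
    """Deny by default; allow only if one of the user's roles grants the permission."""
    return any(
        permission in ROLE_PERMISSIONS.get(role, set())
        for role in USER_ROLES.get(user, set())
    )


ada_can_query = is_allowed("ada", "warehouse:query")      # analyst role grants it
grace_can_query = is_allowed("grace", "warehouse:query")  # viewer role does not
```

Unknown users and unknown roles fall through to an empty permission set, so a missing entry fails closed rather than open.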