What is the difference between Route 53 Geolocation and Geoproximity routing?

Geolocation routes based on the user's mapped geographic location (continent/country/state) to a specific endpoint you configure. Geoproximity routes to the nearest resource based on geographic distance and allows you to shift traffic using bias values (-99 to +99) to expand or shrink coverage areas.

When should I use Weighted routing vs Latency-based routing?

Use Weighted routing when you want to control the exact percentage of traffic to each endpoint (blue/green deployments, A/B testing, gradual migrations). Use Latency-based routing when you want users automatically directed to the fastest region without manual traffic distribution.

Can Route 53 routing policies be combined?

Yes. Using alias records and Route 53 Traffic Flow, you can chain policies. For example: Latency → Weighted routes to the nearest region first, then splits traffic within that region. Failover → Latency provides DR with latency-optimized primary routing.

When should I use DynamoDB vs DocumentDB?

Use DynamoDB for extreme scale with simple key-value access patterns, serverless applications, and single-digit millisecond latency. Use DocumentDB when you need MongoDB compatibility, complex nested documents with flexible querying, aggregation pipelines, and geospatial queries.

When should I use Amazon Neptune?

Use Neptune when your data is highly connected with complex relationships that require multi-hop traversals. Common use cases include social networks, fraud detection, knowledge graphs, recommendation engines, and network topology analysis.

What is the difference between DynamoDB and DocumentDB?

DynamoDB is a serverless key-value/document database optimized for massive scale with simple access patterns (400KB item limit). DocumentDB is MongoDB-compatible with rich query support, aggregation pipelines, and 16MB document size, but requires provisioned instances or Elastic Clusters.

When should I use AWS Step Functions?

Use Step Functions when you need to coordinate multiple steps in a specific order, require error handling with retries and fallbacks, need visibility into execution progress, require human approval or callbacks, or for long-running processes and batch processing millions of items.

When should I use Amazon EventBridge?

Use EventBridge to react to events from AWS services or SaaS apps, when services should be decoupled, for content-based routing to different targets, scheduling (cron/one-time), fan-out patterns, or cross-account event routing.

Can Step Functions and EventBridge be used together?

Yes. A common pattern is EventBridge detecting an event (like an S3 upload) and triggering a Step Functions workflow that orchestrates multi-step processing with error handling and state tracking.

What is the difference between CloudWatch, CloudTrail, and Config?

CloudWatch monitors performance (how is it performing?), CloudTrail records API activity (who did what?), and Config tracks resource configuration changes (what changed and is it compliant?).

When should I use AWS CloudWatch?

Use CloudWatch to monitor CPU/memory/disk metrics, set alarms for thresholds, centralize application logs, create dashboards, and track SLOs.

When should I use AWS CloudTrail?

Use CloudTrail to audit API calls, investigate security incidents, meet compliance requirements for activity logging, and detect unusual API patterns with CloudTrail Insights.

When should I use AWS KMS?

Choose KMS for encrypting data in AWS services (S3, EBS, RDS), envelope encryption, and most standard encryption needs. KMS integrates with 100+ AWS services natively.

When should I use AWS CloudHSM?

Choose CloudHSM for regulatory compliance requiring dedicated single-tenant HSM (FIPS 140-2 Level 3), custom cryptographic operations, SSL offloading, or running your own certificate authority.

When should I use Secrets Manager vs Parameter Store?

Use Secrets Manager for database credentials needing automatic rotation and lifecycle management. Use Parameter Store for application configuration, feature flags, and non-rotating secrets (free tier available).

When should I use VPC Peering?

Choose VPC Peering for a small number of VPCs (2-5) needing simple point-to-point connectivity. It has no hourly charge (data transfer only) and no transitive routing.

When should I use Transit Gateway?

Choose Transit Gateway for many VPCs needing full mesh connectivity, centralized VPN/Direct Connect, shared services VPC, and network segmentation with route tables. Supports transitive routing.

When should I use AWS PrivateLink?

Choose PrivateLink to expose a specific service to other accounts/VPCs without full network access, when you have overlapping CIDRs, or for zero-trust architecture with minimal exposure.

AWS Route 53 Routing Policies Comparison

June 15, 2026 ~ Last updated on : July 14, 2026 ~ Kiro Agent

AWS Route 53 Routing Policies Comparison

Amazon Route 53 supports 7 routing policies that determine how DNS queries are answered.
Choosing the right routing policy depends on whether you need failover, latency optimization, geographic restrictions, or traffic distribution.
Multiple policies can be combined using alias records and health checks for complex routing architectures.

Route 53 Routing Policies Comparison

Policy	Use Case	How It Works	Health Checks
Simple	Single resource, no special routing	Returns all values in random order	No (can’t attach)
Weighted	Traffic distribution, blue/green, A/B testing	Routes based on assigned weights (0-255)	Yes
Latency-based	Best performance for global users	Routes to region with lowest latency	Yes
Failover	Active-passive disaster recovery	Primary until unhealthy, then secondary	Yes (required for primary)
Geolocation	Content localization, compliance, restrictions	Routes based on user’s geographic location	Yes
Geoproximity	Route based on resource location + bias	Routes to nearest resource; bias expands/shrinks coverage	Yes
Multivalue Answer	Simple load balancing with health checks	Returns up to 8 healthy records randomly	Yes
IP-based	Route by client IP/CIDR (ISP optimization)	Routes based on client subnet CIDR mapping	Yes

Simple Routing

Routes traffic to a single resource (or multiple values returned in random order).
Cannot attach health checks to simple routing records.
If multiple values are returned, client chooses one randomly (client-side load balancing).
Can only have one record per name with simple routing.
Best for: Single server, single resource behind a load balancer.

Weighted Routing

Routes traffic based on weights assigned to records (0-255).
Traffic proportion = record weight / sum of all weights for the same name.
Setting weight to 0 stops traffic to that resource (useful for maintenance).
If all records have weight 0, traffic is distributed equally.
Supports health checks – unhealthy records removed from responses.
Best for: Blue/green deployments (90/10 split), A/B testing, gradual migrations, load distribution across regions.

Latency-based Routing

Routes traffic to the region with the lowest network latency for the user.
Latency is measured between the user’s DNS resolver and AWS regions.
Requires resources in multiple AWS regions.
Supports health checks – if lowest-latency resource is unhealthy, routes to next-best.
Latency data is updated periodically by AWS (not real-time per request).
Best for: Global applications deployed in multiple regions needing best user experience.

Failover Routing

Routes traffic to primary resource when healthy, secondary when primary fails health check.
Active-passive configuration – only one designation per record set (primary or secondary).
Health check required on primary record; optional on secondary.
Secondary can point to a static S3 website (maintenance page) or another resource.
Can be combined with other routing policies using alias records.
Best for: Disaster recovery, maintenance pages, active-passive HA architectures.

Geolocation Routing

Routes traffic based on geographic location of the user (continent, country, or US state).
Most specific match wins – state > country > continent > default.
A default record is recommended – users from unmapped locations get this response.
If no default and no match, Route 53 returns “no answer”.
Does NOT route to closest resource – routes to the location you configure (use geoproximity for nearest).
Best for: Content localization (language), compliance (restrict access by country), serving region-specific content.

Geoproximity Routing

Routes traffic based on geographic distance between user and resources.
Bias values (-99 to +99) expand or shrink the geographic area that routes to a resource.
Positive bias = attracts more traffic (expands coverage area).
Negative bias = repels traffic (shrinks coverage area).
Supports both AWS resources (auto-detects region) and non-AWS resources (specify latitude/longitude).
Requires Route 53 Traffic Flow to use geoproximity routing.
Best for: Routing to nearest resource with ability to shift traffic between regions using bias.

Multivalue Answer Routing

Returns up to 8 healthy records in response to each DNS query.
Similar to simple routing but supports health checks – only healthy resources returned.
Not a substitute for a load balancer but provides basic DNS-level load balancing with health checking.
Each record can have its own health check.
Best for: Basic load distribution with health checking when you don’t need ELB.

IP-based Routing

Routes traffic based on client’s source IP address mapped to CIDR blocks.
Create CIDR collections with locations, then map records to locations.
Useful when you know the IP ranges of your users (corporate networks, ISPs).
More precise than geolocation – routes based on actual network, not estimated location.
Best for: ISP-specific routing, enterprise users with known IP ranges, optimizing costs by routing to specific endpoints.

Combining Routing Policies

Alias records can point to other Route 53 record sets, enabling policy combinations.
Example: Latency → Weighted (route to nearest region, then split between blue/green within region).
Example: Failover → Latency (primary is latency-based across regions, secondary is S3 static page).
Example: Geolocation → Failover (per-country routing with DR fallback).
Traffic Flow – visual editor for building complex routing trees with multiple policies.

AWS Certification Exam Practice Questions

A company wants to gradually migrate traffic from an on-premises data center to AWS by sending 10% of traffic to AWS initially, increasing over time. Which routing policy supports this?
1. Latency-based
2. Weighted
3. Failover
4. Geolocation
An application deployed in us-east-1 and eu-west-1 should route users to whichever region provides the fastest response. Which routing policy is appropriate?
1. Geolocation
2. Geoproximity
3. Latency-based
4. Weighted (50/50)
A streaming service must serve different content libraries to users in different countries due to licensing restrictions. Which routing policy enforces this?
1. Latency-based
2. Geolocation
3. Geoproximity
4. IP-based
A company needs to route traffic to the nearest data center but temporarily shift more traffic to a new region during a migration. Which routing policy allows adjusting the geographic coverage area?
1. Geolocation with failover
2. Weighted with latency
3. Geoproximity with bias
4. Multivalue answer
An architect needs DNS-level health checking where unhealthy endpoints are automatically removed from DNS responses, but a full load balancer is not required. Which policy provides this with multiple IPs?
1. Simple routing
2. Weighted routing
3. Failover routing
4. Multivalue answer routing

References

Route 53 Routing Policies

Route 53 Traffic Flow

AWS DynamoDB vs DocumentDB vs Neptune

June 12, 2026 ~ Kiro Agent

AWS DynamoDB vs DocumentDB vs Neptune

AWS offers multiple purpose-built NoSQL database services, each optimized for different data models and access patterns.
DynamoDB is a key-value/document database for high-scale low-latency workloads, DocumentDB is MongoDB-compatible for document workloads, and Neptune is a graph database for highly connected data.
Choice depends on data model, query patterns, scale requirements, and existing application compatibility.

DynamoDB vs DocumentDB vs Neptune Comparison

Feature	DynamoDB	DocumentDB	Neptune
Data Model	Key-value + Document	Document (JSON)	Graph (property graph + RDF)
Compatibility	AWS proprietary API	MongoDB 3.6/4.0/5.0 compatible	Gremlin, SPARQL, openCypher
Architecture	Serverless, fully managed	Cluster-based (primary + replicas)	Cluster-based (primary + replicas)
Scaling	Automatic, unlimited (horizontal)	Vertical (instance size) + read replicas (up to 15)	Vertical (instance size) + read replicas (up to 15)
Serverless Option	Yes (On-Demand or Provisioned)	Yes (DocumentDB Elastic Clusters)	Yes (Neptune Serverless)
Latency	Single-digit milliseconds	Low milliseconds	Milliseconds for traversals
Max Item/Document Size	400KB	16MB	N/A (graph edges/vertices)
Query Flexibility	Limited (partition key + sort key, GSI/LSI)	Rich (MongoDB query language, aggregation pipelines)	Graph traversals (multi-hop relationships)
Transactions	Yes (up to 100 items, 4MB)	Yes (multi-document ACID)	Yes (ACID)
Global Replication	Global Tables (multi-region active-active)	Global Clusters (up to 5 regions, read replicas)	Global Database (up to 5 read regions)
Change Streams	DynamoDB Streams / Kinesis Data Streams	Change Streams (MongoDB compatible)	Neptune Streams
Caching	DAX (microsecond reads)	No built-in (use ElastiCache)	No built-in (use ElastiCache)
Full-Text Search	No (integrate OpenSearch)	Basic text indexes	Neptune Analytics (vector + full-text)
Vector Search	No	No	Yes (Neptune Analytics)
Pricing	Per request (on-demand) or per RCU/WCU (provisioned)	Per instance-hour + storage + I/O	Per instance-hour + storage + I/O

Amazon DynamoDB

Fully serverless key-value and document database – single-digit millisecond latency at any scale.
Capacity modes: On-Demand (pay per request, zero capacity planning) or Provisioned (with Auto Scaling).
Designed for massive scale – handles 10+ trillion requests per day, peaks above 100 million requests/second.
Global Tables – multi-region, multi-active replication with Multi-Region Strong Consistency (MRSC, GA 2025).
DynamoDB Accelerator (DAX) – in-memory cache for microsecond read latency.
DynamoDB Streams – capture item-level changes for event-driven processing (Lambda integration).
TTL – automatic item expiration at no cost.
Zero-ETL to Redshift – replicate data to Redshift for analytics without pipelines.
Limitations: 400KB item size, limited query flexibility (must know partition key), no joins, no aggregations.
Best for: High-scale applications with known access patterns – gaming leaderboards, session stores, IoT, e-commerce carts, serverless backends.

Amazon DocumentDB

MongoDB-compatible document database – supports MongoDB 3.6, 4.0, and 5.0 API compatibility.
Purpose-built storage – separates compute from storage (similar to Aurora); storage auto-scales to 128TB.
Rich queries – full MongoDB query language, aggregation pipelines, secondary indexes, geospatial queries.
Elastic Clusters – shard collections across multiple nodes for horizontal scaling (millions of reads/writes per second).
Global Clusters – cross-region disaster recovery with up to 5 read regions.
Change Streams – MongoDB-compatible change data capture for event-driven architectures.
16MB document size – suitable for complex nested documents.
Not 100% MongoDB compatible – some features differ (check compatibility matrix).
Best for: MongoDB workloads migrating to AWS, content management, catalogs, user profiles, applications needing flexible schemas with rich querying.

Amazon Neptune

Fully managed graph database – purpose-built for storing and querying highly connected data.
Supports three query languages: Apache TinkerPop Gremlin (property graphs), SPARQL (RDF/linked data), and openCypher (declarative graph queries).
Neptune Analytics – analyze graph data with vector search, graph algorithms, and full-text search.
Neptune Serverless – automatically scales compute based on workload.
Neptune ML – machine learning predictions on graph data using GNNs (Graph Neural Networks) via SageMaker.
Global Database – cross-region read replicas for low-latency reads and disaster recovery.
Neptune Streams – capture graph changes for downstream processing.
Up to 15 read replicas – scale reads across multiple instances.
Best for: Relationship-heavy data – social networks, recommendation engines, fraud detection, knowledge graphs, network topology, identity graphs, supply chain.

When to Choose Which

Choose DynamoDB when:
- You need extreme scale with single-digit ms latency
- Access patterns are well-defined (key-value lookups)
- You want fully serverless with zero management
- Use cases: session stores, gaming, IoT, e-commerce, serverless apps
Choose DocumentDB when:
- You’re migrating from MongoDB or need MongoDB compatibility
- Documents are complex/nested and need flexible querying
- You need aggregation pipelines and secondary indexes
- Use cases: content management, catalogs, user profiles
Choose Neptune when:
- Data is highly connected with complex relationships
- Queries involve traversing relationships (multi-hop)
- You need graph algorithms (shortest path, centrality, community detection)
- Use cases: social networks, fraud detection, knowledge graphs, recommendations

AWS Certification Exam Practice Questions

A social media application needs to find “friends of friends” and recommend connections based on mutual relationships. Which database is purpose-built for this query pattern?
1. DynamoDB with GSI
2. DocumentDB with aggregation
3. Neptune (graph traversal)
4. RDS with JOIN queries
A company is migrating a MongoDB application to AWS. They use aggregation pipelines, geospatial queries, and change streams extensively. Which service provides the best compatibility?
1. DynamoDB with Document model
2. DocumentDB
3. Neptune
4. ElastiCache for MongoDB
A gaming application needs a leaderboard that handles 50,000 writes per second with single-digit millisecond latency, using simple key-value access patterns. Which database fits?
1. DocumentDB
2. Neptune
3. Aurora
4. DynamoDB
A fraud detection system needs to analyze transaction patterns by traversing relationships between accounts, devices, IP addresses, and merchants to find suspicious clusters. Which database is best suited?
1. DynamoDB with Streams
2. DocumentDB with aggregation
3. Neptune with graph algorithms
4. Redshift for analytics
An e-commerce application stores product catalogs with deeply nested attributes (variations, specifications, reviews) and needs to query by any attribute with aggregation. Documents average 2MB. Which database fits?
1. DynamoDB (400KB limit would be exceeded)
2. DocumentDB (16MB limit, rich queries)
3. Neptune
4. S3 with Athena

References

Amazon DynamoDB Developer Guide

Amazon DocumentDB Developer Guide

Amazon Neptune User Guide

AWS Step Functions vs EventBridge

June 12, 2026 ~ Kiro Agent

AWS Step Functions vs EventBridge

Both Step Functions and EventBridge are serverless services for coordinating workflows, but they serve fundamentally different purposes.
Step Functions orchestrates multi-step workflows with state management and error handling.
EventBridge routes events between services based on content-based rules without maintaining state.
They are often used together – EventBridge triggers Step Functions workflows based on events.

Step Functions vs EventBridge Comparison

Feature	Step Functions	EventBridge
Pattern	Orchestration (centralized control)	Choreography (decoupled routing)
State Management	Yes – tracks execution state, input/output between steps	No – stateless event routing
Execution Model	Sequential, parallel, branching, looping	Fire-and-forget event delivery
Duration	Standard: up to 1 year; Express: up to 5 minutes	Near real-time delivery (no duration concept)
Error Handling	Built-in Retry, Catch, Fallback states	Dead-letter queue on target delivery failure
Visibility	Visual workflow graph, step-by-step execution history	Rule match metrics, limited execution visibility
Targets/Integrations	200+ AWS service integrations (direct SDK calls)	200+ AWS service targets per rule
Event Sources	Triggered by API call, EventBridge, API Gateway, Lambda	90+ AWS services, SaaS partners, custom apps
Filtering	Choice state (conditions on input data)	Content-based filtering on event body (event patterns)
Parallelism	Parallel state, Distributed Map (millions of items)	Multiple targets per rule (fan-out)
Human Approval	Yes – Task tokens with callback pattern	No native support
Scheduling	Wait state (delay steps)	EventBridge Scheduler (cron/rate/one-time)
Replay	Redrive failed executions (2024)	Event Archive and Replay
Pricing	Standard: per state transition; Express: per request + duration	Per event published ($1/million)

AWS Step Functions

Serverless workflow orchestration – coordinates multiple AWS services into visual workflows.
Standard Workflows – up to 1 year, exactly-once execution, full execution history, ideal for long-running processes.
Express Workflows – up to 5 minutes, at-least-once, high-volume event processing (100K+ executions/second).
States: Task, Choice, Parallel, Map, Wait, Pass, Succeed, Fail.
Direct SDK integrations – call 200+ AWS services without Lambda (DynamoDB PutItem, SQS SendMessage, ECS RunTask, Bedrock InvokeModel).
Distributed Map – process millions of items from S3 in parallel (up to 10,000 concurrent executions).
Callback pattern – pause workflow, wait for external system/human approval via task token.
Error handling – Retry with exponential backoff, Catch with fallback states, per-step timeout.
Redrive (2024) – restart failed executions from the point of failure without re-running completed steps.
Variables and JSONata (2024) – workflow-level variables and powerful data transformation expressions.
Best for: Multi-step processes needing coordination, error handling, human approval, long-running workflows, batch processing.

Amazon EventBridge

Serverless event bus – routes events between decoupled services based on rules.
Receives events from 90+ AWS services automatically without configuration.
Content-based filtering – event patterns match on any field in the event JSON body.
Multiple targets per rule – fan-out a single event to up to 5 targets.
EventBridge Scheduler – millions of one-time or recurring schedules (replaces CloudWatch Events).
EventBridge Pipes – point-to-point with filtering, enrichment, and transformation between source and target.
Event Archive and Replay – store events indefinitely for reprocessing or debugging.
Schema Registry – auto-discover event schemas for code generation.
Global endpoints – automatic failover to secondary region.
SaaS integrations – receive events from Zendesk, Datadog, Shopify, Auth0, etc.
Best for: Event-driven architectures, reacting to AWS service changes, decoupled microservices, SaaS integration, scheduling.

When to Choose Which

Choose Step Functions when:
- You need to coordinate multiple steps in a specific order
- Workflow requires error handling with retries and fallbacks
- You need visibility into which step succeeded/failed
- Process requires human approval or external callbacks
- Long-running processes (minutes to months)
- Batch processing of millions of items (Distributed Map)
Choose EventBridge when:
- You need to react to events from AWS services or SaaS apps
- Services should be decoupled (producers don’t know about consumers)
- Routing based on event content to different targets
- You need scheduling (cron jobs, one-time future events)
- Fan-out: one event triggers multiple independent actions
- Cross-account or cross-region event routing
Use Both Together: EventBridge detects an event (e.g., S3 upload) → triggers Step Functions workflow → orchestrates multi-step processing (validate → transform → load → notify).

AWS Certification Exam Practice Questions

An order processing system requires validating payment, checking inventory, reserving items, charging the card, and sending confirmation – each step depends on the previous one succeeding. If payment fails, the reserved items must be released. Which service handles this?
1. EventBridge with multiple rules
2. Step Functions with error handling (Catch/compensating actions)
3. SQS with multiple queues
4. SNS with filter policies
A company wants to automatically trigger different Lambda functions when EC2 instances change state (running, stopped, terminated) – each state routes to a different function. Which service is most appropriate?
1. Step Functions with Choice state
2. CloudWatch Alarms
3. EventBridge with content-based rules
4. SNS with message filtering
A data pipeline processes millions of S3 objects in parallel, with each object needing 3 transformation steps. The pipeline must track progress and retry individual failures. Which approach is recommended?
1. EventBridge Pipes with SQS
2. Lambda triggered by S3 events
3. Step Functions Distributed Map
4. EventBridge with Lambda targets
A workflow requires pausing execution until a human reviews and approves a document via an external web application (may take hours or days). Which feature supports this?
1. EventBridge wait pattern
2. Step Functions callback pattern with task token
3. SQS visibility timeout
4. Lambda with DynamoDB polling
A company needs to schedule 2 million one-time reminder notifications to be sent at specific future times (each different). Which service handles this at scale?
1. Step Functions Wait state
2. CloudWatch Events cron
3. EventBridge Scheduler
4. SQS delay queues

References

AWS Step Functions Developer Guide

Amazon EventBridge User Guide

EventBridge Scheduler User Guide

AWS CloudWatch vs CloudTrail vs Config

June 12, 2026 ~ Last updated on : June 12, 2026 ~ Kiro Agent

AWS CloudWatch vs CloudTrail vs Config

AWS provides three core monitoring and governance services that are often confused but serve distinct purposes.
CloudWatch monitors performance and operational health, CloudTrail records API activity (who did what), and Config tracks resource configuration changes and compliance.
All three work together for a complete observability and governance strategy.

CloudWatch vs CloudTrail vs Config Comparison

Feature	CloudWatch	CloudTrail	Config
Purpose	Performance monitoring & observability	API audit trail & activity logging	Resource configuration tracking & compliance
Answers	“How is it performing?”	“Who did what and when?”	“What changed and is it compliant?”
Data Type	Metrics, logs, traces, events	API call records (events)	Resource configuration snapshots
Scope	Resources, applications, services	AWS account API activity	AWS resource inventory & state
Retention	Metrics: 15 months; Logs: configurable (forever)	90 days (console) or S3 (indefinite)	Indefinite (configuration history)
Alerting	Yes (Alarms on metrics and logs)	Via EventBridge or CloudWatch Logs	Yes (Config Rules – non-compliant triggers SNS)
Automation	Auto Scaling, EC2 actions, Lambda	EventBridge rules trigger actions	Auto-remediation via SSM Automation
Cross-account	Cross-account dashboards, metric sharing	Organization trail	Aggregator (multi-account, multi-region)
Pricing	Per metric, log ingestion, dashboard	Free (management events, 1 copy); data events paid	Per rule evaluation + per configuration item recorded
Example	CPU > 80% for 5 minutes → alarm	User X deleted S3 bucket at 3:42pm	Security group changed to allow 0.0.0.0/0 → non-compliant

Amazon CloudWatch

Monitoring and observability service for AWS resources and applications.
Metrics – collect and track standard (free) and custom metrics; 1-second resolution available.
Alarms – trigger actions (Auto Scaling, SNS, EC2 stop/terminate/reboot) when metrics cross thresholds.
Logs – centralized log collection with Logs Insights for SQL-like querying.
Dashboards – create visualizations across accounts and regions.
CloudWatch Agent – collect OS-level metrics (memory, disk) and application logs from EC2.
Anomaly Detection – ML-based bands to detect unusual metric behavior.
Composite Alarms – combine multiple alarms with AND/OR logic to reduce noise.
Synthetics – canary scripts to monitor endpoints and APIs proactively.
Application Signals – automatic application monitoring with SLOs (GA 2024).
Internet Monitor – monitor internet connectivity to your application.
Database Insights – unified database monitoring across RDS, Aurora, and self-managed databases.

AWS CloudTrail

Records all API calls made in your AWS account – who, what, when, from where.
Management events – control plane operations (CreateBucket, RunInstances, etc.) – free, 1 copy per region.
Data events – data plane operations (S3 GetObject, Lambda Invoke, DynamoDB GetItem) – paid.
Insights events – detect unusual API activity patterns (e.g., spike in API calls).
Trail delivery – send events to S3 (long-term storage) and/or CloudWatch Logs (real-time alerting).
Organization trail – single trail for all accounts in AWS Organizations.
CloudTrail Lake – managed data lake for querying events with SQL (replaces Athena queries on S3).
Event history – 90-day free lookup in the console (management events only).
Integrity validation – digest files prove logs haven’t been tampered with.
Network activity events (2024) – track VPC endpoint API calls for data perimeter monitoring.

AWS Config

Tracks resource configuration changes and evaluates compliance over time.
Configuration recorder – captures current state of resources as configuration items.
Configuration history – timeline of how a resource’s configuration changed.
Config Rules – evaluate resources against desired configurations (400+ AWS managed rules + custom Lambda rules).
Conformance Packs – collection of Config Rules and remediation actions packaged as a single entity.
Auto-remediation – automatically fix non-compliant resources via SSM Automation documents.
Aggregator – centralized view across multiple accounts and regions.
Advanced Query – SQL queries on current configuration state of all resources.
Proactive compliance (2024) – evaluate CloudFormation templates BEFORE deployment.
Service-linked rules – Config Rules managed by other AWS services (Security Hub, Control Tower).
Resource timeline – view config changes, compliance changes, and CloudTrail events together.

How They Work Together

Security incident investigation: Config shows WHAT changed → CloudTrail shows WHO changed it → CloudWatch shows the IMPACT on performance.
Compliance automation: Config Rule detects non-compliant resource → triggers SNS → Auto-remediation fixes it → CloudTrail logs the remediation → CloudWatch tracks the metric.
Proactive monitoring: CloudWatch alarm fires on high error rate → CloudTrail reveals recent deployment → Config shows configuration change that caused it.

When to Choose Which

Use CloudWatch – Monitor CPU/memory/disk, set alarms for thresholds, centralize application logs, create dashboards, track SLOs.
Use CloudTrail – Audit API calls, investigate security incidents, meet compliance requirements for activity logging, detect unusual API patterns.
Use Config – Track resource configuration drift, enforce compliance rules, audit resource history, auto-remediate non-compliant resources.
Use all three together – Complete governance: monitoring (CloudWatch) + auditing (CloudTrail) + compliance (Config).

AWS Certification Exam Practice Questions

A security team needs to determine who deleted an S3 bucket last Tuesday and from which IP address. Which service provides this information?
1. CloudWatch Logs
2. CloudTrail
3. AWS Config
4. VPC Flow Logs
A company needs to ensure all Security Groups in their account never allow SSH (port 22) from 0.0.0.0/0. If a non-compliant Security Group is detected, it should be automatically remediated. Which service provides this?
1. CloudWatch Alarm with Lambda
2. CloudTrail with EventBridge rule
3. AWS Config Rule with auto-remediation
4. GuardDuty
An operations team wants to receive an alert when EC2 CPU utilization exceeds 90% for more than 5 minutes and automatically add instances to the fleet. Which service and feature enables this?
1. CloudTrail with SNS
2. Config Rule with remediation
3. CloudWatch Alarm with Auto Scaling action
4. EventBridge with Step Functions
A compliance auditor needs to see the complete configuration history of an RDS instance over the past 6 months, including every change to its configuration. Which service provides this timeline view?
1. CloudTrail event history
2. CloudWatch Logs
3. AWS Config (configuration timeline)
4. RDS event notifications
An organization wants to detect when an unusually high number of API calls are made to IAM (potential credential compromise). Which service and feature is purpose-built for this?
1. CloudWatch Anomaly Detection
2. Config Rule
3. CloudTrail Insights
4. GuardDuty

References

Amazon CloudWatch User Guide

AWS CloudTrail User Guide

AWS Config Developer Guide

AWS KMS vs CloudHSM vs Secrets Manager vs Parameter Store

June 12, 2026 ~ Last updated on : June 12, 2026 ~ Kiro Agent

AWS KMS vs CloudHSM vs Secrets Manager vs Parameter Store

AWS provides multiple services for managing encryption keys and secrets, each designed for different security requirements and use cases.
KMS is managed key management, CloudHSM is dedicated hardware security modules, Secrets Manager is for rotating secrets, and Systems Manager Parameter Store is for configuration and secrets storage.
Choice depends on compliance requirements (FIPS 140-2 Level 3), key control needs, rotation requirements, and cost.

KMS vs CloudHSM vs Secrets Manager vs Parameter Store Comparison

Feature	KMS	CloudHSM	Secrets Manager	Parameter Store
Purpose	Managed encryption key service	Dedicated HSM for key management	Secret storage with automatic rotation	Configuration & secret storage
Key Control	AWS manages HSM, you manage keys	You manage everything (single-tenant HSM)	Uses KMS for encryption	Uses KMS for encryption (SecureString)
FIPS 140-2	Level 3 (since 2023)	Level 3	N/A (uses KMS)	N/A (uses KMS)
Multi-tenancy	Multi-tenant (shared infrastructure)	Single-tenant (dedicated hardware)	Multi-tenant	Multi-tenant
Automatic Rotation	Yes (annual for AWS-managed, configurable 90-365 days for customer-managed)	Manual (you control rotation)	Yes (built-in for RDS, Redshift, DocumentDB; Lambda for custom)	No built-in rotation
Cross-account	Yes (key policy + IAM)	No (same VPC/account)	Yes (resource policy)	Yes (resource policy, Advanced tier)
Cross-region	Multi-Region keys	Cluster in single region	Multi-Region secret replication	No native replication
Max Secret Size	4KB (symmetric key operations)	Unlimited (HSM capacity)	64KB	4KB (Standard) / 8KB (Advanced)
Pricing	$1/month per key + API calls	~$1.50/hour per HSM ($1,095/month)	$0.40/secret/month + API calls	Free (Standard) / $0.05/parameter/month (Advanced)
Versioning	Automatic (rotation creates new version)	Manual	Yes (staging labels: AWSCURRENT, AWSPREVIOUS)	Yes (up to 100 versions)
Audit	CloudTrail	CloudTrail + HSM audit logs	CloudTrail	CloudTrail
AWS Integration	100+ services natively	Custom integration required	RDS, Redshift, DocumentDB, ECS, Lambda	ECS, Lambda, CloudFormation, CodeDeploy
Key Types	Symmetric (AES-256), Asymmetric (RSA, ECC), HMAC	Symmetric, Asymmetric, HMAC, custom algorithms	N/A (stores secrets, not keys)	N/A (stores values)

AWS KMS (Key Management Service)

Fully managed encryption key service integrated with 100+ AWS services.
Three key types: AWS owned (free, AWS-managed), AWS managed (auto-created per service), Customer managed (full control).
Envelope encryption – generates data keys for encrypting data locally; KMS never stores data keys.
Multi-Region keys – replicate keys across regions for cross-region encryption/decryption.
Key policies + IAM – fine-grained access control; grants for temporary access.
Automatic key rotation – configurable 90-365 days for customer-managed keys (was annual only before 2024).
External Key Store (XKS) – use keys stored in your own HSM outside AWS.
FIPS 140-2 Level 3 validated since March 2023.
Best for: Most encryption use cases – S3, EBS, RDS, DynamoDB, Lambda, and 100+ other AWS services.

AWS CloudHSM

Dedicated, single-tenant HSM instances in your VPC – you own and manage the keys.
FIPS 140-2 Level 3 validated hardware – required for certain regulatory compliance.
Full key control – AWS cannot access your keys; AWS manages hardware only.
Supports PKCS#11, JCE, CNG, and OpenSSL interfaces for custom applications.
Cluster-based – deploy across multiple AZs for HA; keys automatically replicated.
Custom key store for KMS – back KMS keys with CloudHSM for compliance + service integration.
SSL/TLS offloading – use CloudHSM for web server private keys.
Code signing, certificate authority – custom crypto operations not available in KMS.
Best for: Regulatory compliance (PCI-DSS, HIPAA requiring dedicated HSM), custom cryptographic operations, SSL offloading, certificate authorities.

AWS Secrets Manager

Purpose-built for managing secrets (database credentials, API keys, tokens).
Automatic rotation – built-in for RDS (MySQL, PostgreSQL, Oracle, SQL Server, MariaDB), Redshift, DocumentDB; Lambda-based for custom secrets.
Multi-Region replication – replicate secrets across regions for DR and multi-region applications.
Versioning with staging labels – AWSCURRENT, AWSPREVIOUS, AWSPENDING during rotation.
Resource-based policies – share secrets cross-account.
Integration – ECS/Fargate (inject as environment variables), Lambda, RDS Proxy.
Batch retrieval – retrieve up to 20 secrets in a single API call.
Best for: Database credentials that need automatic rotation, API keys, OAuth tokens, any secret requiring lifecycle management.

Systems Manager Parameter Store

Hierarchical configuration storage for both configuration data and secrets.
Two tiers: Standard (free, 4KB, 10K params) and Advanced ($0.05/month, 8KB, 100K params).
Parameter types: String, StringList, SecureString (encrypted with KMS).
Hierarchy and tagging – organize parameters like /prod/db/password, /dev/api/key.
No built-in rotation – use EventBridge + Lambda for custom rotation.
Parameter policies (Advanced tier) – expiration notifications, no-change notifications.
Public parameters – AWS provides latest AMI IDs, ECS-optimized AMI, etc.
CloudFormation integration – resolve parameters dynamically during stack creation.
Best for: Application configuration, feature flags, non-rotating secrets, AMI IDs, and cost-sensitive use cases where rotation isn’t needed.

When to Choose Which

Choose KMS – Encrypting data in AWS services (S3, EBS, RDS), envelope encryption, most standard encryption needs.
Choose CloudHSM – Regulatory requirement for dedicated HSM (FIPS 140-2 Level 3 single-tenant), custom cryptographic operations, SSL offloading, running your own CA.
Choose Secrets Manager – Database credentials needing automatic rotation, API keys with lifecycle management, cross-region secret replication.
Choose Parameter Store – Application configuration, feature flags, non-rotating secrets, cost-sensitive (free tier), hierarchical organization of config data.
Combine KMS + Secrets Manager – Secrets Manager uses KMS for encryption; use customer-managed KMS key for additional control.
Combine CloudHSM + KMS – Use CloudHSM as a custom key store backing KMS keys (compliance + service integration).

AWS Certification Exam Practice Questions

A company needs to store database credentials that automatically rotate every 30 days and are accessible from ECS tasks as environment variables. Which service is most appropriate?
1. KMS with custom rotation
2. Parameter Store SecureString
3. Secrets Manager
4. CloudHSM
A financial institution must use dedicated hardware security modules (not shared) for key management to satisfy PCI-DSS Level 1 compliance. Which service meets this requirement?
1. KMS with customer-managed keys
2. CloudHSM
3. KMS with external key store
4. Secrets Manager with KMS
A development team needs to store application configuration values (non-sensitive) and sensitive database passwords together in a hierarchical structure with minimal cost. Which approach is recommended?
1. Secrets Manager for all values
2. Parameter Store (String for config, SecureString for passwords)
3. KMS encrypted S3 bucket
4. DynamoDB with encryption
An application needs encryption keys that work identically across 3 AWS regions for cross-region data encryption/decryption without re-encrypting. Which feature enables this?
1. CloudHSM cluster replication
2. Secrets Manager multi-region secrets
3. KMS Multi-Region keys
4. KMS key import in each region
A company wants to use AWS KMS for service integrations but needs their keys to remain in their on-premises HSM that they fully control. Which KMS feature supports this?
1. CloudHSM custom key store
2. KMS imported key material
3. KMS External Key Store (XKS)
4. KMS with VPN connection

References

AWS KMS Developer Guide

AWS CloudHSM User Guide

AWS Secrets Manager User Guide

AWS Systems Manager Parameter Store

AWS Container Services Cheat Sheet

June 10, 2026 ~ Kiro Agent

AWS Container Services Cheat Sheet

AWS provides a full container stack: orchestration (ECS, EKS), compute (Fargate, EC2), registry (ECR), and supporting services (App Mesh, Cloud Map, Proton).
Containers package applications with dependencies for consistent deployment across environments.

Container Orchestration

Amazon ECS (Elastic Container Service)

AWS-native container orchestrator – deeply integrated with IAM, CloudWatch, ALB, VPC.
Task Definition – blueprint for containers (image, CPU, memory, ports, IAM role, volumes).
Service – maintains desired count of tasks, integrates with load balancers, handles rolling updates.
Launch types: EC2 (you manage instances) or Fargate (serverless).
No control plane cost – free; pay only for EC2 or Fargate compute.
Capacity Providers – automatic EC2 Auto Scaling and Fargate/Fargate Spot management.
Service Connect – simplified service-to-service communication (built-in service mesh).
ECS Exec – interactive shell into running containers for debugging.
ECS Anywhere – run ECS tasks on on-premises servers.
Blue/Green deployments – native CodeDeploy integration.

Amazon EKS (Elastic Kubernetes Service)

Managed Kubernetes – certified conformant, runs upstream K8s.
Control plane: fully managed by AWS ($0.10/hour per cluster).
Compute options: Managed Node Groups, Self-Managed Nodes, Fargate, EKS Auto Mode.
EKS Auto Mode – AWS manages nodes, scaling, upgrades, and security patches automatically.
Add-ons: CoreDNS, kube-proxy, VPC CNI, EBS CSI, managed via EKS.
EKS Anywhere – run Kubernetes on-premises with EKS management.
EKS Connector – register external Kubernetes clusters to the EKS console.
Full Kubernetes ecosystem: Helm, Karpenter, Istio, ArgoCD, Prometheus, etc.

Compute

AWS Fargate

Serverless compute for ECS and EKS – no EC2 to manage.
Per-task pricing: vCPU-hour + GB-hour (per second billing, 1-min minimum).
Isolation: each task/pod runs in its own Firecracker microVM.
Fargate Spot: up to 70% discount; tasks can be interrupted with 2-min warning.
Limitations: no GPU, no EBS, no daemonsets, no privileged containers.
Storage: 20GB ephemeral per task (configurable up to 200GB) + EFS supported.

Container Registry

Amazon ECR (Elastic Container Registry)

Fully managed Docker container registry – stores, manages, and deploys container images.
Private repositories with IAM-based access control.
Public repositories (ECR Public Gallery) for open-source images.
Image scanning – automatic vulnerability scanning (Basic with Clair, or Enhanced with Inspector).
Lifecycle policies – automatically clean up old/untagged images.
Cross-region and cross-account replication.
Image immutability – prevent image tags from being overwritten.
OCI support – stores OCI images and Helm charts.

Networking & Service Discovery

AWS App Mesh

Service mesh using Envoy proxy for traffic management, observability, and security between services.
Supports ECS, EKS, and EC2 workloads.
Features: traffic routing, retries, timeouts, circuit breaking, mutual TLS.

Amazon VPC Lattice

Application-layer networking – connect, secure, and monitor services across VPCs and accounts.
Simpler than App Mesh – no sidecar proxies needed.
Supports ECS, EKS, Lambda, and EC2 targets.

AWS Cloud Map

Service discovery – register and discover services using DNS or API.
Health checking for registered instances.
Used by ECS Service Connect and App Mesh.

CI/CD & DevOps

AWS CodePipeline – CI/CD pipeline automation for container deployments.
AWS CodeBuild – build container images (docker build + push to ECR).
AWS CodeDeploy – blue/green deployments for ECS services.
AWS Proton – managed delivery service for container and serverless application templates.
AWS Copilot – CLI for building, releasing, and operating containerized apps on ECS.

Monitoring & Logging

CloudWatch Container Insights – metrics and logs for ECS and EKS (CPU, memory, network, disk per task/pod).
AWS X-Ray – distributed tracing for containerized microservices.
FireLens – ECS log router using Fluent Bit/Fluentd to send logs to CloudWatch, S3, Splunk, Datadog.
CloudWatch Logs – awslogs driver for ECS; Fluent Bit DaemonSet for EKS.

AWS Certification Exam Practice Questions

A team wants to run containers without managing any infrastructure and needs the lowest operational overhead. They don’t use Kubernetes. Which combination is correct?
1. EKS with Fargate
2. ECS with Fargate
3. ECS with EC2
4. EKS with Managed Node Groups
A company needs to automatically scan container images for vulnerabilities when pushed to the registry. Which service and feature provides this?
1. ECS image scanning
2. ECR Enhanced Scanning (with Amazon Inspector)
3. GuardDuty container protection
4. AWS Config rules
An application needs service-to-service communication with mutual TLS, traffic routing, and retry policies across ECS and EKS services. Which service provides this?
1. VPC Lattice
2. Cloud Map
3. AWS App Mesh
4. ECS Service Connect
A company needs to run Kubernetes on-premises while managing it with the same tools used for their AWS EKS clusters. Which service supports this?
1. ECS Anywhere
2. EKS Anywhere
3. EKS Connector
4. AWS Outposts

References

Amazon ECS Developer Guide

Amazon EKS User Guide

Amazon ECR User Guide

AWS Storage Services Cheat Sheet

June 10, 2026 ~ Kiro Agent

AWS Storage Services Cheat Sheet

AWS provides storage services across four categories: Object (S3), Block (EBS), File (EFS, FSx), and Hybrid/Edge (Storage Gateway, Snow Family).
Each is optimized for different access patterns, latency requirements, and cost profiles.

Object Storage

Amazon S3

Unlimited object storage with 99.999999999% (11 nines) durability.
Max object size: 5TB; multipart upload for objects >100MB.
Storage Classes:
- S3 Standard – frequently accessed, low latency, high throughput.
- S3 Intelligent-Tiering – automatic cost optimization with access pattern monitoring.
- S3 Standard-IA – infrequent access, lower storage cost, retrieval fee.
- S3 One Zone-IA – single AZ, 20% cheaper than Standard-IA.
- S3 Glacier Instant Retrieval – archive with millisecond access.
- S3 Glacier Flexible Retrieval – archive, 1-12 hour retrieval.
- S3 Glacier Deep Archive – lowest cost, 12-48 hour retrieval.
- S3 Express One Zone – single-digit millisecond, single AZ, for analytics.
Lifecycle Policies – automatically transition objects between classes or expire them.
Versioning – keep multiple versions; protect against accidental deletes.
Replication – Cross-Region (CRR) or Same-Region (SRR) replication.
Object Lock – WORM (Write Once Read Many) for compliance (Governance or Compliance mode).
S3 Event Notifications – trigger Lambda, SQS, SNS, EventBridge on object events.
S3 Transfer Acceleration – faster uploads using CloudFront edge locations.
S3 Select / Glacier Select – retrieve subset of data using SQL.
Encryption: SSE-S3 (default), SSE-KMS, SSE-C, client-side.
Access Control: Bucket policies, IAM policies, ACLs (legacy), Access Points, Block Public Access.

Block Storage

Amazon EBS

Persistent block storage for EC2 instances.
Volume Types:
- gp3 – general purpose SSD, 3,000 IOPS baseline, up to 16,000 IOPS. Cost-effective default.
- gp2 – general purpose SSD, burst up to 3,000 IOPS (legacy, prefer gp3).
- io2 Block Express – highest performance SSD, up to 256,000 IOPS, sub-ms latency. For databases.
- io1 – provisioned IOPS SSD, up to 64,000 IOPS.
- st1 – throughput-optimized HDD, up to 500 MB/s. For big data, data warehouses.
- sc1 – cold HDD, lowest cost, up to 250 MB/s. For infrequent access.
Single AZ – must be in same AZ as EC2 instance.
Snapshots – point-in-time backups to S3 (incremental); can copy cross-region.
Multi-Attach – io2 volumes can attach to up to 16 Nitro instances in same AZ.
Encryption – AES-256 via KMS; encrypt at rest, in transit, and snapshots.
Elastic Volumes – resize, change type, or adjust IOPS without downtime.

EC2 Instance Store

Ephemeral block storage physically attached to host – highest IOPS/throughput.
Data lost on instance stop/terminate/failure.
Use for: temporary buffers, caches, scratch data.

File Storage

Amazon EFS

Managed NFS (NFSv4.1) – concurrent access from multiple EC2, ECS, Lambda.
Elastic – grows/shrinks automatically; pay only for what you use.
Performance modes: General Purpose (latency-sensitive) and Max I/O (high parallelism).
Throughput modes: Elastic (auto), Bursting, Provisioned.
Storage classes: Standard, Infrequent Access (IA), Archive – with lifecycle management.
Regional – data stored across multiple AZs; One Zone option available at lower cost.
Supports cross-region replication for DR.

Amazon FSx

FSx for Windows File Server – managed Windows SMB with Active Directory, DFS, VSS.
FSx for Lustre – high-performance parallel file system (HPC, ML). Integrates with S3.
FSx for NetApp ONTAP – multi-protocol (NFS, SMB, iSCSI) with snapshots, clones, tiering.
FSx for OpenZFS – high-performance NFS with snapshots and data compression.

Hybrid & Edge Storage

AWS Storage Gateway

Hybrid cloud storage connecting on-premises to AWS.
S3 File Gateway – NFS/SMB access to S3 objects.
FSx File Gateway – local cache for FSx for Windows File Server.
Volume Gateway – iSCSI block storage backed by S3 (Cached or Stored mode).
Tape Gateway – virtual tape library (VTL) backed by S3/Glacier.

AWS Snow Family

Snowcone – 8-14TB, portable edge computing and data transfer.
Snowball Edge – 80-210TB, Storage Optimized or Compute Optimized.
Snowmobile – 100PB, exabyte-scale data migration (truck).
Use for: offline data migration, edge computing where connectivity is limited.

AWS DataSync

Automated data transfer – on-premises to AWS (S3, EFS, FSx) or between AWS services.
Up to 10x faster than open-source tools; built-in scheduling, integrity validation.

AWS Transfer Family

Managed SFTP, FTPS, FTP, and AS2 transfers directly to/from S3 or EFS.

AWS Certification Exam Practice Questions

A company needs to store 500TB of data that is rarely accessed (once per quarter) but must be retrievable within milliseconds. Which storage class is most cost-effective?
1. S3 Standard
2. S3 Standard-IA
3. S3 Glacier Instant Retrieval
4. S3 Glacier Deep Archive
A database requires 100,000 IOPS with sub-millisecond latency on a single EC2 instance. Which EBS volume type should be used?
1. gp3
2. io1
3. io2 Block Express
4. st1
Multiple EC2 instances across AZs need shared access to a POSIX-compliant file system with automatic capacity scaling. Which service fits?
1. EBS Multi-Attach
2. S3
3. EFS
4. FSx for Lustre
A company needs to migrate 50TB of on-premises data to S3, but their internet connection would take 2 weeks. Which service provides faster physical transfer?
1. S3 Transfer Acceleration
2. DataSync
3. Snowball Edge
4. Storage Gateway
An on-premises application uses NFS to access files that must be stored in S3 for durability. Which service provides this transparent NFS-to-S3 bridge?
1. EFS
2. FSx for ONTAP
3. S3 File Gateway
4. DataSync

References

Amazon S3 User Guide

Amazon EBS User Guide

Amazon EFS User Guide

AWS Storage Gateway User Guide

AWS Serverless Services Cheat Sheet

June 10, 2026 ~ Kiro Agent

AWS Serverless Services Cheat Sheet

AWS serverless services allow running applications without provisioning or managing servers.
Services automatically scale, provide built-in high availability, and use pay-per-use pricing.
Core serverless stack: Lambda (compute) + API Gateway (API) + DynamoDB (database) + S3 (storage) + EventBridge (events) + Step Functions (orchestration).

Compute

AWS Lambda

Run code without provisioning servers – event-driven, function-as-a-service.
Triggers: 200+ event sources (S3, DynamoDB, SQS, API Gateway, EventBridge, Kinesis, etc.).
Duration: max 15 minutes per invocation.
Memory: 128MB – 10GB (CPU proportional).
Concurrency: 1,000 default (can increase); Reserved and Provisioned Concurrency available.
Pricing: per request ($0.20/million) + per GB-second of compute.
Deployment: ZIP package (250MB unzipped) or container image (10GB).
Layers: share code/libraries across functions (up to 5 layers).
Versions & Aliases: immutable versions with aliases for traffic shifting (canary/linear).
Lambda@Edge: run at CloudFront edge locations (viewer/origin request/response).
SnapStart: reduce cold starts for Java functions (caches initialized snapshot).

AWS Fargate

Serverless compute for containers – works with ECS and EKS.
No EC2 instances to manage – per-task pricing (vCPU + memory per second).
Each task runs in isolated microVM (Firecracker).
Fargate Spot: up to 70% savings for fault-tolerant workloads.

AWS App Runner

Fully managed – deploy from source code (GitHub) or container image to running web service in minutes.
Auto-scales based on traffic; can pause when idle.
Built-in HTTPS endpoint, load balancing, and certificate management.

API & Integration

Amazon API Gateway

Create, publish, and manage REST, HTTP, and WebSocket APIs.
REST API: full-featured (caching, request validation, WAF, usage plans, API keys).
HTTP API: lower latency, lower cost, simpler (JWT authorizers, OIDC).
WebSocket API: real-time two-way communication.
Integrates with Lambda, HTTP backends, AWS services, and Mock integrations.
Throttling: 10,000 requests/second default (account-level), burst 5,000.
Stages: dev/staging/prod with stage variables and canary deployments.

AWS Step Functions

Serverless orchestration – coordinate multiple AWS services into workflows.
Standard Workflows: up to 1 year, exactly-once execution, audit history.
Express Workflows: up to 5 minutes, at-least-once, high-volume event processing.
States: Task, Choice, Parallel, Map, Wait, Pass, Succeed, Fail.
Direct 200+ AWS service integrations (DynamoDB, SQS, SNS, ECS, Bedrock, etc.).
Distributed Map: process millions of items in parallel from S3.
Built-in error handling with Retry and Catch.

Amazon EventBridge

Serverless event bus for event-driven architectures.
90+ AWS service events automatically; SaaS partner events; custom events.
Content-based filtering, event archive/replay, schema registry.
EventBridge Scheduler: one-time and recurring schedules (replaces CloudWatch Events cron).
EventBridge Pipes: point-to-point integration with filtering and enrichment.

Data & Storage

Amazon DynamoDB

Serverless NoSQL database – single-digit millisecond latency at any scale.
Capacity modes: On-Demand (pay per request) or Provisioned (with Auto Scaling).
DynamoDB Streams: capture item-level changes for event-driven processing.
Global Tables: multi-region, multi-active replication.
DAX: in-memory cache for microsecond read latency.
TTL: automatic item expiration at no cost.

Amazon S3

Serverless object storage – unlimited capacity, 11 nines durability.
Event notifications to Lambda, SQS, SNS, EventBridge.
S3 Object Lambda: transform data on retrieval using Lambda.

Amazon Aurora Serverless v2

Serverless relational database – scales instantly from 0.5 to 256 ACUs.
MySQL and PostgreSQL compatible.
Scales to zero (when paused) for development workloads.

Messaging

Amazon SQS

Serverless message queue – Standard (unlimited throughput) or FIFO (ordering + exactly-once).
Retention up to 14 days. Visibility timeout, delay queues, dead-letter queues.

Amazon SNS

Serverless pub/sub messaging – fan out to SQS, Lambda, HTTP, email, SMS.
Message filtering on attributes. FIFO topics for ordered fan-out.

Other Serverless Services

AWS AppSync – managed GraphQL and Pub/Sub API with real-time data sync.
Amazon Cognito – serverless user authentication, authorization, and user management.
AWS SAM (Serverless Application Model) – framework for building serverless applications (extends CloudFormation).
Amazon Kinesis Data Firehose – serverless streaming ETL to S3, Redshift, OpenSearch.
AWS Glue – serverless ETL for data preparation.
Amazon OpenSearch Serverless – serverless search and analytics.

Serverless Architecture Patterns

Synchronous API: API Gateway → Lambda → DynamoDB
Asynchronous Processing: S3 Event → SQS → Lambda → DynamoDB
Fan-out: SNS → multiple SQS queues → Lambda consumers
Orchestration: API Gateway → Step Functions → (Lambda + DynamoDB + SNS)
Event-driven: EventBridge → Lambda / Step Functions / SQS
Streaming: Kinesis → Lambda → DynamoDB / S3

AWS Certification Exam Practice Questions

A serverless application needs to coordinate a multi-step order processing workflow that includes payment, inventory check, and shipping. Each step may take variable time and needs error handling with retries. Which service orchestrates this?
1. Amazon SQS with multiple queues
2. Lambda calling Lambda
3. AWS Step Functions
4. EventBridge rules
A Lambda function needs to process messages from an SQS queue but should handle no more than 5 messages concurrently to avoid overwhelming a downstream API. What controls this?
1. SQS visibility timeout
2. Lambda Reserved Concurrency set to 5
3. SQS MaximumBatchSize
4. Lambda timeout
An application needs a serverless relational database that automatically scales based on load and has zero cost when there are no connections. Which service provides this?
1. DynamoDB On-Demand
2. RDS with Auto Scaling
3. Aurora Serverless v2 (with pause)
4. ElastiCache Serverless
A company wants to create a REST API with caching, request validation, API keys for rate limiting, and AWS WAF integration. Which API Gateway type should they use?
1. HTTP API
2. REST API
3. WebSocket API
4. AppSync

References

AWS Serverless Overview

AWS Lambda Developer Guide

AWS Step Functions Developer Guide

AWS AI & ML Services Cheat Sheet

June 10, 2026 ~ Last updated on : July 13, 2026 ~ Kiro Agent

AWS AI & ML Services Cheat Sheet

AWS provides a comprehensive suite of AI and Machine Learning services spanning generative AI, ML platforms, AI services, and responsible AI.
Services range from pre-trained APIs requiring no ML expertise to fully managed platforms for custom model training and deployment.
This cheat sheet covers services relevant to the AWS AI Practitioner (AIF-C01), ML Engineer Associate (MLA-C01), and Solutions Architect certifications.

🎓 Build AI Skills with Google
Learn practical AI skills and earn a Google Certificate. No experience required – learn at your own pace.
Start the Google AI Essentials Learning Path →

Generative AI Services

Amazon Bedrock

Fully managed service to build generative AI applications using foundation models (FMs).
Access models from AI21 Labs, Anthropic (Claude), Cohere, Meta (Llama), Mistral, Stability AI, and Amazon (Titan).
No infrastructure to manage – serverless API access to foundation models.
Knowledge Bases – implement RAG (Retrieval Augmented Generation) by connecting FMs to your data sources (S3, databases).
Agents – create AI agents that can plan, execute multi-step tasks, and call APIs/Lambda functions.
Guardrails – control model outputs with content filters, denied topics, PII redaction, and word filters.
Model Evaluation – evaluate and compare FM performance on your specific tasks.
Fine-tuning – customize models with your data (continued pre-training or instruction fine-tuning).
Provisioned Throughput – reserve model capacity for consistent performance.
Data is not used to train base models – data privacy by default.

Amazon Q

Amazon Q Business – AI assistant for enterprise that connects to company data (S3, SharePoint, Confluence, Salesforce, etc.).
Amazon Q Developer – AI coding assistant for IDEs with code generation, debugging, transformation, and security scanning.
Amazon Q in QuickSight – natural language queries for BI dashboards.
Amazon Q in Connect – AI-powered agent assistance for contact centers.
Respects existing access controls and permissions – users only see answers from data they can access.

Amazon Titan Models

Titan Text – text generation, summarization, classification, Q&A.
Titan Embeddings – convert text to numerical vectors for search, RAG, and recommendations.
Titan Image Generator – generate and edit images from text prompts.
Titan Multimodal Embeddings – embeddings for both text and images.
All Titan models include built-in watermarking for generated content.

ML Platform

Amazon SageMaker

Fully managed ML platform for building, training, and deploying models at scale.
SageMaker Studio – integrated IDE for ML development (notebooks, experiments, pipelines).
Built-in algorithms – XGBoost, Linear Learner, K-Means, Image Classification, Object Detection, etc.
Training – managed training infrastructure with spot instances (up to 90% savings).
SageMaker Pipelines – CI/CD for ML (MLOps) with automated workflow orchestration.
Model Registry – catalog, version, and manage trained models.
SageMaker ML Lineage Tracking – automatically tracks end-to-end relationships from data → training → model → endpoint; supports cross-account lineage sharing via RAM for enterprise governance and compliance audits.
Endpoints – real-time inference, batch transform, async inference, serverless inference.
SageMaker Canvas – no-code ML for business analysts (visual interface).
SageMaker JumpStart – pre-trained foundation models and ML solutions ready to deploy.
SageMaker Clarify – detect bias in data/models and explain model predictions (SHAP values).
SageMaker Data Wrangler – visual data preparation and feature engineering.
SageMaker Feature Store – centralized repository for ML features (online + offline store).
SageMaker Ground Truth – data labeling with human annotators and active learning.
SageMaker Model Monitor – detect data drift, model quality drift, and bias drift in production.

AI Services (Pre-trained APIs)

Natural Language Processing (NLP)

Amazon Comprehend – NLP service for sentiment analysis, entity recognition, key phrases, language detection, PII detection, topic modeling.
Amazon Comprehend Medical – extract medical entities (conditions, medications, dosages) from clinical text.
Amazon Translate – neural machine translation for 75+ languages with custom terminology support.
Amazon Transcribe – speech-to-text (ASR) with speaker identification, custom vocabulary, PII redaction.
Amazon Transcribe Medical – medical speech-to-text for clinical documentation.

Vision

Amazon Rekognition – image and video analysis (object/scene detection, face analysis, text in images, content moderation, celebrity recognition, custom labels).
Amazon Textract – extract text, tables, and forms from documents (beyond basic OCR). Supports invoices, receipts, ID documents.

Speech

Amazon Polly – text-to-speech with neural and standard voices, SSML support, speech marks for lip-sync.
Amazon Lex – build conversational chatbots with automatic speech recognition (ASR) and natural language understanding (NLU). Powers Alexa technology.

Search & Recommendations

Amazon Kendra – intelligent enterprise search powered by ML with natural language queries and document ranking.
Amazon Personalize – real-time personalized recommendations (similar to Amazon.com) without ML expertise.

Forecasting & Other

Amazon Forecast – time-series forecasting using ML (demand planning, resource planning).
Amazon Fraud Detector – identify potentially fraudulent online activities using ML.
Amazon CodeWhisperer (now Amazon Q Developer) – AI-powered code suggestions in IDEs.

Data & Analytics for ML

AWS Glue – serverless ETL with built-in ML transforms (FindMatches for deduplication).
Amazon Athena ML – run ML inference from SQL queries using SageMaker models.
Amazon Redshift ML – create, train, and deploy ML models using SQL (uses SageMaker Autopilot).
Amazon Kinesis – real-time data streaming for ML inference on streaming data.
AWS Lake Formation – build secure data lakes as training data sources.

Responsible AI

Amazon Bedrock Guardrails – content filters, denied topics, PII redaction, hallucination reduction (grounding checks).
SageMaker Clarify – pre-training bias detection (CI, DPL, KL metrics) and post-training bias detection (DPPL, DI, AD).
SageMaker Model Monitor – continuous monitoring for data quality, model quality, bias drift, and feature attribution drift.
Model Explainability – SHAP values for feature importance and individual prediction explanations.
Amazon Titan watermarking – invisible watermarks in generated images for content authenticity.
AWS AI Service Cards – transparency documentation for AWS AI services.
Human-in-the-loop – Amazon Augmented AI (A2I) for human review of ML predictions.

Infrastructure for AI/ML

AWS Trainium – custom chip optimized for deep learning training (used in EC2 Trn1 instances).
AWS Inferentia – custom chip optimized for inference (used in EC2 Inf2 instances). Up to 40% better price-performance than GPU.
Amazon EC2 P5/P4d instances – NVIDIA GPU instances for training and inference.
Amazon EC2 G5/G6 instances – GPU instances for graphics and ML inference.
AWS Neuron SDK – compile and optimize models for Trainium and Inferentia chips.
Amazon S3 – primary storage for training data, model artifacts, and outputs.
FSx for Lustre – high-throughput file system for training data (integrates with S3).

Key Concepts for Certification

ML Workflow

Data Collection → Data Preparation (cleaning, feature engineering) → Model Training → Evaluation → Deployment → Monitoring

Model Types

Supervised Learning – labeled data (classification, regression). Examples: fraud detection, price prediction.
Unsupervised Learning – no labels (clustering, anomaly detection). Examples: customer segmentation, topic modeling.
Reinforcement Learning – agent learns through rewards (robotics, game playing, recommendations).
Foundation Models – large pre-trained models fine-tuned or used via prompting (GPT, Claude, Llama, Titan).

RAG (Retrieval Augmented Generation)

Combines a foundation model with external knowledge retrieval to provide accurate, up-to-date, and cited answers.
AWS implementation: Bedrock Knowledge Bases + vector database (OpenSearch Serverless, Aurora PostgreSQL, Pinecone).
Process: Query → Retrieve relevant chunks from knowledge base → Augment prompt with context → Generate answer.

Prompt Engineering

Zero-shot – ask directly without examples.
Few-shot – provide examples in the prompt.
Chain-of-thought – instruct the model to reason step by step.
System prompts – set behavior, persona, and constraints.

AWS Certification Exam Practice Questions

A company wants to build a chatbot that answers questions using their internal documentation stored in S3 and Confluence. The answers must cite sources. Which AWS service and feature combination is most appropriate?
1. Amazon Lex with Lambda
2. Amazon Bedrock with Knowledge Bases (RAG)
3. Amazon Kendra with Lex
4. Amazon Comprehend with Q Business
A team needs to detect if their ML model exhibits bias against a protected demographic group before deploying to production. Which service should they use?
1. Amazon Bedrock Guardrails
2. Amazon Rekognition
3. SageMaker Clarify
4. Amazon Comprehend
An application needs to extract structured data (tables, key-value pairs) from scanned invoices and receipts. Which service is purpose-built for this?
1. Amazon Rekognition
2. Amazon Comprehend
3. Amazon Textract
4. Amazon Bedrock
A generative AI application must prevent the model from discussing competitor products and must redact any PII in responses. Which feature provides these controls?
1. SageMaker Model Monitor
2. Amazon Bedrock Guardrails
3. Amazon Comprehend PII detection
4. AWS WAF
A company needs the lowest cost per inference for deploying a trained deep learning model at high throughput. Which AWS hardware is optimized for this?
1. EC2 P5 instances (NVIDIA GPU)
2. EC2 G5 instances
3. EC2 Inf2 instances (AWS Inferentia2)
4. EC2 Trn1 instances (AWS Trainium)

References

Amazon Bedrock User Guide

Amazon SageMaker Developer Guide

Amazon Q Business User Guide

AWS AI/ML Services Overview

AWS Transit Gateway vs VPC Peering vs PrivateLink

June 10, 2026 ~ Last updated on : June 12, 2026 ~ Kiro Agent

AWS Transit Gateway vs VPC Peering vs PrivateLink

AWS provides multiple VPC connectivity options, each designed for different network topologies and use cases.
VPC Peering is point-to-point, Transit Gateway is a hub for many-to-many connectivity, and PrivateLink is for private service access without network exposure.
Choice depends on number of VPCs, routing requirements, security posture, and cost.

Transit Gateway vs VPC Peering vs PrivateLink Comparison

Feature	VPC Peering	Transit Gateway	PrivateLink
Topology	Point-to-point (1:1)	Hub-and-spoke (many:many)	Service endpoint (consumer:provider)
Transitive Routing	No	Yes	No (service access only)
Scale	125 peering connections per VPC	5,000 attachments per TGW	Unlimited endpoints
Cross-Region	Yes (inter-region peering)	Yes (inter-region peering)	Yes (with inter-region support)
Cross-Account	Yes	Yes (RAM sharing)	Yes
CIDR Overlap	Not allowed	Not allowed (per attachment)	Allowed (uses ENI in consumer VPC)
Network Exposure	Full VPC network visible to peer	Full VPC network via route tables	Only the service endpoint exposed
Bandwidth	No limit (same as inter-AZ)	Up to 50 Gbps per attachment	Up to 100 Gbps per endpoint
Cost	Data transfer only (no hourly charge)	Hourly per attachment + data processing	Hourly per endpoint + data processing
Use Case	Few VPCs, simple connectivity	Many VPCs, centralized routing, VPN/DX aggregation	Expose service privately, SaaS connectivity, zero-trust
Route Management	Update route tables in both VPCs	Centralized route tables on TGW	No route table changes needed
Security	Security groups + NACLs	Security groups + NACLs + TGW route tables	Minimum exposure (only service port)

When to Choose Which

Choose VPC Peering – Small number of VPCs (2-5), simple point-to-point connectivity, lowest cost, no transitive routing needed.
Choose Transit Gateway – Many VPCs needing full mesh connectivity, centralized VPN/Direct Connect, shared services VPC, network segmentation with route tables.
Choose PrivateLink – Expose a specific service to other accounts/VPCs without full network access, overlapping CIDRs, SaaS service consumption, zero-trust architecture.
Combine TGW + PrivateLink – Transit Gateway for general connectivity between VPCs, PrivateLink for specific service access with minimal exposure.

AWS Certification Exam Practice Questions

A company has 50 VPCs that all need to communicate with a shared services VPC and a centralized Direct Connect connection. Which connectivity solution scales best?
1. VPC Peering (50 connections)
2. Transit Gateway
3. PrivateLink
4. VPN to each VPC
A SaaS provider needs to expose their service to customers in different AWS accounts without exposing their entire VPC network. The customer VPCs have overlapping CIDR ranges. Which solution works?
1. VPC Peering
2. Transit Gateway
3. PrivateLink
4. Site-to-Site VPN
Two VPCs in the same region need connectivity. The traffic volume is minimal, cost is a priority, and no transitive routing is needed. What is the most cost-effective solution?
1. Transit Gateway
2. VPC Peering
3. PrivateLink
4. AWS Cloud WAN
An organization needs VPC A to route traffic through VPC B to reach VPC C. Which service supports this transitive routing?
1. VPC Peering
2. PrivateLink
3. Transit Gateway
4. Internet Gateway

References

AWS Transit Gateway Guide

VPC Peering Guide

AWS PrivateLink Guide

AWS Route 53 Routing Policies Comparison

Route 53 Routing Policies Comparison

Simple Routing

Weighted Routing

Latency-based Routing

Failover Routing

Geolocation Routing

Geoproximity Routing

Multivalue Answer Routing

IP-based Routing

Combining Routing Policies

AWS Certification Exam Practice Questions

Related Posts

References

AWS DynamoDB vs DocumentDB vs Neptune

DynamoDB vs DocumentDB vs Neptune Comparison

Amazon DynamoDB

Amazon DocumentDB

Amazon Neptune

When to Choose Which

AWS Certification Exam Practice Questions

Related Posts

References

AWS Step Functions vs EventBridge

Step Functions vs EventBridge Comparison

AWS Step Functions

Amazon EventBridge

When to Choose Which

AWS Certification Exam Practice Questions

Related Posts

References

AWS CloudWatch vs CloudTrail vs Config

CloudWatch vs CloudTrail vs Config Comparison

Amazon CloudWatch

AWS CloudTrail

AWS Config

How They Work Together

When to Choose Which

AWS Certification Exam Practice Questions

Related Posts

References

AWS KMS vs CloudHSM vs Secrets Manager vs Parameter Store

KMS vs CloudHSM vs Secrets Manager vs Parameter Store Comparison

AWS KMS (Key Management Service)

AWS CloudHSM

AWS Secrets Manager

Systems Manager Parameter Store

When to Choose Which

AWS Certification Exam Practice Questions

Related Posts

References

AWS Container Services Cheat Sheet

Container Orchestration

Amazon ECS (Elastic Container Service)

Amazon EKS (Elastic Kubernetes Service)

Compute

AWS Fargate

Container Registry

Amazon ECR (Elastic Container Registry)

Networking & Service Discovery

AWS App Mesh

Amazon VPC Lattice

AWS Cloud Map

CI/CD & DevOps

Monitoring & Logging

AWS Certification Exam Practice Questions

Related Posts

References

AWS Storage Services Cheat Sheet

Object Storage

Amazon S3

Block Storage

Amazon EBS

EC2 Instance Store

File Storage

Amazon EFS

Amazon FSx

Hybrid & Edge Storage

AWS Storage Gateway

AWS Snow Family