What are the EBS volume types in AWS?

AWS offers 6 EBS volume types: gp3 and gp2 (General Purpose SSD), io2 and io1 (Provisioned IOPS SSD), st1 (Throughput Optimized HDD), and sc1 (Cold HDD).

What is the difference between gp2 and gp3?

gp3 provides a baseline of 3,000 IOPS and 125 MiB/s regardless of volume size, while gp2 performance scales with volume size at 3 IOPS/GiB. gp3 is also priced up to 20% lower per GB than gp2.
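To make the scaling difference concrete, here is a minimal sketch of the baseline IOPS arithmetic. The gp2 scaling ratio and the gp3 baseline come from the answer above; the 100 IOPS floor and 16,000 IOPS ceiling for gp2 are AWS's published gp2 limits, and the function names are purely illustrative.

```python
def gp2_baseline_iops(size_gib: int) -> int:
    """gp2 baseline: 3 IOPS per GiB, floored at 100 and capped at 16,000."""
    return max(100, min(16_000, 3 * size_gib))


def gp3_baseline_iops(size_gib: int) -> int:
    """gp3 baseline: 3,000 IOPS regardless of volume size."""
    return 3_000


# Compare the two baselines for a few volume sizes (GiB).
for size in (50, 500, 1_000, 6_000):
    print(size, gp2_baseline_iops(size), gp3_baseline_iops(size))
```

Under these numbers a gp2 volume needs to be 1,000 GiB before its baseline matches what gp3 offers at any size.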

Which EBS volume type is best for databases?

For databases requiring high IOPS, io2 Block Express is best (up to 256,000 IOPS). For general databases, gp3 with provisioned IOPS is cost-effective (up to 16,000 IOPS per volume before the September 2025 limit increase, and up to 80,000 IOPS after it).

Can EBS volumes be used as boot volumes?

Only SSD-based volumes (gp2, gp3, io1, io2) can be used as boot volumes. HDD volumes (st1, sc1) cannot be used as boot volumes.

AWS EBS Snapshots – Backup, Copy & Encryption

November 20, 2022 ~ Last updated on : July 4, 2026 ~ jayendrapatil ~ 38 Comments

EBS Snapshot

EBS provides the ability to create snapshots (backups) of any EBS volume and write a copy of the data in the volume to S3, where it is stored redundantly in multiple Availability Zones
Snapshots are incremental backups and store only the data that was changed from the time the last snapshot was taken.
Snapshots can be used to create new volumes, increase the size of the volumes or replicate data across Availability Zones.
Snapshot size can be smaller than the volume size because the data is compressed before being saved to S3.
Even though snapshots are saved incrementally, the snapshot deletion process is designed so that you need to retain only the most recent snapshot in order to restore the volume.
EBS Snapshots can be used to migrate or create EBS volumes in different AZs or regions.
Snapshot data is automatically replicated across all Availability Zones in the Region.

Multi-Volume Snapshots

Snapshots can be used to create a backup of critical workloads, such as a large database or a file system that spans across multiple EBS volumes.
Multi-volume snapshots help take exact point-in-time, data-coordinated, and crash-consistent snapshots across multiple EBS volumes attached to an EC2 instance.
It is no longer required to stop the instance or to coordinate between volumes to ensure crash consistency because snapshots are automatically taken across multiple EBS volumes.

EBS Snapshot creation

Snapshots can be created from EBS volumes periodically and are point-in-time snapshots.
Snapshots are incremental and only store the blocks on the device that changed since the last snapshot was taken
Snapshots occur asynchronously; the point-in-time snapshot is created immediately while it takes time to upload the modified blocks to S3. While it is completing, an in-progress snapshot is not affected by ongoing reads and writes to the volume.
Snapshots can be taken from in-use volumes. However, snapshots will only capture the data that was written to the EBS volumes at the time the snapshot command is issued, excluding any data that is cached by applications or the OS.
Recommended ways to create a Snapshot from an EBS volume are
- Pause all file writes to the volume
- Unmount the Volume -> Take Snapshot -> Remount the Volume
- Stop the instance – Take Snapshot (for root EBS volumes)
EBS volume created based on a snapshot
- begins as an exact replica of the original volume that was used to create the snapshot.
- replicated volume loads data in the background so that it can be used immediately.
- If data that hasn’t been loaded yet is accessed, the volume immediately downloads the requested data from S3 and then continues loading the rest of the volume’s data in the background.

EBS Snapshot Deletion

When a snapshot is deleted only the data exclusive to that snapshot is removed.
Deleting previous snapshots of a volume does not affect the ability to restore volumes from later snapshots of that volume.
Active snapshots contain all of the information needed to restore your data (from the time the snapshot was taken) to a new EBS volume.
Snapshot of the root device of an EBS volume used by a registered AMI can’t be deleted. AMI needs to be deregistered to be able to delete the snapshot.
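The deletion behaviour described above can be illustrated with a toy model. This is only a sketch of the idea (snapshots as tables of references to shared chunks), not how EBS is actually implemented; the class and method names are hypothetical.

```python
class SnapshotStore:
    """Toy model: each snapshot maps block index -> chunk id; chunks are shared."""

    def __init__(self):
        self.chunks = {}       # chunk id -> block data
        self.snapshots = {}    # snapshot name -> {block index: chunk id}
        self._last = None      # name of the most recent snapshot
        self._next_chunk = 0

    def take(self, name, volume):
        """Store only the blocks that changed since the previous snapshot."""
        prev = self.snapshots.get(self._last, {})
        table = {}
        for idx, data in volume.items():
            old = prev.get(idx)
            if old is not None and self.chunks[old] == data:
                table[idx] = old              # unchanged block: reuse stored chunk
            else:
                cid = self._next_chunk
                self._next_chunk += 1
                self.chunks[cid] = data       # changed block: store a new chunk
                table[idx] = cid
        self.snapshots[name] = table
        self._last = name

    def delete(self, name):
        """Remove only the chunks that no remaining snapshot references."""
        table = self.snapshots.pop(name)
        still_used = {cid for t in self.snapshots.values() for cid in t.values()}
        for cid in set(table.values()) - still_used:
            del self.chunks[cid]

    def restore(self, name):
        """Rebuild the full volume contents from a single snapshot."""
        return {idx: self.chunks[cid] for idx, cid in self.snapshots[name].items()}


store = SnapshotStore()
volume = {0: "a", 1: "b", 2: "c"}
store.take("snap-1", volume)
volume[1] = "B"                    # one block changes between snapshots
store.take("snap-2", volume)
store.delete("snap-1")             # only the old copy of block 1 is freed
print(store.restore("snap-2"))
```

In this model the later snapshot still restores the whole volume after the earlier one is deleted, because the unchanged blocks it references are kept.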

EBS Snapshot Copy

Snapshots are constrained to the region in which they are created and can be used to launch EBS volumes within the same region only
Snapshots can be copied across regions to make it easier to leverage multiple regions for geographical expansion, data center migration, and disaster recovery
Snapshots are copied with S3 server-side encryption (256-bit Advanced Encryption Standard) to encrypt the data and the snapshot copy receives a snapshot ID that’s different from the original snapshot’s ID.
User-defined tags are not copied from the source to the new snapshot.
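The first snapshot copy to another region is always a full copy; subsequent copies of snapshots of the same volume to that region are incremental, unless the encryption status or key changes (see below).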
When a snapshot is copied,
- it can be encrypted if currently unencrypted or
- can be encrypted using a different encryption key. Changing the encryption status of a snapshot or using a non-default EBS KMS key during a copy operation always results in a full copy (not incremental)
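The full-versus-incremental rules above can be written down as a small decision function. This is a sketch of the rules exactly as stated in this section, with hypothetical parameter names; it does not call any AWS API.

```python
def copy_is_full(first_copy_to_destination: bool,
                 encryption_status_changes: bool,
                 uses_different_key: bool) -> bool:
    """Return True when a snapshot copy is expected to be a full copy.

    A copy is full when it is the first copy to the destination region,
    when it turns an unencrypted snapshot into an encrypted one, or when
    it re-encrypts with a different (non-default) KMS key.
    """
    return (first_copy_to_destination
            or encryption_status_changes
            or uses_different_key)


# A later copy to the same region with unchanged encryption stays incremental.
print(copy_is_full(False, False, False))
# Encrypting during the copy forces a full copy even if earlier copies exist.
print(copy_is_full(False, True, False))
```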

Time-based Snapshot Copy

Time-based Copy (launched Nov 2024) allows specifying a desired completion duration (15 minutes to 48 hours) when copying a snapshot within or between AWS Regions and/or accounts.
Helps meet time-based compliance and business requirements such as Recovery Point Objectives (RPOs) for disaster recovery.
Duration can range from 15 minutes to 48 hours in 15-minute increments, specified on a per-copy basis.
Maximum per-snapshot throughput is 500 MiB/second; default per-account limit is 2000 MiB/second between each source and destination pair.
If the copy cannot be completed within the specified duration, an EventBridge copyMissedCompletionDuration event is sent.
Available in all AWS Regions.
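As a rough planning aid, the limits above can be turned into a quick feasibility check. The duration range, the 15-minute increments and the 500 MiB/second per-snapshot ceiling come from this section; treating that ceiling as a hard bound on the amount of data to copy is a simplifying assumption, and the function names are illustrative.

```python
MIN_MINUTES = 15
MAX_MINUTES = 48 * 60
MAX_SNAPSHOT_MIB_PER_S = 500


def is_valid_duration(minutes: int) -> bool:
    """15 minutes to 48 hours, in 15-minute increments."""
    return MIN_MINUTES <= minutes <= MAX_MINUTES and minutes % 15 == 0


def required_throughput_mib_per_s(data_gib: float, minutes: int) -> float:
    """Average throughput needed to move data_gib within the duration."""
    return data_gib * 1024 / (minutes * 60)


def is_feasible(data_gib: float, minutes: int) -> bool:
    """Valid duration and within the per-snapshot throughput ceiling."""
    return (is_valid_duration(minutes)
            and required_throughput_mib_per_s(data_gib, minutes) <= MAX_SNAPSHOT_MIB_PER_S)


# 1 TiB of data to copy within a 2-hour completion duration.
print(round(required_throughput_mib_per_s(1024, 120), 1), is_feasible(1024, 120))
```

Note that the data actually copied may be less than the volume size, since copies can be incremental.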

EBS Snapshot Sharing

Snapshots can be shared by making them public or with specific AWS accounts by modifying the access permissions of the snapshots
Encrypted snapshots cannot be made available publicly.
Encrypted snapshots can be shared with specific AWS accounts by sharing the customer managed KMS key used to encrypt it.
Cross-account permissions may be applied to a KMS key either when it is created or at a later time.
Users, with access to snapshots, can copy the snapshot and create their own EBS volumes based on the snapshot while the original snapshot remains unaffected
AWS prevents you from sharing snapshots that were encrypted with the default AWS managed key (aws/ebs). Only snapshots encrypted with customer managed KMS keys can be shared.
To share an encrypted snapshot cross-region, copy the snapshot to the destination region first and then share the copy.
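The sharing rules in this section reduce to a short check. The sketch below only encodes the rules as listed here; the parameter names are hypothetical and no AWS call is made.

```python
def can_share(encrypted: bool, customer_managed_key: bool, make_public: bool) -> bool:
    """Apply the snapshot sharing rules described in this section."""
    if not encrypted:
        return True                  # unencrypted: public or account-level sharing
    if make_public:
        return False                 # encrypted snapshots cannot be made public
    return customer_managed_key      # the default aws/ebs key cannot be shared


print(can_share(encrypted=True, customer_managed_key=True, make_public=False))
print(can_share(encrypted=True, customer_managed_key=False, make_public=False))
```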

EBS Snapshot Encryption

EBS snapshots fully support EBS encryption.
Snapshots of encrypted volumes are automatically encrypted
Volumes created from encrypted snapshots are automatically encrypted
All data in flight between the instance and the volume is encrypted
Volumes created from an unencrypted snapshot that you own or have access to can be encrypted on the fly.
Unencrypted snapshots can be encrypted during the copy process.
Encrypted snapshots that you own or have access to, can be encrypted with a different key during the copy process.
First snapshot of an encrypted volume that has been created from an unencrypted snapshot is always a full snapshot.
First snapshot of a re-encrypted volume, which has a different KMS key compared to the source snapshot, is always a full snapshot.

EBS Fast Snapshot Restore (FSR)

Fast Snapshot Restore (FSR) eliminates the need for the traditional initialization process when creating a volume from a snapshot.
Volumes created from FSR-enabled snapshots instantly deliver all of their provisioned performance without the latency penalty of lazy loading blocks from S3.
FSR is enabled on selected snapshots in specific Availability Zones.
FSR is disabled for a snapshot by default. When enabled or disabled, the changes apply to your account only.
FSR is useful for workloads requiring rapid provisioning such as VDI (Virtual Desktop Infrastructure), backup & restore, test/dev volume copies, and booting from custom AMIs.
FSR supports io2 Block Express volumes.
FSR is not supported with AWS Outposts, Local Zones, and Wavelength Zones.
FSR-enabled snapshots shared with your account can be used with FSR enabled in your account independently.
Additional charges apply for each minute that FSR is enabled on a snapshot in a particular AZ.

EBS Snapshot Archive

EBS Snapshots Archive is a low-cost storage tier for rarely accessed snapshots that provides up to 75% lower storage costs.
Designed for snapshots retained for 90 days or longer that do not require frequent or fast retrieval.
When a snapshot is archived, the incremental snapshot is converted to a full snapshot and moved to the archive tier.
Archived snapshots have a minimum retention period of 90 days.
To use an archived snapshot, it must first be restored to the standard tier, after which it can be used like any other snapshot.
Restoring from archive takes 24-72 hours depending on the snapshot size.
EBS Direct APIs cannot be used with archived snapshots.
Snapshot locks can be applied to snapshots that have already been archived.
Amazon Data Lifecycle Manager can automatically archive snapshots based on policies.
AWS Backup supports EBS Snapshots Archive in backup policies.
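Because archiving converts an incremental snapshot into a full one, archiving is not automatically cheaper. A back-of-the-envelope comparison, assuming the discount is exactly 75% and ignoring retrieval charges and the 90-day minimum (both simplifications, with illustrative names and a placeholder price):

```python
ARCHIVE_DISCOUNT = 0.75  # "up to 75%" treated as exactly 75% (assumption)


def archive_saves_money(full_size_gib: float,
                        incremental_size_gib: float,
                        standard_price_per_gib: float = 1.0) -> bool:
    """Compare keeping the incremental snapshot in the standard tier
    against storing the converted full snapshot in the archive tier."""
    standard_cost = incremental_size_gib * standard_price_per_gib
    archive_cost = full_size_gib * standard_price_per_gib * (1 - ARCHIVE_DISCOUNT)
    return archive_cost < standard_cost


# 100 GiB of volume data: a 10 GiB incremental versus a 50 GiB incremental.
print(archive_saves_money(100, 10))
print(archive_saves_money(100, 50))
```

Under these assumptions archiving pays off only when the incremental snapshot holds more than a quarter of the full volume data.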

EBS Snapshot Lock

EBS Snapshot Lock (launched Nov 2023) protects snapshots from inadvertent or malicious deletions, including ransomware attacks.
Locked snapshots cannot be deleted until the lock expires or is released.
Lock duration can range from one day to approximately 100 years.
Two lock modes are available:
- Governance mode – Protects from deletion by all users. With proper IAM permissions, the lock duration can be extended/shortened, the lock deleted, or mode changed to Compliance.
- Compliance mode – Protects from deletion by all users including the root user. After a cooling-off period (up to 72 hours), neither the snapshot nor the lock can be deleted until the lock expires. Lock duration can only be extended, not shortened.
Snapshots in either mode can still be shared, copied, or archived.
Supports WORM (Write Once Read Many) compliance requirements.
No additional charges for using Snapshot Lock; standard snapshot storage rates apply.
Available in all commercial AWS Regions.
If using customer managed KMS keys for encryption, ensure the key remains valid for the lifetime of the locked snapshot.
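The difference between the two modes can be sketched as a small state model. This encodes the rules as summarised above, under the assumption that a compliance-mode lock can still be removed by an authorised user during its cooling-off period; the class and method names are hypothetical and this is not the EC2 API.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class SnapshotLock:
    mode: str                                     # "governance" or "compliance"
    expires_at: datetime
    cooling_off_ends: Optional[datetime] = None   # compliance mode only

    def snapshot_deletable(self, now: datetime) -> bool:
        """A locked snapshot can be deleted only once the lock has expired."""
        return now >= self.expires_at

    def can_remove_lock(self, now: datetime, has_iam_permission: bool) -> bool:
        """Whether a user can remove the lock before it expires."""
        if not has_iam_permission:
            return False
        if self.mode == "governance":
            return True
        # Compliance mode: only while the cooling-off period is still running.
        return self.cooling_off_ends is not None and now < self.cooling_off_ends


start = datetime(2025, 1, 1)
lock = SnapshotLock("compliance", start + timedelta(days=365),
                    cooling_off_ends=start + timedelta(hours=72))
print(lock.can_remove_lock(start + timedelta(hours=1), True))
print(lock.can_remove_lock(start + timedelta(hours=73), True))
```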

Recycle Bin for EBS Snapshots

Recycle Bin enables recovery of accidentally deleted EBS snapshots and EBS-backed AMIs.
Retention rules specify a retention period (1 day to 1 year) during which deleted snapshots are retained before permanent deletion.
A recovered snapshot retains all its attributes including tags, permissions, and encryption status.
Recovered snapshots can be used immediately for creating volumes.
Rule Lock can be applied to Recycle Bin retention rules to prevent them from being modified or deleted, providing additional protection.
Recycle Bin also supports EBS-backed AMIs and EBS Volumes (added 2025).

EBS Direct APIs

EBS Direct APIs allow creating EBS snapshots, writing data directly to snapshots, reading snapshot data, and identifying differences between two snapshots.
Key operations include:
- ListSnapshotBlocks – Returns block indexes and block tokens of blocks in a snapshot.
- ListChangedBlocks – Returns blocks that are different between two snapshots of the same volume.
- GetSnapshotBlock – Returns data in a block for a given snapshot.
- StartSnapshot – Creates a new snapshot (can be used to create snapshots from on-premises data).
- PutSnapshotBlock – Adds data to a started snapshot in the form of individual blocks.
- CompleteSnapshot – Completes a snapshot after all blocks have been written.
Enables backup of on-premises data directly into EBS snapshots without needing an EC2 instance.
Useful for incremental backup solutions and disaster recovery from on-premises to AWS.
Does not support public snapshots or archived snapshots.
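The write path (StartSnapshot, then PutSnapshotBlock per block, then CompleteSnapshot) mostly comes down to cutting the source data into blocks and checksumming each one. The sketch below prepares those per-block request parameters without calling AWS. The 512 KiB block size, the base64-encoded SHA-256 checksum and the parameter names reflect my understanding of the PutSnapshotBlock request and should be verified against the API reference; skipping all-zero blocks is an optional optimisation I am assuming, and the helper name is hypothetical.

```python
import base64
import hashlib

BLOCK_SIZE = 512 * 1024  # assumed fixed block size for EBS direct APIs


def snapshot_block_requests(data: bytes):
    """Yield one parameter dict per non-empty block of `data`."""
    for offset in range(0, len(data), BLOCK_SIZE):
        # Pad the final block with zeros so every block has the full length.
        block = data[offset:offset + BLOCK_SIZE].ljust(BLOCK_SIZE, b"\0")
        if not any(block):
            continue  # all-zero block: nothing to upload (assumed optimisation)
        yield {
            "BlockIndex": offset // BLOCK_SIZE,
            "BlockData": block,
            "DataLength": BLOCK_SIZE,
            "Checksum": base64.b64encode(hashlib.sha256(block).digest()).decode(),
            "ChecksumAlgorithm": "SHA256",
        }


# Three blocks of source data; the middle one is empty and is skipped.
source = b"x" * BLOCK_SIZE + b"\0" * BLOCK_SIZE + b"y"
requests = list(snapshot_block_requests(source))
print([r["BlockIndex"] for r in requests])
```

Each dictionary would be passed, together with the snapshot ID returned by StartSnapshot, to a PutSnapshotBlock call, and the number of uploaded blocks reported to CompleteSnapshot.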

Local Snapshots

Local Snapshots allow creating and storing snapshots in AWS Local Zones and Dedicated Local Zones.
By default, snapshots of EBS volumes in a Local Zone are stored in S3 in the parent Region.
With Local Snapshots, backups can be stored within the same geographical boundary as the EBS volumes, helping meet data residency and data isolation requirements.
Snapshot copy is supported for Local Zones, allowing copies to be sent to the Region or another Local Zone.
EBS Direct APIs do not support local snapshots on Outposts.

EBS Snapshot Lifecycle Automation

Amazon Data Lifecycle Manager (DLM) can be used to automate the creation, retention, and deletion of snapshots taken to back up the EBS volumes.
Automating snapshot management helps you to:
- Protect valuable data by enforcing a regular backup schedule.
- Retain backups as required by auditors or internal compliance.
- Reduce storage costs by deleting outdated backups.
- Automatically archive snapshots to the Archive tier based on policies.
AWS Backup provides a centralized, policy-based approach to manage EBS snapshot backups across AWS accounts and regions.
AWS Backup supports EBS Snapshots Archive in backup policies for cost-optimized long-term retention.
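The kind of retention rule that a lifecycle policy automates can be sketched in a few lines. This is an illustration of age-based retention only, not the Data Lifecycle Manager API; the function name and the snapshot IDs are made up.

```python
from datetime import datetime, timedelta


def expired_snapshots(created_at: dict, now: datetime, retain_days: int) -> list:
    """Return the IDs of snapshots older than the retention window."""
    cutoff = now - timedelta(days=retain_days)
    return sorted(sid for sid, created in created_at.items() if created < cutoff)


snapshots = {
    "snap-a": datetime(2025, 6, 1),
    "snap-b": datetime(2025, 6, 20),
    "snap-c": datetime(2025, 6, 29),
}
# With a 14-day retention window evaluated on June 30, only the oldest one expires.
print(expired_snapshots(snapshots, datetime(2025, 6, 30), retain_days=14))
```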

EBS Snapshot Resource-Level Permissions

Enhanced resource-level permissions (2025) allow specifying additional resource-level authorization in IAM policies for source snapshots when creating volumes (CreateVolume) or copying snapshots (CopySnapshot).
Enables fine-grained access control over which snapshots can be used as sources for volume creation or copy operations.

AWS Certification Exam Practice Questions

Questions are collected from the Internet and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed, the question might not be updated.

Open to further feedback, discussion and correction.

An existing application stores sensitive information on a non-boot Amazon EBS data volume attached to an Amazon Elastic Compute Cloud instance. Which of the following approaches would protect the sensitive data on an Amazon EBS volume?
1. Upload your customer keys to AWS CloudHSM. Associate the Amazon EBS volume with AWS CloudHSM. Remount the Amazon EBS volume.
2. Create and mount a new, encrypted Amazon EBS volume. Move the data to the new volume. Delete the old Amazon EBS volume.
3. Unmount the EBS volume. Toggle the encryption attribute to True. Re-mount the Amazon EBS volume.
4. Snapshot the current Amazon EBS volume. Restore the snapshot to a new, encrypted Amazon EBS volume. Mount the Amazon EBS volume (Need to create a snapshot, create an encrypted copy of snapshot and then create an EBS volume and mount it)
Is it possible to access your EBS snapshots?
1. Yes, through the Amazon S3 APIs.
2. Yes, through the Amazon EC2 APIs
3. No, EBS snapshots cannot be accessed; they can only be used to create a new EBS volume.
4. EBS doesn’t provide snapshots.
Which of the following approaches provides the lowest cost for Amazon Elastic Block Store snapshots while giving you the ability to fully restore data?
1. Maintain two snapshots: the original snapshot and the latest incremental snapshot
2. Maintain a volume snapshot; subsequent snapshots will overwrite one another
3. Maintain a single snapshot; the latest snapshot is both incremental and complete
4. Maintain the most current snapshot, archive the original and incremental to Amazon Glacier.
Which procedure for backing up a relational database on EC2 that is using a set of RAIDed EBS volumes for storage minimizes the time during which the database cannot be written to and results in a consistent backup?
1. (1) Detach EBS volumes, (2) Start EBS snapshot of volumes, (3) Re-attach EBS volumes
2. (1) Stop the EC2 Instance, (2) Snapshot the EBS volumes
3. (1) Suspend disk I/O, (2) Create an image of the EC2 Instance, (3) Resume disk I/O
4. (1) Suspend disk I/O, (2) Start EBS snapshot of volumes, (3) Resume disk I/O
5. (1) Suspend disk I/O, (2) Start EBS snapshot of volumes, (3) Wait for snapshots to complete, (4) Resume disk I/O
How can an EBS volume that is currently attached to an EC2 instance be migrated from one Availability Zone to another?
1. Detach the volume and attach it to another EC2 instance in the other AZ.
2. Simply create a new volume in the other AZ and specify the original volume as the source.
3. Create a snapshot of the volume, and create a new volume from the snapshot in the other AZ
4. Detach the volume, then use the ec2-migrate-volume command to move it to another AZ.
How are the EBS snapshots saved on Amazon S3?
1. Exponentially
2. Incrementally
3. EBS snapshots are not stored in the Amazon S3
4. Decrementally
EBS Snapshots occur _____
1. Asynchronously
2. Synchronously
3. Weekly
What will be the status of the snapshot until the snapshot is complete?
1. Running
2. Working
3. Progressing
4. Pending
Before I delete an EBS volume, what can I do if I want to recreate the volume later?
1. Create a copy of the EBS volume (not a snapshot)
2. Create and Store a snapshot of the volume
3. Download the content to an EC2 instance
4. Back up the data in to a physical disk
Which of the following are true regarding encrypted Amazon Elastic Block Store (EBS) volumes? Choose 2 answers
1. Supported on all Amazon EBS volume types
2. Snapshots are automatically encrypted
3. Available to all instance types
4. Existing volumes can be encrypted
5. Shared volumes can be encrypted
Amazon EBS snapshots have which of the following two characteristics? (Choose 2.)
1. EBS snapshots only save incremental changes from snapshot to snapshot
2. EBS snapshots can be created in real-time without stopping an EC2 instance (the snapshot can be taken real time however it will not be consistent and the recommended way is to stop or freeze the IO)
3. EBS snapshots can only be restored to an EBS volume of the same size or smaller (EBS volumes restored from snapshots need to be of the same size or larger)
4. EBS snapshots can only be restored and mounted to an instance in the same Availability Zone as the original EBS volume (Snapshots are specific to Region and can be used to create a volume in any AZ and does not depend on the original EBS volume AZ)
A user is planning to schedule a backup for an EBS volume. The user wants security of the snapshot data. How can the user achieve data encryption with a snapshot?
1. Use encrypted EBS volumes so that the snapshot will be encrypted by AWS (Refer link)
2. While creating a snapshot select the snapshot with encryption
3. By default the snapshot is encrypted by AWS
4. Enable server side encryption for the snapshot using S3
A sys admin is trying to understand EBS snapshots. Which of the below mentioned statements will not be useful to the admin to understand the concepts about a snapshot?
1. Snapshot is synchronous
2. It is recommended to stop the instance before taking a snapshot for consistent data
3. Snapshot is incremental
4. Snapshot captures the data that has been written to the hard disk when the snapshot command was executed
When creation of an EBS snapshot is initiated but not completed, the EBS volume
1. Cannot be detached or attached to an EC2 instance until the snapshot completes
2. Can be used in read-only mode while the snapshot is in progress
3. Can be used while the snapshot is in progress
4. Cannot be used until the snapshot completes
You have a server with a 500GB Amazon EBS data volume. The volume is 80% full. You need to back up the volume at regular intervals and be able to re-create the volume in a new Availability Zone in the shortest time possible. All applications using the volume can be paused for a period of a few minutes with no discernible user impact. Which of the following backup methods will best fulfill your requirements?
1. Take periodic snapshots of the EBS volume
2. Use a third-party Incremental backup application to back up to Amazon Glacier
3. Periodically back up all data to a single compressed archive and archive to Amazon S3 using a parallelized multi-part upload
4. Create another EBS volume in the second Availability Zone, attach it to the Amazon EC2 instance, and use a disk manager to mirror the two disks
A user is creating a snapshot of an EBS volume. Which of the below statements is incorrect in relation to the creation of an EBS snapshot?
1. It's incremental
2. It can be used to launch a new instance
3. It is stored in the same AZ as the volume (stored in the same region)
4. It is a point in time backup of the EBS volume
A user has created a snapshot of an EBS volume. Which of the below mentioned usage cases is not possible with respect to a snapshot?
1. Mirroring the volume from one AZ to another AZ
2. Launch an instance
3. Decrease the volume size
4. Increase the size of the volume
What is true of the way that encryption works with EBS?
1. Snapshotting an encrypted volume makes an encrypted snapshot; restoring an encrypted snapshot creates an encrypted volume when specified / requested.
2. Snapshotting an encrypted volume makes an encrypted snapshot when specified / requested; restoring an encrypted snapshot creates an encrypted volume when specified / requested.
3. Snapshotting an encrypted volume makes an encrypted snapshot; restoring an encrypted snapshot always creates an encrypted volume.
4. Snapshotting an encrypted volume makes an encrypted snapshot when specified / requested; restoring an encrypted snapshot always creates an encrypted volume.
Why are more frequent snapshots of EBS Volumes faster?
1. Blocks in EBS Volumes are allocated lazily, since while logically separated from other EBS Volumes, Volumes often share the same physical hardware. Snapshotting the first time forces full block range allocation, so the second snapshot doesn’t need to perform the allocation phase and is faster.
2. The snapshots are incremental so that only the blocks on the device that have changed after your last snapshot are saved in the new snapshot.
3. AWS provisions more disk throughput for burst capacity during snapshots if the drive has been pre-warmed by snapshotting and reading all blocks.
4. The drive is pre-warmed, so block access is more rapid for volumes when every block on the device has already been read at least one time.
Which is not a restriction on AWS EBS Snapshots?
1. Snapshots which are shared cannot be used as a basis for other snapshots (Snapshots shared with other users are usable in full by the recipient, including but not limited to the ability to base modified volumes and snapshots)
2. You cannot share a snapshot containing an AWS Access Key ID or AWS Secret Access Key
3. You cannot share snapshots encrypted with the default AWS managed key (NOTE: Encrypted snapshots CAN be shared with specific accounts if encrypted with a customer managed KMS key. Only snapshots encrypted with the default aws/ebs key cannot be shared.)
4. Snapshot restorations are restricted to the region in which the snapshots are created
There is a very serious outage at AWS. EC2 is not affected, but your EC2 instance deployment scripts stopped working in the region with the outage. What might be the issue?
1. The AWS Console is down, so your CLI commands do not work.
2. S3 is unavailable, so you can’t create EBS volumes from a snapshot you use to deploy new volumes. (EBS volume snapshots are stored in S3. If S3 is unavailable, snapshots are unavailable)
3. AWS turns off the DeployCode API call when there are major outages, to protect from system floods.
4. None of the other answers make sense. If EC2 is not affected, it must be some other issue.

New Practice Questions – EBS Snapshot Features (2021-2025)

A company needs to ensure that critical EBS snapshots cannot be deleted by any user, including the root user, for a period of 5 years to meet regulatory compliance. Which feature should be used?
1. AWS Backup Vault Lock
2. Recycle Bin with 5-year retention
3. EBS Snapshot Lock in Compliance mode
4. EBS Snapshot Lock in Governance mode
A company wants to reduce storage costs for EBS snapshots that are retained for compliance but rarely accessed. The snapshots need to be kept for at least 2 years. Which approach provides the lowest cost?
1. Keep snapshots in standard tier and use lifecycle policies to delete after 2 years
2. Move snapshots to EBS Snapshots Archive tier
3. Copy snapshots to S3 Glacier
4. Use Recycle Bin with a 2-year retention period
An organization accidentally deleted an EBS snapshot that was needed for disaster recovery. Which AWS feature could have prevented permanent data loss?
1. EBS Snapshot Lock
2. Multi-volume snapshots
3. Recycle Bin for EBS Snapshots
4. Fast Snapshot Restore
A company needs to copy an EBS snapshot to another region and must ensure the copy completes within 2 hours to meet their RPO requirements. Which feature should they use?
1. Fast Snapshot Restore
2. Time-based Snapshot Copy with a 2-hour completion duration
3. EBS Direct APIs
4. Standard cross-region copy with CloudWatch monitoring
A company runs a VDI environment and needs volumes created from snapshots to deliver full provisioned performance immediately without any initialization penalty. Which feature should be enabled?
1. EBS Snapshot Archive
2. Time-based Snapshot Copy
3. EBS Direct APIs
4. Fast Snapshot Restore (FSR)
A backup solution needs to create EBS snapshots directly from on-premises block storage data without using an EC2 instance as an intermediary. Which approach enables this?
1. AWS Storage Gateway
2. AWS DataSync
3. EBS Direct APIs (StartSnapshot, PutSnapshotBlock, CompleteSnapshot)
4. S3 Transfer Acceleration with snapshot import
What happens when an EBS snapshot is archived? (Choose 2)
1. The incremental snapshot is converted to a full snapshot
2. The snapshot remains incremental in the archive tier
3. The snapshot is moved from the standard tier to the archive tier
4. The snapshot is automatically deleted after 90 days
5. The snapshot can still be used directly to create volumes without restoration

AWS EBS Performance

November 18, 2022 ~ Last updated on : July 4, 2026 ~ jayendrapatil ~ 9 Comments

AWS EBS Performance Tips

EBS performance depends on several factors, including I/O characteristics and instance and volume configuration, and can be improved using Provisioned IOPS (io2 Block Express), EBS-Optimized instances, proper volume type selection, and RAID configuration.

📢 Key Updates (2025-2026)

gp3 volumes enhanced (Sept 2025) – Now support up to 64 TiB size (4x increase), 80,000 IOPS (5x increase), and 2,000 MiB/s throughput (2x increase).
io2 Block Express – Delivers up to 256,000 IOPS, 4,000 MB/s throughput, 64 TiB capacity with sub-millisecond latency and 99.999% durability.
Instance Bandwidth Weighting – New feature allows shifting up to 25% of network bandwidth to EBS for I/O-intensive workloads.
io1 → io2 migration recommended – AWS recommends upgrading io1 to io2 for better performance and durability at the same cost.
gp2 → gp3 migration recommended – gp3 offers 20% lower cost with better baseline performance (3,000 IOPS, 125 MiB/s).
RAID 0 less necessary – With gp3’s increased limits (80,000 IOPS per volume), many workloads no longer require multi-volume striping.

EBS Volume Type Selection for Performance

Selecting the right volume type is the most impactful decision for EBS performance.
gp3 (General Purpose SSD) – Recommended default for most workloads.
- Baseline: 3,000 IOPS and 125 MiB/s at any volume size (no burst credits needed)
- Max: 80,000 IOPS and 2,000 MiB/s throughput
- Size: up to 64 TiB
- Performance is provisioned independently of storage capacity
- 20% lower cost per GB than gp2
io2 Block Express (Provisioned IOPS SSD) – For mission-critical, latency-sensitive workloads.
- Max: 256,000 IOPS, 4,000 MB/s throughput, 64 TiB capacity
- Sub-millisecond latency (avg. under 500 microseconds)
- 99.999% durability (vs 99.8-99.9% for gp3)
- Up to 1,000 IOPS per GiB ratio
- Multi-Attach support (up to 16 instances simultaneously)
- Available on all Nitro-based EC2 instances
gp2 (Previous Generation General Purpose SSD) – Still available but migration to gp3 is recommended.
- Max: 16,000 IOPS, 250 MB/s throughput, 16 TiB
- IOPS scales with volume size at 3 IOPS/GiB
- Burst credit model for volumes under 1 TiB
io1 (Previous Generation Provisioned IOPS SSD) – Migration to io2 Block Express is recommended.
- Max: 64,000 IOPS, 1,000 MB/s throughput, 16 TiB
- 50 IOPS per GiB ratio
- 99.8-99.9% durability
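The selection guidance above can be condensed into a rough first-pass picker. The limits are the per-volume maximums listed in this section; treating MiB/s and MB/s as interchangeable and looking only at IOPS, throughput and durability are simplifications, and the function name is illustrative.

```python
GP3_MAX_IOPS, GP3_MAX_THROUGHPUT = 80_000, 2_000
IO2_MAX_IOPS, IO2_MAX_THROUGHPUT = 256_000, 4_000


def suggest_volume_type(iops: int, throughput: int,
                        needs_five_nines_durability: bool = False) -> str:
    """Suggest a starting volume type for a single-volume workload."""
    if (not needs_five_nines_durability
            and iops <= GP3_MAX_IOPS and throughput <= GP3_MAX_THROUGHPUT):
        return "gp3"
    if iops <= IO2_MAX_IOPS and throughput <= IO2_MAX_THROUGHPUT:
        return "io2 Block Express"
    return "RAID 0 across multiple volumes"


print(suggest_volume_type(10_000, 500))
print(suggest_volume_type(100_000, 500))
print(suggest_volume_type(10_000, 500, needs_five_nines_durability=True))
```

Cost, latency targets and Multi-Attach requirements would also feed into a real decision.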

EBS-Optimized or 10 Gigabit Network Instances

An EBS-Optimized instance uses an optimized configuration stack and provides additional, dedicated capacity for EBS I/O.
Optimization provides the best performance for the EBS volumes by minimizing contention between EBS I/O and other traffic from an instance.
EBS-Optimized instances deliver dedicated throughput to EBS depending on the instance type used.
All current-generation EC2 instance types are EBS-optimized by default at no additional cost.
Some previous-generation instance types support EBS-optimization as an optional feature with an additional hourly fee.
When attached to an EBS–optimized instance,
- General Purpose (gp3) volumes are designed to deliver within 10% of their provisioned performance 99% of the time in a given year.
- Provisioned IOPS (io2 Block Express) volumes are designed to deliver within 10% of their provisioned performance 99.9% of the time in a given year.
The maximum EBS throughput varies by instance type – for example, latest generation instances like C8gd/M8gd/R8gd provide up to 40 Gbps of EBS bandwidth.

Instance Bandwidth Weighting

EC2 instances on select Nitro-based instance types support configurable bandwidth weighting between EBS and VPC networking.
Using the ebs-1 bandwidth weighting option increases EBS bandwidth by up to 25%, which reduces VPC network bandwidth by the same amount.
This is beneficial for I/O-intensive workloads that require higher EBS throughput but have lower network requirements.
The total available baseline bandwidth for the instance remains the same; it only shifts the allocation.
Network PPS and EBS IOPS specifications are unaffected by bandwidth weighting changes.
Can be configured at launch time using launch templates or modified on running instances.
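A small worked example of the shift, assuming the 25% is measured against the EBS baseline and the same absolute amount is taken from the network baseline (my reading of the description above; the instance figures are made up):

```python
def apply_ebs_weighting(ebs_gbps: float, network_gbps: float,
                        shift_fraction: float = 0.25):
    """Move shift_fraction of the EBS baseline from networking to EBS."""
    shift = ebs_gbps * shift_fraction
    return ebs_gbps + shift, network_gbps - shift


# Hypothetical instance with 10 Gbps EBS and 12.5 Gbps network baseline.
ebs, net = apply_ebs_weighting(10.0, 12.5)
print(ebs, net, ebs + net)
```

The total stays the same; only the split between EBS and VPC networking changes.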

EBS Volume Initialization – Pre-warming

Empty EBS volumes receive their maximum performance the moment that they are available and DO NOT require initialization (pre-warming).
Previously, EBS volumes had to be pre-warmed before use to reach maximum performance. Pre-warming was done by writing zeros to the entire volume (for new volumes) or by reading the entire volume (for volumes created from snapshots).
Storage blocks on volumes that were restored from snapshots must be initialized (pulled down from S3 and written to the volume) before the block can be accessed.
This preliminary action takes time and can cause a significant increase in the latency of an I/O operation the first time each block is accessed.
To avoid this initial performance hit in a production environment, the following options can be used:
- Force the immediate initialization of the entire volume by using the dd or fio utilities to read from all of the blocks on a volume.
- Enable Fast Snapshot Restore (FSR) on a snapshot to ensure that the EBS volumes created from it are fully-initialized at creation and instantly deliver all of their provisioned performance.
Fast Snapshot Restore (FSR) considerations:
- FSR eliminates the latency of I/O operations on first access of snapshot-restored volumes.
- Available in all commercial AWS regions (expanded to 6 additional regions in August 2024).
- Not supported with AWS Outposts, Local Zones, and Wavelength Zones.
- FSR is charged per AZ per hour per snapshot enabled, so cost should be considered.
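The "read every block" approach used by dd or fio can be sketched in a few lines. This is a minimal illustration rather than a replacement for those tools: the function name is hypothetical, the demo runs against a temporary file instead of a real device, and on an instance the path would be the attached block device (which needs root access).

```python
import os
import tempfile


def read_all_blocks(path, block_size=1024 * 1024):
    """Sequentially read every block of `path`; return the number of bytes read.

    Reading each block once is what forces a snapshot-restored volume to pull
    that block down from S3.
    """
    total = 0
    with open(path, "rb", buffering=0) as device:
        while True:
            chunk = device.read(block_size)
            if not chunk:
                break
            total += len(chunk)
    return total


# Demo on a 3 MiB temporary file standing in for a block device
with tempfile.NamedTemporaryFile(delete=False) as tmp:
    tmp.write(b"\0" * (3 * 1024 * 1024))
try:
    print(read_all_blocks(tmp.name))
finally:
    os.unlink(tmp.name)
```

For large volumes, dd or fio remain the practical choice since they can read in parallel and bypass the page cache.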

Elastic Volumes

Elastic Volumes allows modifying EBS volume size, type, IOPS, and throughput without detaching the volume or stopping the instance.
Supported on all current-generation instances and several previous-generation instances (C1, C3, C4, G2, I2, M1, M3, M4, R3, R4).
Modifications include:
- Increasing volume size (cannot decrease)
- Changing volume type (e.g., gp2 → gp3, io1 → io2)
- Adjusting provisioned IOPS and throughput (gp3, io1, io2)
Size increases take effect once the modification reaches the “optimizing” state (usually seconds).
The file system must be extended within the OS after a size increase.
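A volume can be modified up to 4 times within a rolling 24-hour window (as of January 2026); previously, a 6-hour cooldown applied between modifications.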

RAID Configuration

EBS volumes can be striped if a single EBS volume does not meet the performance requirements.
Note: With gp3 volumes now supporting up to 80,000 IOPS and 2,000 MiB/s per volume, many workloads that previously required RAID 0 can now use a single volume, improving resiliency.
Striping volumes allows pushing tens of thousands of IOPS beyond single-volume limits.
EBS volumes are already replicated across multiple servers in an AZ for availability and durability, so AWS generally recommends striping for performance rather than durability.
For greater I/O performance than can be achieved with a single volume, RAID 0 can stripe multiple volumes together; for on-instance redundancy, RAID 1 can mirror two volumes together.
RAID 0 allows I/O distribution across all volumes in a stripe, allowing straight gains with each addition.
RAID 1 can be used for durability to mirror volumes, but it requires more EC2 to EBS bandwidth as the data is written to multiple volumes simultaneously, so it should be used with an EBS-optimized instance.
EBS volume data is replicated across multiple servers in an AZ to prevent the loss of data from the failure of any single component.
AWS doesn’t recommend RAID 5 and 6 because the parity write operations of these modes consume the IOPS available for the volumes and can result in 20-30% fewer usable IOPS than RAID 0.
A 2-volume RAID 0 config can outperform a 4-volume RAID 6 that costs twice as much.
Durability consideration: Each additional volume in a RAID 0 stripe reduces effective durability (e.g., 4 gp3 volumes in RAID 0 = ~99.6% effective durability vs. 99.9% for a single volume). With increased gp3 limits, fewer volumes are needed for the same performance.
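The durability trade-off of RAID 0 can be checked with a short calculation. The sketch below assumes volume failures are independent, so the stripe survives only if every member volume survives; the function names are illustrative.

```python
def raid0_durability(per_volume_durability, volumes):
    """Effective durability of a RAID 0 stripe, assuming independent failures."""
    return per_volume_durability ** volumes


def raid0_iops(per_volume_iops, volumes):
    """Aggregate IOPS of a RAID 0 stripe (ideal, linear scaling)."""
    return per_volume_iops * volumes


# Four volumes at 99.9% durability each, 16,000 IOPS per volume
print(round(raid0_durability(0.999, 4) * 100, 1))
print(raid0_iops(16_000, 4))
```

The same 64,000 IOPS on a single volume keeps the single-volume durability figure, which is the reasoning behind preferring one larger volume where the limits allow.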

EBS Performance Best Practices Summary

Use gp3 as the default volume type – provides better baseline performance than gp2 at 20% lower cost.
Use io2 Block Express for critical databases – sub-millisecond latency, 99.999% durability, up to 256K IOPS.
Right-size your instance – ensure the instance’s EBS bandwidth limit is not the bottleneck (use CloudWatch EBSIOBalance% and EBSByteBalance% metrics).
Use EBS bandwidth weighting for I/O-intensive workloads with lower network needs.
Prefer single larger volumes over RAID 0 when gp3 limits (80,000 IOPS, 2,000 MiB/s) are sufficient – simpler and more durable.
Enable Fast Snapshot Restore for production volumes restored from snapshots to avoid first-access latency.
Monitor with CloudWatch – track VolumeReadOps, VolumeWriteOps, VolumeQueueLength, and BurstBalance (gp2) metrics.
Migrate legacy volumes – upgrade gp2 → gp3 and io1 → io2 using Elastic Volumes (no downtime required).

AWS Certification Exam Practice Questions

Questions are collected from the Internet and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed, the question might not be updated.

Open to further feedback, discussion and correction.

A user is trying to pre-warm a blank EBS volume attached to a Linux instance. Which of the below mentioned steps should be performed by the user?
1. There is no need to pre-warm an EBS volume (with latest update no pre-warming is needed)
2. Contact AWS support to pre-warm (This used to be the case before, but pre warming is not necessary now)
3. Unmount the volume before pre-warming
4. Format the device
A user has created an EBS volume of 10 GB and attached it to a running instance. The user is trying to access EBS for first time. Which of the below mentioned options is the correct statement with respect to a first time EBS access?
1. The volume will show a size of 8 GB
2. The volume will show a loss of the IOPS performance the first time (new volumes previously needed to be wiped clean first, but pre-warming is no longer needed)
3. The volume will be blank
4. If the EBS is mounted it will ask the user to create a file system
You are running a database on an EC2 instance, with the data stored on Elastic Block Store (EBS) for persistence. At times throughout the day, you are seeing large variance in the response times of the database queries. Looking into the instance with the iostat command, you see a lot of wait time on the disk volume that the database’s data is stored on. What two ways can you improve the performance of the database’s storage while maintaining the current persistence of the data? Choose 2 answers
1. Move to an SSD backed instance
2. Move the database to an EBS-Optimized Instance
3. Use Provisioned IOPs EBS
4. Use the ephemeral storage on an m2.4xLarge Instance Instead
You have launched an EC2 instance with four (4) 500 GB EBS Provisioned IOPS volumes attached. The EC2 Instance is EBS-Optimized and supports 500 Mbps throughput between EC2 and EBS. The four EBS volumes are configured as a single RAID 0 device, and each Provisioned IOPS volume is provisioned with 4,000 IOPS (4000 16KB reads or writes) for a total of 16,000 random IOPS on the instance. The EC2 Instance initially delivers the expected 16,000 IOPS random read and write performance. Sometime later, in order to increase the total random I/O performance of the instance, you add an additional two 500 GB EBS Provisioned IOPS volumes to the RAID. Each volume is provisioned to 4,000 IOPS like the original four, for a total of 24,000 IOPS on the EC2 instance. Monitoring shows that the EC2 instance CPU utilization increased from 50% to 70%, but the total random IOPS measured at the instance level does not increase at all. What is the problem and a valid solution?
1. Larger storage volumes support higher Provisioned IOPS rates: increase the provisioned volume storage of each of the 6 EBS volumes to 1TB.
2. EBS-Optimized throughput limits the total IOPS that can be utilized; use an EBS-Optimized instance that provides larger throughput. (EC2 instance types have a limit on max throughput and would require larger instance types to provide 24,000 IOPS)
3. Small block sizes cause performance degradation, limiting the I/O throughput; configure the instance device driver and file system to use 64KB blocks to increase throughput.
4. RAID 0 only scales linearly to about 4 devices; use RAID 0 with 4 EBS Provisioned IOPS volumes, but increase each Provisioned IOPS EBS volume to 6,000 IOPS.
5. The standard EBS instance root volume limits the total IOPS rate; change the instance root volume to also be a 500GB 4,000 Provisioned IOPS volume
A user has deployed an application on an EBS backed EC2 instance. For a better performance of application, it requires dedicated EC2 to EBS traffic. How can the user achieve this?
1. Launch the EC2 instance as EBS provisioned with PIOPS EBS
2. Launch the EC2 instance as EBS enhanced with PIOPS EBS
3. Launch the EC2 instance as EBS dedicated with PIOPS EBS
4. Launch the EC2 instance as EBS optimized with PIOPS EBS
A company is running an I/O-intensive database on a gp2 EBS volume and experiencing inconsistent performance. The DBA wants to achieve consistent 50,000 IOPS with the lowest cost. Which approach should they use?
1. Use multiple gp2 volumes in RAID 0 configuration
2. Migrate to a single gp3 volume and provision 50,000 IOPS
3. Use an io2 Block Express volume with 50,000 provisioned IOPS
4. Use multiple gp3 volumes in RAID 0 to aggregate IOPS
(gp3 now supports up to 80,000 IOPS per volume at a lower cost than io2. For 50,000 IOPS without needing 99.999% durability, gp3 is the most cost-effective choice.)
An application requires 256,000 IOPS with sub-millisecond latency for a critical Oracle database. Which EBS configuration provides the required performance?
1. Four gp3 volumes with 64,000 IOPS each in RAID 0
2. Multiple io1 volumes in RAID 0 configuration
3. A single io2 Block Express volume with 256,000 provisioned IOPS on a Nitro-based instance
4. Eight gp3 volumes with 32,000 IOPS each in RAID 0
(io2 Block Express supports up to 256,000 IOPS per volume with sub-millisecond latency. A single volume approach is simpler and provides higher durability (99.999%) than RAID configurations.)
A team has restored an EBS volume from a snapshot and needs to serve production traffic immediately with full provisioned IOPS. What should they do?
1. Pre-warm the volume by reading all blocks using the dd utility
2. Wait for 24 hours for background initialization to complete
3. Enable Fast Snapshot Restore (FSR) on the snapshot before creating the volume
4. Attach the volume to an EBS-optimized instance to speed up initialization
(Fast Snapshot Restore ensures volumes created from the snapshot are fully initialized at creation, eliminating first-access latency. This must be enabled before creating the volume.)
An EC2 instance is running an I/O-heavy analytics workload with low network traffic requirements. The team wants to maximize EBS throughput without changing the instance type. What feature can help?
1. Enable Enhanced Networking on the instance
2. Configure the instance with ebs-1 bandwidth weighting to increase EBS bandwidth by 25%
3. Enable placement groups for the instance
4. Attach additional network interfaces to the instance
(Instance bandwidth weighting allows reallocating up to 25% of VPC network bandwidth to EBS, beneficial for workloads with high I/O but low networking needs.)
A company wants to migrate their existing io1 volumes to a newer volume type with better durability and performance without application downtime. Which approach is recommended? (Select TWO)
1. Create new io2 volumes from snapshots and switch
2. Use Elastic Volumes to modify the volume type from io1 to io2 without detaching
3. The migration provides 99.999% durability (up from 99.8-99.9%) at the same cost
4. io1 to io2 migration requires stopping the instance
5. io2 volumes cost 50% more than io1 for the same IOPS
(Elastic Volumes supports online type change from io1 to io2. io2 provides higher durability and performance (1,000 IOPS/GiB vs 50 IOPS/GiB) at the same storage and IOPS pricing.)

AWS EBS Volume Types – gp3, io2, st1, sc1 Compared

November 12, 2022 ~ Last updated on : July 22, 2026 ~ jayendrapatil ~ 39 Comments

AWS EBS Volume Types

🆕 Major Update – September 2025

Amazon EBS gp3 volumes now support up to 64 TiB size, 80,000 IOPS, and 2,000 MiB/s throughput — a 4X, 5X, and 2X increase respectively over previous limits. Additionally, as of January 2026, Elastic Volumes now supports up to 4 modifications per 24-hour rolling window (previously limited by a 6-hour cooldown between modifications).

AWS provides the following EBS volume types, which differ in performance characteristics and price and can be tailored for storage performance and cost to the needs of the applications.
Solid state drives (SSD-backed) volumes optimized for transactional workloads involving frequent read/write operations with small I/O size, where the dominant performance attribute is IOPS
- General Purpose SSD (gp3/gp2)
- Provisioned IOPS SSD (io2 Block Express/io1)
Hard disk drives (HDD-backed) volumes optimized for large streaming workloads where throughput (measured in MiB/s) is a better performance measure than IOPS
- Throughput Optimized HDD (st1)
- Cold HDD (sc1)
- ~~Magnetic Volumes (standard)~~ (Previous Generation)

EBS Volume Types Summary

Solid state drives (SSD-backed) volumes

General Purpose SSD Volumes (gp3/gp2)

General Purpose SSD volumes offer cost-effective storage that is ideal for a broad range of workloads.
General Purpose SSD volumes deliver single-digit millisecond latencies.
General Purpose SSD (gp3) volumes (Recommended)
- can range in size from 1 GiB to 64 TiB (increased from 16 TiB in September 2025).
- deliver a consistent baseline rate of 3,000 IOPS and 125 MiB/s, included with the price of storage.
- additional IOPS (up to 80,000) and throughput (up to 2,000 MiB/s) can be provisioned for an additional cost.
- the maximum ratio of provisioned IOPS to provisioned volume size is 500 IOPS per GiB.
- the maximum ratio of provisioned throughput to provisioned IOPS is 0.25 MiB/s per IOPS.
- performance is provisioned independently from storage capacity, allowing even small volumes to achieve high performance.
- provides up to 20% lower price per GB compared to gp2 volumes.
- Note: On Outposts, gp3 volumes support sizes up to 16 TiB, IOPS up to 16,000, and throughput up to 1,000 MiB/s.
General Purpose SSD (gp2) volumes
- can range in size from 1 GiB to 16 TiB.
- has a maximum throughput of 250 MiB/s (depending on volume size).
- provides a baseline performance of 3 IOPS/GiB.
- provides the ability to burst to 3,000 IOPS for extended periods of time for volume size less than 1 TiB and up to a maximum of 16,000 IOPS (at 5,334 GiB).
- If the volume performance is frequently limited to the baseline level (due to an empty I/O credit balance),
  - consider using a larger General Purpose SSD volume (with a higher baseline performance level) or
  - switching to a gp3 volume for independent IOPS/throughput provisioning or
  - switching to a Provisioned IOPS SSD volume for workloads that require sustained IOPS performance greater than 80,000 IOPS.
- AWS recommends migrating gp2 volumes to gp3 for better performance and lower cost.
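The gp3 limits listed above (size, IOPS, throughput and the two ratios) can be combined into a quick pre-flight check. This is a sketch under stated assumptions: the function name is made up, and it assumes the 3,000 IOPS and 125 MiB/s baselines are always available, so the 500 IOPS/GiB ratio only constrains IOPS provisioned above the baseline.

```python
from typing import List

GP3_MAX_SIZE_GIB = 64 * 1024      # 64 TiB
GP3_BASE_IOPS, GP3_MAX_IOPS = 3_000, 80_000
GP3_BASE_MIBPS, GP3_MAX_MIBPS = 125, 2_000


def gp3_config_errors(size_gib, iops=GP3_BASE_IOPS, throughput_mibps=GP3_BASE_MIBPS) -> List[str]:
    """Return a list of limit violations for a proposed gp3 configuration."""
    errors = []
    if not 1 <= size_gib <= GP3_MAX_SIZE_GIB:
        errors.append("size must be between 1 GiB and 64 TiB")
    # Assumption: the baseline is always allowed; above it, 500 IOPS per GiB applies
    max_iops = min(GP3_MAX_IOPS, max(GP3_BASE_IOPS, 500 * size_gib))
    if not GP3_BASE_IOPS <= iops <= max_iops:
        errors.append(f"IOPS must be between {GP3_BASE_IOPS} and {max_iops}")
    # Throughput is capped at 0.25 MiB/s per provisioned IOPS
    max_mibps = min(GP3_MAX_MIBPS, 0.25 * iops)
    if not GP3_BASE_MIBPS <= throughput_mibps <= max_mibps:
        errors.append(f"throughput must be between {GP3_BASE_MIBPS} and {max_mibps:g} MiB/s")
    return errors


print(gp3_config_errors(100, iops=50_000, throughput_mibps=2_000))
print(gp3_config_errors(100, iops=60_000))
print(gp3_config_errors(1_000, iops=4_000, throughput_mibps=1_500))
```

An empty list means the configuration fits within the limits described in this section; Outposts limits are lower and are not modelled here.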

I/O Credits and Burst Performance (gp2 only)

I/O credits represent the available bandwidth that the General Purpose SSD (gp2) volume can use to burst large amounts of I/O when more than the baseline performance is needed.
General Purpose SSD (gp2) volume performance is governed by volume size, which dictates the baseline performance level of the volume, e.g. a 100 GiB volume has a baseline of 300 IOPS @ 3 IOPS/GiB.
General Purpose SSD (gp2) volume size also determines how quickly it accumulates I/O credits, e.g. a 100 GiB volume with a baseline of 300 IOPS accumulates 180K I/O credits in 10 mins (300 * 60 * 10).
Larger volumes have higher baseline performance levels and accumulate I/O credits faster, e.g. a 1 TiB volume has a baseline performance of about 3,000 IOPS.
The more I/O credits the volume has, the longer it can burst beyond its baseline performance level, e.g. a 100 GiB volume with 180K I/O credits can burst @ 3,000 IOPS for about 1 minute (180K/3000 = 60 secs, ignoring the credits earned during the burst).
Each volume receives an initial I/O credit balance of 5,400,000 I/O credits, which is enough to sustain the maximum burst performance of 3,000 IOPS for 30 minutes.
Initial credit balance is designed to provide a fast initial boot cycle for boot volumes and a good bootstrapping experience for other applications.
Each volume can accumulate I/O credits over a period of time, which can be used to burst to the required performance level, up to a max of 3,000 IOPS.
Unused I/O credits cannot go beyond 5,400,000 I/O credits.
Note: gp3 volumes do NOT use the I/O credit/burst model — they provide consistent baseline performance of 3,000 IOPS regardless of volume size.

Volumes up to 1 TiB can burst up to 3,000 IOPS, above their baseline performance.
Volumes larger than 1 TiB have a baseline performance that is already equal to or greater than the maximum burst performance, and their I/O credit balance never depletes.
Baseline performance cannot be beyond 16,000 IOPS for gp2 volumes and this limit is reached @ 5,334 GiB
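The credit bucket described above can be simulated second by second to see when a burst runs out. This is a simplified sketch, not AWS's actual accounting: the function name, the one-second step and the refill order are assumptions, while the 3 IOPS/GiB baseline, 3,000 IOPS burst and 5,400,000 credit cap come from the text.

```python
MAX_CREDITS = 5_400_000


def simulate_gp2_burst(size_gib, demand_iops, seconds):
    """Return the IOPS delivered in each second for a constant demand."""
    baseline = min(3 * size_gib, 16_000)
    burst = max(3_000, baseline)
    balance = MAX_CREDITS               # every volume starts with a full bucket
    delivered = []
    for _ in range(seconds):
        available = balance + baseline  # stored credits plus this second's refill
        ios = min(demand_iops, burst, available)
        balance = min(MAX_CREDITS, available - ios)
        delivered.append(ios)
    return delivered


# 250 GiB volume (750 IOPS baseline) driven at 3,000 IOPS
result = simulate_gp2_burst(250, 3_000, 2_401)
print(result[2399], result[2400])
```

Once the bucket is empty the volume falls back to its baseline, which is the "frequently limited to the baseline level" symptom mentioned for gp2.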

Baseline Performance (gp2)

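Formula – Volume size (GiB) * 3 IOPS, subject to the gp2 minimum of 100 IOPS and maximum of 16,000 IOPS (the 1 GiB examples in this section use the raw formula for illustration)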
Calculation example
- 1 GiB volume size = 3 IOPS (1 * 3 IOPS)
- 250 GiB volume size = 750 IOPS (250* 3 IOPS)

Maximum burst duration @ 3000 IOPS (gp2)

How long can the 5,400,000 I/O credit balance sustain the burst performance of 3,000 IOPS? Subtract the baseline performance, which the volume size contributes continuously, from 3,000 IOPS to get the net rate at which credits are spent.
Formula – 5400000/(3000 – Baseline performance)
Calculation example
- 1 GiB volume size @ 3000 IOPS with 5400000 the burst performance can be maintained for 5400000/(3000-3) = 1802 secs
- 250 GiB volume size @ 3000 IOPS with 5400000 the burst performance can be maintained for 5400000/(3000-3*250) = 2400 secs

Time to fill the 5400000 I/O credit balance (gp2)

Formula – 5400000/Baseline performance
Calculation
- 1 GiB volume size @ 3 IOPS would require 5400000/3 = 1800000 secs
- 250 GiB volume size @ 750 IOPS would require 5400000/750 = 7200 secs
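The three gp2 formulas above can be written as small functions to reproduce the worked examples. This is a sketch with illustrative function names; it uses the raw 3 IOPS/GiB rule from the examples and deliberately leaves out the 100 IOPS floor that AWS applies to very small gp2 volumes, so that the numbers match the calculations above.

```python
MAX_CREDITS = 5_400_000
BURST_IOPS = 3_000


def gp2_baseline_iops(size_gib):
    """Baseline IOPS: 3 IOPS per GiB, capped at 16,000."""
    return min(3 * size_gib, 16_000)


def burst_duration_secs(size_gib):
    """Seconds a full credit bucket sustains 3,000 IOPS."""
    baseline = gp2_baseline_iops(size_gib)
    if baseline >= BURST_IOPS:
        return float("inf")   # baseline already meets the burst rate
    return MAX_CREDITS / (BURST_IOPS - baseline)


def refill_time_secs(size_gib):
    """Seconds an idle volume needs to refill an empty credit bucket."""
    return MAX_CREDITS / gp2_baseline_iops(size_gib)


for size in (1, 250):
    print(size, gp2_baseline_iops(size), round(burst_duration_secs(size)), refill_time_secs(size))
```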

Provisioned IOPS SSD (io2 Block Express / io1) Volumes

are designed to meet the needs of I/O intensive workloads, particularly database workloads, that are sensitive to storage performance and consistency in random access I/O throughput.
IOPS rate can be specified when the volume is created, and EBS delivers within 10% of the provisioned IOPS performance 99.9% of the time over a given year.

io2 Block Express (Recommended)

offers the highest performance block storage among EBS volumes with an average latency of under 500 microseconds for 16KiB I/O operations.
can range in size from 4 GiB to 64 TiB.
supports up to 256,000 IOPS per volume (16 KiB I/O) — requires Nitro-based instances.
supports up to 4,000 MiB/s throughput per volume.
provides 99.999% durability (0.001% annual failure rate) — 100X higher durability than io1/gp2/gp3.
Ratio of IOPS provisioned to volume size is up to 1,000 IOPS per GiB — 20X higher than io1.
Available at the same price as io1.
Supports Multi-Attach — allows a single volume to be attached to up to 16 Nitro-based instances simultaneously.
Supports NVMe reservations for shared storage cluster coordination.
delivers better outlier latency compared to General Purpose volumes, reducing the frequency of IOs exceeding 800 microseconds by over 10X.
AWS recommends migrating io1 volumes to io2 Block Express for higher performance, durability, and IOPS/GiB ratio at no additional cost.

io1 (Previous Generation Provisioned IOPS)

can range in size from 4 GiB to 16 TiB.
have a throughput limit of up to 1,000 MiB/s (at 64,000 IOPS on Nitro instances).
can provision up to 64,000 IOPS per volume.
Ratio of IOPS provisioned to the volume size requested can be a maximum of 50 IOPS per GiB; e.g., a volume with 5,000 IOPS must be at least 100 GiB.
99.8% – 99.9% durability (0.1% – 0.2% annual failure rate).
can be striped together in a RAID configuration for larger size and greater performance.
Note: AWS recommends migrating to io2 Block Express for better durability, performance, and IOPS/GiB ratio at the same price.
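The IOPS-to-size ratios for io1 and io2 Block Express translate directly into a minimum volume size for a given IOPS target. A minimal sketch, with illustrative names, using only the ratios and the 4 GiB minimum size quoted in this section:

```python
import math

MAX_IOPS_PER_GIB = {"io1": 50, "io2": 1_000}
MAX_IOPS = {"io1": 64_000, "io2": 256_000}
MIN_SIZE_GIB = 4


def min_volume_size_gib(volume_type, iops):
    """Smallest volume size (GiB) that can carry the requested IOPS."""
    if iops > MAX_IOPS[volume_type]:
        raise ValueError(f"{volume_type} supports at most {MAX_IOPS[volume_type]} IOPS")
    return max(MIN_SIZE_GIB, math.ceil(iops / MAX_IOPS_PER_GIB[volume_type]))


def ratio_allowed(volume_type, size_gib, iops):
    """True if iops/size stays within the type's IOPS-per-GiB ratio."""
    return iops <= MAX_IOPS_PER_GIB[volume_type] * size_gib


print(min_volume_size_gib("io1", 5_000))     # the 5,000 IOPS example above
print(min_volume_size_gib("io2", 5_000))
print(ratio_allowed("io1", 100, 6_000), ratio_allowed("io2", 100, 6_000))
```

The same IOPS target needs a far smaller io2 volume, which is where the 20X ratio improvement shows up in practice.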

Hard disk drives (HDD-backed) volumes

Throughput Optimized HDD (st1) Volumes

provide low-cost magnetic storage that defines performance in terms of throughput rather than IOPS.
is a good fit for large, sequential workloads such as EMR, ETL, data warehouses, and log processing.
do not support boot volumes.
can range in size from 125 GiB to 16 TiB.
are designed to support frequently accessed data.
maximum throughput of 500 MiB/s per volume.
maximum IOPS of 500 (1 MiB I/O).
uses a burst-bucket model for performance similar to gp2. Volume size determines the baseline throughput of the volume, which is the rate at which the volume accumulates throughput credits. Volume size also determines the burst throughput of your volume, which is the rate at which you can spend credits when they are available.

Cold HDD (sc1) Volumes

provide low-cost magnetic storage that defines performance in terms of throughput rather than IOPS.
With a lower throughput limit than st1, sc1 is a good fit for large, sequential cold-data workloads.
are ideal for infrequently accessed data where saving costs is the priority, as sc1 provides inexpensive block storage.
do not support boot volumes.
can range in size from 125 GiB to 16 TiB.
maximum throughput of 250 MiB/s per volume.
maximum IOPS of 250 (1 MiB I/O).
although similar to Throughput Optimized HDD (st1) volumes, are designed to support infrequently accessed data.
uses a burst-bucket model for performance similar to gp2. Volume size determines the baseline throughput of the volume, which is the rate at which the volume accumulates throughput credits. Volume size also determines the burst throughput of your volume, which is the rate at which you can spend credits when they are available.

Magnetic Volumes (standard) – Previous Generation

Magnetic volumes provide the lowest cost per gigabyte of all EBS volume types. Magnetic volumes are backed by magnetic drives and are ideal for workloads performing sequential reads, workloads where data is accessed infrequently, and scenarios where the lowest storage cost is important.

~~Magnetic volumes can range in size from 1 GiB to 1 TiB~~
~~These volumes deliver approximately 100 IOPS on average, with burst capability of up to hundreds of IOPS~~
~~Magnetic volumes can be striped together in a RAID configuration for larger size and greater performance.~~
Note: Magnetic (standard) is a previous generation volume type. AWS recommends using current generation volume types (gp3, io2, st1, sc1) for better performance and cost-effectiveness. For infrequent access cold data, consider sc1 instead.

EBS Volume Types (Previous Generation – Reference Only)

EBS Elastic Volumes

Elastic Volumes allows you to dynamically increase capacity, tune performance, and change the type of live volumes with no downtime or performance impact.
(January 2026 Update) You can now modify a volume up to 4 times within a rolling 24-hour window — the previous 6-hour cooldown between modifications has been eliminated.
A new modification can be initiated as soon as the previous one completes.
Supported modifications include: increasing size, changing volume type, and adjusting provisioned performance (IOPS/throughput).
Note: Volume size can only be increased, not decreased. To reduce size, create a new smaller volume and migrate data.
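The modification rules above (size can only grow, at most 4 modifications in a rolling 24-hour window) can be expressed as a simple pre-check. This is a hypothetical helper, not an AWS API: the function name and inputs are assumptions, and it does not model the requirement that the previous modification must have completed.

```python
from datetime import datetime, timedelta

MAX_MODIFICATIONS = 4
WINDOW = timedelta(hours=24)


def can_modify_volume(previous_modifications, now, current_size_gib, new_size_gib):
    """Return (allowed, reason) for a proposed Elastic Volumes modification."""
    if new_size_gib < current_size_gib:
        return False, "volume size can only be increased"
    # Count only the modifications that fall inside the rolling window
    recent = [t for t in previous_modifications if now - t < WINDOW]
    if len(recent) >= MAX_MODIFICATIONS:
        return False, "4 modifications already made in the last 24 hours"
    return True, "ok"


now = datetime(2026, 2, 1, 12, 0)
history = [now - timedelta(hours=h) for h in (1, 5, 10, 30)]
print(can_modify_volume(history, now, 100, 200))
print(can_modify_volume(history, now, 100, 50))
```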

AWS Certification Exam Practice Questions

Questions are collected from the Internet and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed, the question might not be updated.

Open to further feedback, discussion and correction.

You are designing an enterprise data storage system. Your data management software system requires mountable disks and a real filesystem, so you cannot use S3 for storage. You need persistence, so you will be using AWS EBS Volumes for your system. The system needs as low-cost storage as possible, and access is not frequent or high throughput, and is mostly sequential reads. Which is the most appropriate EBS Volume Type for this scenario?
1. gp1
2. io1
3. sc1 (Cold HDD sc1 volumes are designed for infrequently accessed data with lowest storage cost. Note: The original answer was “standard/Magnetic” but for modern deployments, sc1 is the recommended low-cost option for infrequent sequential access. Magnetic (standard) is previous generation.)
4. gp2
Which EBS volume type is best for high performance NoSQL cluster deployments?
1. io1/io2 Block Express (Provisioned IOPS SSD volumes are best for: Critical business applications that require sustained IOPS performance, or more than 80,000 IOPS or 2,000 MiB/s of throughput per volume, like large database workloads such as MongoDB. io2 Block Express is now recommended over io1 for up to 256,000 IOPS.)
2. gp1
3. standard
4. gp2
Provisioned IOPS Costs: you are charged for the IOPS and storage whether or not you use them in a given month.
1. FALSE
2. TRUE
A user is trying to create a PIOPS EBS volume with 8 GB size and 450 IOPS. Will AWS create the volume?
1. Yes, since the ratio between EBS and IOPS is less than 1000 for io2 Block Express (Note: 450/8 = 56.25 IOPS per GiB, which exceeds the io1 limit of 50, so an io1 volume with these values would be rejected)
2. No, since the PIOPS and EBS size ratio is less than 50
3. No, the EBS size is less than 10 GB
4. Yes, since PIOPS is higher than 100
A user has provisioned 2000 IOPS to the EBS volume. The application hosted on that EBS is experiencing fewer IOPS than provisioned. Which of the below mentioned options does not affect the IOPS of the volume?
1. The application does not have enough IO for the volume
2. Instance is EBS optimized
3. The EC2 instance has 10 Gigabit Network connectivity
4. Volume size is too large
A user is trying to create a PIOPS EBS volume with 6000 IOPS and 100 GB size. AWS does not allow the user to create this volume. What is the possible root cause for this?
1. The ratio between IOPS and the EBS volume is higher than 50 (For io1 volumes, maximum ratio is 50 IOPS per GiB. 6000/100 = 60, which exceeds 50. Note: For io2 Block Express, this would be allowed as the ratio limit is 1000 IOPS per GiB.)
2. The maximum IOPS supported by EBS is 3000
3. The ratio between IOPS and the EBS volume is lower than 100
4. PIOPS is supported for EBS higher than 500 GB size
A company needs a database storage solution that provides consistent sub-millisecond latency, 99.999% durability, and supports up to 256,000 IOPS. Which EBS volume type should they choose?
1. gp3
2. io1
3. io2 Block Express (io2 Block Express delivers sub-millisecond latency, 99.999% durability, and supports up to 256,000 IOPS with 4,000 MiB/s throughput per volume.)
4. st1
A solutions architect needs to consolidate multiple striped gp3 volumes into a single volume for a containerized workload that requires 50,000 IOPS and 30 TiB of storage. Which volume type supports this requirement with a single volume?
1. gp2
2. gp3 (Since September 2025, gp3 supports up to 64 TiB size and 80,000 IOPS, allowing consolidation of previously striped volumes into a single gp3 volume.)
3. io1
4. st1
What is the maximum IOPS-to-storage ratio for io2 Block Express volumes?
1. 50 IOPS per GiB
2. 500 IOPS per GiB
3. 1,000 IOPS per GiB (io2 Block Express supports up to 1,000 IOPS per GiB, which is 20X higher than io1’s 50 IOPS per GiB ratio.)
4. 100 IOPS per GiB
Which of the following are advantages of io2 Block Express over io1? (Select THREE)
1. 100X higher durability (99.999% vs 99.8-99.9%)
2. 20X higher IOPS-to-storage ratio (1000 vs 50 IOPS/GiB)
3. 4X higher maximum IOPS (256,000 vs 64,000)
4. Lower cost per provisioned IOPS
5. Support for HDD-backed storage

AWS EC2 Network – Enhanced Networking

November 10, 2022 ~ Last updated on : June 19, 2026 ~ jayendrapatil ~ 9 Comments

EC2 Enhanced Networking

Enhanced networking results in higher bandwidth, higher packet-per-second (PPS) performance, lower latency, consistency, scalability and lower jitter
EC2 provides enhanced networking capabilities using single root I/O virtualization (SR-IOV) only on supported instance types
- SR-IOV is a method of device virtualization that provides higher I/O performance and lower CPU utilization
There is no additional charge for using enhanced networking.
Enhanced networking is supported only in a VPC.
All current-generation instances built on the AWS Nitro System use ENA for enhanced networking by default.
Amazon Linux AMIs, Ubuntu HVM AMIs, and Windows Server AMIs already have the ENA module installed with the attributes set and do not require any additional configurations.
It can be enabled for other OS distributions by installing the module with the correct attributes configured
Enhanced Networking is supported using
- Elastic Network Adapter (ENA)
  - The Elastic Network Adapter (ENA) supports network speeds of up to 200 Gbps for supported instance types (e.g., C6in, R6in, M6in instances). Some accelerated instances like P4d support up to 400 Gbps.
  - All Nitro-based instances use ENA for enhanced networking.
  - The following Xen-based instances also use ENA: H1, I3, G3, m4.16xlarge, P3, P3dn, and R4.
  - ENA is the recommended and standard adapter for all current-generation workloads.
- Intel 82599 Virtual Function (VF) interface
  - The Intel 82599 Virtual Function interface supports network speeds of up to 10 Gbps for supported instance types.
  - Supported only on previous-generation instance types: C3, C4, D2, I2, M4 (excl. m4.16xlarge), and R3.
  - These are all previous-generation instances. AWS recommends migrating to current-generation Nitro-based instances with ENA for better performance.

ENA Express

ENA Express is powered by AWS Scalable Reliable Datagram (SRD) technology, a high-performance network transport protocol.
ENA Express increases the maximum single flow bandwidth from 5 Gbps up to 25 Gbps within the same Region, up to the aggregate instance limit.
Reduces tail latency: up to 50% reduction in P99 latency and up to 85% reduction in P99.9 latency compared to TCP.
Works transparently with existing TCP and UDP applications — no code changes required.
SRD distributes packets across different network paths and dynamically adjusts when congestion is detected.
Handles packet reordering on the receiving end and most retransmits in the network layer.
Cross-AZ support (May 2026): ENA Express now supports traffic between instances in different Availability Zones within the same Region, delivering up to 25 Gbps single-flow bandwidth.
Requirements:
- Both sending and receiving instances must be supported instance types.
- Both instances must have ENA Express enabled on their network interface attachment.
- The network path must not include middleware boxes.
- Linux instances require ENA driver version 2.2.9 or higher for full bandwidth; version 2.8+ for metrics.
ENA Express is available on supported 6th generation and later instance types (e.g., m6i, m6a, c6i, r6i, and newer).
If ENA Express is not supported on both ends, communication falls back to standard ENA transmission.
Note: For workloads requiring high packets-per-second with lowest latency during uncongested periods, standard enhanced networking (without ENA Express) may be more appropriate.

Elastic Fabric Adapter (EFA)

An Elastic Fabric Adapter (EFA) is a network device for Amazon EC2 instances to accelerate AI/ML, and High Performance Computing (HPC) applications.
EFA provides lower and more consistent latency and higher throughput than TCP transport for inter-instance communication.
Supports Message Passing Interface (MPI) for HPC and NVIDIA Collective Communications Library (NCCL) for ML workloads, scaling to thousands of cores or GPUs.
Available as an optional EC2 networking feature at no additional cost on supported instance types.
EFA uses OS-bypass capabilities to provide low-latency, high-bandwidth RDMA-like networking.
EFA decoupled from ENA (October 2024): AWS introduced a new interface type that decouples EFA from ENA, enabling dedicated high-bandwidth, low-latency networking crucial for scaling AI/ML workloads.
EFA-only interfaces (June 2026): Amazon SageMaker HyperPod supports EFA-only network interfaces without ENA for IP networking, enabling dedicated accelerator networking.
Supported on instances like P4d (400 Gbps), P5, Trn1, Trn2, Hpc6a, Hpc7a, Hpc7g (200 Gbps), and others.
EFA is ideal for tightly coupled workloads requiring high internode communication bandwidth.

ENA Enhanced Networking Requirements

Instance must be in a VPC (EC2-Classic was fully retired in August 2023)
An HVM virtualization type AMI
Instance must be based on the Nitro System (for current-generation instances)
For Xen-based instances (H1, I3, G3, m4.16xlarge, P3, R4): must have ENA module installed and enaSupport attribute enabled
Supported instance types: All Nitro-based instances (5th generation and later: C5, M5, R5, C6i, M6i, R6i, C7g, M7g, R7g, C8g, M8g, etc.)
Enhanced networking cannot be managed from the Amazon EC2 console — use AWS CLI or CloudShell
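
The requirements above can be sketched as a small check over one instance description. This is only an illustration: the field names (VpcId, VirtualizationType, EnaSupport) follow the shape of an EC2 DescribeInstances response, while the function name and the sample instances are hypothetical.

```python
from typing import Dict, List


def ena_readiness_issues(instance: Dict) -> List[str]:
    """Return the ENA requirements from the list above that are not met."""
    issues = []
    if not instance.get("VpcId"):
        issues.append("instance is not in a VPC")
    if instance.get("VirtualizationType") != "hvm":
        issues.append("AMI virtualization type is not HVM")
    if not instance.get("EnaSupport", False):
        issues.append("enaSupport attribute is not enabled")
    return issues


# Hypothetical sample data, shaped like entries of a DescribeInstances response.
nitro_instance = {
    "InstanceId": "i-0example1",
    "VpcId": "vpc-0example",
    "VirtualizationType": "hvm",
    "EnaSupport": True,
}
xen_instance = {
    "InstanceId": "i-0example2",
    "VpcId": "vpc-0example",
    "VirtualizationType": "hvm",
    "EnaSupport": False,
}

for inst in (nitro_instance, xen_instance):
    print(inst["InstanceId"], ena_readiness_issues(inst) or "ready for ENA")
```

For an instance that reports the attribute as disabled, the attribute is typically enabled from the CLI (for example with `aws ec2 modify-instance-attribute --ena-support`) while the instance is stopped, after the ENA module is installed.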

Intel 82599 VF Enhanced Networking Requirements (Previous Generation)

VPC (EC2-Classic was fully retired in August 2023)
An HVM virtualization type AMI
Instance kernel version
- Linux kernel version of 2.6.32+
- Windows: Server 2008 R2+
Appropriate Virtual Function (VF) driver
- Linux – should have the ixgbevf module installed and the sriovNetSupport attribute set for the instance
- Windows – Intel 82599 Virtual Function driver
Supported instance types (previous generation only): C3, C4, D2, I2, M4 (excl. m4.16xlarge), and R3.
Note: AWS recommends migrating to current-generation Nitro-based instances with ENA for significantly better networking performance (up to 200 Gbps vs. 10 Gbps).

Enhanced Networking vs. ENA Express vs. EFA

Enhanced Networking (ENA/VF): Higher PPS, lower latency, lower jitter using SR-IOV. Available on all Nitro instances. Best for general workloads requiring consistent network performance.
ENA Express: Uses SRD protocol on top of ENA. Increases single-flow bandwidth to 25 Gbps and significantly reduces tail latency. Best for workloads with large data transfers or latency-sensitive applications. Available on 6th gen+ instances.
Elastic Fabric Adapter (EFA): Network device providing OS-bypass RDMA-like capabilities. Best for HPC (MPI) and AI/ML (NCCL) workloads requiring ultra-low latency inter-node communication. Available on specific compute/GPU instances.

AWS Certification Exam Practice Questions

Questions are collected from the Internet, and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day, and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed, the question might not be updated.

Open to further feedback, discussion and correction.

You have multiple Amazon EC2 instances running in a cluster across multiple Availability Zones within the same region. What combination of the following should be used to ensure the highest network performance (packets per second), lowest latency, and lowest jitter? Choose 3 answers
1. Amazon EC2 placement groups (Cluster placement groups are within a single AZ, would not work for multiple AZs)
2. Enhanced networking (provides network performance, lowest latency)
3. Amazon PV AMI (Requires HVM)
4. Amazon HVM AMI (Requires HVM)
5. Amazon Linux (Can be on others as well)
6. Amazon VPC (works only in VPC; EC2-Classic was retired August 2023)
A group of researchers is studying the migration pattern of a beetle that eats and destroys grain. The researchers must process massive amounts of data and run statistics. Which one of the following options provides high performance computing for this purpose?
1. Configure an Auto Scaling group to launch dozens of spot instances to run the statistical analysis simultaneously
2. Launch AMI instances that support SR-IOV in a single Availability Zone
3. Launch compute optimized (C4) instances in at least two Availability Zones
4. Launch enhanced network type instances in a placement group
A company is running a latency-sensitive financial trading application on EC2 instances. They need to maximize single-flow bandwidth between two instances in the same Availability Zone. Which feature should they enable?
1. Enhanced networking with Intel 82599 VF
2. Elastic Fabric Adapter (EFA)
3. ENA Express (ENA Express uses SRD to increase single-flow bandwidth from 5 Gbps to 25 Gbps and reduces tail latency)
4. Placement group with standard ENA
A machine learning team needs to scale their distributed training workload across hundreds of GPU instances with the lowest possible inter-node latency. Which networking feature is most appropriate?
1. ENA Express with SRD protocol
2. Enhanced networking with cluster placement groups
3. Elastic Fabric Adapter (EFA) (EFA provides OS-bypass, RDMA-like capabilities optimized for MPI and NCCL workloads at scale)
4. Multiple Elastic Network Interfaces
Which of the following statements about ENA Express are correct? (Choose 2)
1. ENA Express uses AWS Scalable Reliable Datagram (SRD) protocol to improve network performance (Correct – SRD is the underlying protocol)
2. ENA Express requires application code changes to work
3. ENA Express only works with TCP traffic
4. ENA Express can increase single-flow bandwidth from 5 Gbps up to 25 Gbps (Correct – major benefit of ENA Express)
A company wants to migrate from C3 instances to improve network performance. Which statement is correct regarding the migration?
1. C3 instances support ENA with speeds up to 100 Gbps
2. C3 instances use Intel 82599 VF (up to 10 Gbps) and should be migrated to current-generation Nitro instances with ENA for up to 200 Gbps (C3 is previous gen with VF; current gen instances offer significantly better networking)
3. C3 instances cannot use enhanced networking
4. C3 instances already support ENA Express

AWS WAF – Web Application Firewall Rules & ACLs

October 7, 2022 ~ Last updated on : July 4, 2026 ~ jayendrapatil

AWS Web Application Firewall – WAF

⚠️ AWS WAF Classic End of Life: AWS WAF Classic support ended on September 30, 2025. All customers must use AWS WAF (v2). This post covers the current AWS WAF (v2) service. If you are still on WAF Classic, use the automated migration tool in the AWS WAF console.

AWS WAF – Web Application Firewall protects web applications from attacks by letting you configure rules that allow, block, or monitor (count) web requests based on defined conditions.
helps protect from common attack techniques like SQL injection and Cross-Site Scripting (XSS). Conditions can be based on IP addresses, HTTP headers, HTTP body, URI strings, geographic location, and rate of requests.
tightly integrates with the following AWS services:
- Amazon CloudFront distribution
  - AWS WAF rules run in all AWS Edge Locations, located around the world close to the end users.
  - Blocked requests are stopped before they reach the web servers.
  - Helps support custom origins outside of AWS.
- Application Load Balancer (ALB)
  - WAF rules run in the region and can be used to protect internet-facing as well as internal load balancers.
- Amazon API Gateway REST API
  - Can help secure and protect the REST APIs.
- AWS AppSync GraphQL API
  - Protects GraphQL APIs from common web exploits.
- Amazon Cognito user pool
  - Protects user authentication and registration endpoints.
- AWS App Runner service
  - Protects containerized web applications deployed on App Runner.
  - Note: AWS App Runner is closed to new customers starting April 30, 2026.
- AWS Verified Access instance
  - Adds web application firewall capabilities to zero-trust access.
- AWS Amplify application
  - Protects Amplify-hosted web applications directly.
helps protect applications and can inspect web requests transmitted over HTTP or HTTPS.
provides Managed Rules, which are pre-configured rules that protect applications from common threats such as application vulnerabilities (for example, those described by OWASP), bots, and Common Vulnerabilities and Exposures (CVE).
logs can be sent to CloudWatch Logs log group, an S3 bucket, or Amazon Data Firehose (formerly Kinesis Data Firehose).
supports body inspection up to 64 KB for CloudFront, API Gateway, Cognito, App Runner, and Verified Access, with a default of 16 KB. For Application Load Balancer and AppSync, body inspection is fixed at the first 8 KB.

WAF Benefits

Additional protection against web attacks using specified conditions
Conditions can be defined by using characteristics of web requests such as the following:
- IP addresses that the requests originate from
- Values in request headers
- Strings that appear in the requests
- Length of requests
- Presence of SQL code that is likely to be malicious (SQL injection)
- Presence of a script that is likely to be malicious (cross-site scripting)
- Geographic location (country) of the request origin
- Rate of requests from a single IP or other aggregation key
Managed Rules to get started quickly with pre-configured protection packs
Rules that can be reused for multiple web applications
Real-time metrics, sampled web requests, and dashboards
Automated administration using the WAF API
CloudFront Security Dashboard for unified CDN and security experience
Simplified console with up to 80% reduction in configuration steps (launched June 2025)

How WAF Works

WAF allows controlling the behaviour of web requests by creating conditions, rules, and web access control lists (web ACLs), now also called protection packs in the new console experience.

Conditions

Conditions define basic characteristics to watch for in a web request
- Malicious script – XSS (Cross Site Scripting) – Attackers embed scripts that can exploit vulnerabilities in web applications
- IP addresses or address ranges that requests originate from.
- Size – Length of specified parts of the request, such as the query string.
- Malicious SQL – SQL injection – Attackers try to extract data from the database by embedding malicious SQL code in a web request
- Geographic match – Allow or block requests based on the country from which the requests originate.
- Strings that appear in the request, e.g., values that appear in the User-Agent header or text strings that appear in the query string.
- Regex match – Match request components against regular expressions.
- Label match – Match against labels added by prior rules in the web ACL evaluation.

Actions

Allow – allows the request to be forwarded to the protected resource.
Block – blocks the request. By default returns HTTP 403 (Forbidden), but can be configured with custom responses.
Count – counts the requests that match the rule without allowing or blocking. Useful for testing rules before enforcing them.
CAPTCHA – runs a CAPTCHA puzzle challenge against the request to verify a human is sending it. If solved, the request is allowed with a valid token.
Challenge – runs a silent browser challenge (JavaScript) to verify the client is a legitimate browser without user interaction. Useful for detecting bots without impacting user experience.

Rules

An AWS WAF rule defines how to inspect HTTP(S) web requests and the action to take on a request when it matches the inspection criteria.
Each rule requires one top-level rule statement, which might contain nested statements at any depth, depending on the rule and statement type.
AWS WAF supports logical AND, OR, and NOT statements that can be used to combine statements in a rule. For example:
- based on recent requests from an attacker, a rule might include the following conditions with logical AND:
  - The requests come from 192.0.2.44.
  - They contain the value BadBot in the User-Agent header.
  - They appear to include malicious SQL code in the query string.
- All three conditions must be satisfied for the rule to match and the associated action to be taken.
Rules can also add labels to matching requests. Labels are metadata that can be used by subsequent rules in the same web ACL for more complex logic.
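
The three-condition AND example above could be expressed roughly as the following rule definition. It is a sketch in the shape of the AWS WAF (v2) rule JSON, built as a Python dict; the rule name, metric name, label name, and IP set ARN are hypothetical placeholders, and the IP set is assumed to contain 192.0.2.44/32.

```python
import json

# Hypothetical ARN of an IP set that is assumed to contain 192.0.2.44/32.
IP_SET_ARN = "arn:aws:wafv2:us-east-1:111122223333:regional/ipset/example-ip-set/EXAMPLE-ID"

NO_TRANSFORM = [{"Priority": 0, "Type": "NONE"}]

bad_bot_rule = {
    "Name": "block-badbot-sqli-from-known-ip",  # hypothetical name
    "Priority": 0,
    "Statement": {
        "AndStatement": {
            "Statements": [
                # Condition 1: the request comes from the listed IP address.
                {"IPSetReferenceStatement": {"ARN": IP_SET_ARN}},
                # Condition 2: the User-Agent header contains "BadBot".
                {
                    "ByteMatchStatement": {
                        "SearchString": "BadBot",
                        "FieldToMatch": {"SingleHeader": {"Name": "user-agent"}},
                        "TextTransformations": NO_TRANSFORM,
                        "PositionalConstraint": "CONTAINS",
                    }
                },
                # Condition 3: the query string looks like SQL injection.
                {
                    "SqliMatchStatement": {
                        "FieldToMatch": {"QueryString": {}},
                        "TextTransformations": NO_TRANSFORM,
                    }
                },
            ]
        }
    },
    "Action": {"Block": {}},
    # Label added to matching requests (hypothetical label name).
    "RuleLabels": [{"Name": "example:badbot-sqli"}],
    "VisibilityConfig": {
        "SampledRequestsEnabled": True,
        "CloudWatchMetricsEnabled": True,
        "MetricName": "BlockBadBotSqli",
    },
}

print(json.dumps(bad_bot_rule, indent=2))
```

Because the three statements sit under a single AndStatement, the Block action applies only when all of them match the same request.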

Rate-Based Rules

Rate-based rules track and limit the rate of requests from individual sources.
Aggregation can be by IP address, forwarded IP, custom keys (headers, query parameters), or combinations.
Minimum rate limit is 10 requests per 5-minute window (reduced from 100 in 2025).
Scope-down statements can narrow which requests are counted, e.g., only count requests to the /login path.
Automatically blocks source IPs (or other aggregation keys) when the rate exceeds the threshold.
Useful for protecting against HTTP flood DDoS attacks and brute-force login attempts.
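
The behaviour described above can be modelled with a small sliding-window counter. This is a simplified illustration, not how AWS WAF is implemented internally; the class name, the scope-down predicate, and the sample IP address are assumptions made for the sketch.

```python
from collections import defaultdict, deque
from typing import Callable, Deque, Dict

WINDOW_SECONDS = 300  # the 5-minute evaluation window mentioned above


class RateTracker:
    """Toy model of a rate-based rule with an optional scope-down predicate."""

    def __init__(self, limit: int, in_scope: Callable[[str], bool] = lambda path: True):
        self.limit = limit
        self.in_scope = in_scope
        self._hits: Dict[str, Deque[float]] = defaultdict(deque)

    def should_block(self, key: str, path: str, now: float) -> bool:
        """Record one request and report whether the key is over the limit."""
        if not self.in_scope(path):
            return False  # out-of-scope requests are neither counted nor limited
        hits = self._hits[key]
        # Drop timestamps that have fallen out of the evaluation window.
        while hits and hits[0] <= now - WINDOW_SECONDS:
            hits.popleft()
        hits.append(now)
        return len(hits) > self.limit


# Only requests to /login are counted; the aggregation key is the source IP.
tracker = RateTracker(limit=10, in_scope=lambda path: path.startswith("/login"))
decisions = [tracker.should_block("203.0.113.7", "/login", float(t)) for t in range(11)]
print(decisions)
```

In this model the source is blocked only while its in-scope request count within the window stays above the limit, and it is allowed again once older requests age out.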

Rule Groups

A Rule Group is a reusable set of rules that can be added to a Web ACL.
Rule groups fall into the following main categories:
- AWS Managed rule groups – maintained by AWS, includes:
  - Core rule set (CRS) – common web vulnerabilities
  - Known bad inputs – patterns associated with exploitation
  - SQL injection and XSS rules
  - IP reputation list
  - Anonymous IP list (VPNs, proxies, Tor)
  - Bot Control rule group
  - Account Takeover Prevention (ATP) rule group
  - Account Creation Fraud Prevention (ACFP) rule group
  - Anti-DDoS rule group (AWSManagedRulesAntiDDoSRuleSet) – launched June 2025
- AWS Marketplace rule groups – third-party managed rules
- Your own rule groups – custom rules you create and maintain
- Service-owned rule groups – managed by AWS Firewall Manager and Shield Advanced

Web ACLs – Access Control Lists (Protection Packs)

A Web Access Control List (Web ACL), also called a protection pack in the new console, provides fine-grained control over all HTTP(S) web requests that the protected resource responds to.
Web ACLs provide:
- Rule Groups OR Combination of Rules
- Action – allow, block, count, CAPTCHA, or Challenge for each rule
  - WAF compares a request with the rules in a web ACL in the order listed and takes the action associated with the first rule that matches.
  - When a web request matches all conditions in a rule with a terminating action (allow or block), WAF immediately takes that action and doesn’t evaluate the remaining rules. Count is non-terminating, so evaluation continues with the next rule.
- Default action
  - Determines whether WAF allows or blocks a request that does not match any of the rules.
Supports criteria like the following to allow or block requests:
- IP address origin of the request
- Country of origin of the request
- String match or regular expression (regex) match in a part of the request
- Size of a particular part of the request
- Detection of malicious SQL code or scripting
- Rate-based rules
- Label match from prior rules
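
The evaluation order described above can be sketched as a short loop. This is a simplified model under stated assumptions: only Allow, Block, and Count are modelled (CAPTCHA and Challenge are left out), and the rule names, priorities, and request fields are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

Request = Dict[str, str]
TERMINATING_ACTIONS = {"ALLOW", "BLOCK"}


@dataclass
class Rule:
    name: str
    priority: int
    matches: Callable[[Request], bool]
    action: str  # "ALLOW", "BLOCK", or "COUNT"


def evaluate(rules: List[Rule], default_action: str, request: Request) -> Tuple[str, List[str]]:
    """Return the final action and the names of Count rules that matched on the way."""
    counted: List[str] = []
    for rule in sorted(rules, key=lambda r: r.priority):
        if not rule.matches(request):
            continue
        if rule.action in TERMINATING_ACTIONS:
            return rule.action, counted  # first terminating match wins
        counted.append(rule.name)  # Count: record the match and keep evaluating
    return default_action, counted  # no terminating rule matched


rules = [
    Rule("count-curl", 0, lambda r: "curl" in r.get("user_agent", ""), "COUNT"),
    Rule("block-bad-ip", 1, lambda r: r.get("ip") == "192.0.2.44", "BLOCK"),
    Rule("allow-health", 2, lambda r: r.get("path") == "/health", "ALLOW"),
]

print(evaluate(rules, "ALLOW", {"ip": "192.0.2.44", "user_agent": "curl/8.0", "path": "/"}))
print(evaluate(rules, "BLOCK", {"ip": "198.51.100.9", "user_agent": "Mozilla", "path": "/"}))
```

The second call shows the default action taking effect when no rule with a terminating action matches.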

AWS WAF Bot Control

Bot Control provides visibility and control over common and pervasive bot traffic.
Bot Control detection catalog covers more than 650 unique bots and agents (as of 2026), including:
- AI search engine crawlers
- AI data collectors and scrapers
- AI assistants and agents
- Large language model (LLM) training crawlers
- Traditional scrapers, scanners, crawlers, and status monitors
Two levels of protection:
- Common – identifies self-identifying bots through request headers verification
- Targeted – advanced detection using behavioral analysis, browser fingerprinting, and ML-based detection for sophisticated bots that don’t self-identify
Actions available: Block, Allow, Count, CAPTCHA, Challenge, or custom response.
Uses AWS WAF token management for client session tracking.
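
As an illustration of choosing between the two protection levels, the sketch below builds a web ACL rule entry that references the Bot Control managed rule group. The structure follows the AWS WAF (v2) rule JSON; the helper function, rule name, and metric name are hypothetical.

```python
import json


def bot_control_rule(level: str, priority: int) -> dict:
    """Build a rule entry for the Bot Control managed rule group.

    level is "COMMON" or "TARGETED", matching the two protection levels above.
    """
    if level not in ("COMMON", "TARGETED"):
        raise ValueError("level must be 'COMMON' or 'TARGETED'")
    return {
        "Name": f"aws-bot-control-{level.lower()}",  # hypothetical rule name
        "Priority": priority,
        "Statement": {
            "ManagedRuleGroupStatement": {
                "VendorName": "AWS",
                "Name": "AWSManagedRulesBotControlRuleSet",
                "ManagedRuleGroupConfigs": [
                    {"AWSManagedRulesBotControlRuleSet": {"InspectionLevel": level}}
                ],
            }
        },
        # Rule groups take an override action instead of a rule action;
        # "None" keeps the actions defined inside the rule group.
        "OverrideAction": {"None": {}},
        "VisibilityConfig": {
            "SampledRequestsEnabled": True,
            "CloudWatchMetricsEnabled": True,
            "MetricName": f"BotControl{level.title()}",
        },
    }


print(json.dumps(bot_control_rule("TARGETED", priority=1), indent=2))
```

Starting with the override action set to Count instead of None is a common way to observe what the rule group would do before enforcing it.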

AI Activity Dashboard (Feb 2026)

Provides centralized visibility into AI bot and agent traffic reaching applications.
Visualize AI traffic trends over time.
Identify most active bots and frequently accessed paths.
Analyze request volumes by bot category and verification status.
Take action directly: allow verified AI search crawlers while rate-limiting or blocking unverified agents.
Classifies AI bots into three types:
- AI scrapers – systematically collect data to train AI models
- AI tools – surface data from applications in AI applications using function calling
- AI agents – autonomously navigate and interact dynamically with applications
Available at no additional cost for all WAF customers.

AI Traffic Monetization (June 2026)

Gives digital content owners and publishers a way to charge AI bots and agents for access to protected web content at the network edge.
Configure pricing through the AWS WAF console.
Define AI bot or agent policies based on verification status.
Supports Web Bot Auth signatures for bot identity verification.
Available at no additional WAF charge.

AWS WAF Fraud Control

Provides intelligent threat mitigation for fraud prevention.
Two managed rule groups:
- Account Takeover Prevention (ATP)
  - Detects and blocks credential stuffing and brute-force login attempts.
  - Analyzes login requests for compromised credentials.
  - Uses stolen credential databases to identify credential stuffing.
- Account Creation Fraud Prevention (ACFP)
  - Monitors sign-up and registration pages for anomalous activity.
  - Detects automated account creation using bots.
  - Blocks suspicious requests based on request identifiers and behavioral analysis.
Blocks fraud at the network edge when used with CloudFront, minimizing impact on application performance.
Uses client-side interrogation with JavaScript challenges and behavioral analysis.

AWS WAF Anti-DDoS Protection

The Anti-DDoS Managed Rule Group (AWSManagedRulesAntiDDoSRuleSet) launched in June 2025 provides automatic application-layer (Layer 7) DDoS protection.
Automatically detects and mitigates DDoS events of any duration in single-digit seconds.
Establishes a traffic baseline and uses it to detect anomalies.
When an attack is detected, labels requests:
- event-detected – added to all incoming requests during an event
- ddos-request – added to requests suspected of contributing to the attack
Supersedes the Shield Advanced Layer 7 Auto Mitigation (L7AM) feature as of March 2026.
Works with CloudFront, ALB, and other AWS WAF-supported services.
Customizable behavior using labels and additional WAF rules.
Managed by AWS Firewall Manager for centralized deployment.

AWS WAF Data Protection

Data Protection settings (Feb 2025) allow granular protection of sensitive information in WAF outputs.
Protects passwords, API keys, authentication tokens, and other confidential data in specific fields (headers, parameters, body content).
Applies to full logs, sampled requests, and Security Lake outputs.
Two transformation options:
- Substitution – replaces sensitive data with static strings
- Cryptographic hashing – replaces with hashed values for correlation without exposure
Configured per web ACL in the Logging and Metrics section.
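
The difference between the two transformation options can be illustrated with a toy log record. This is only a sketch of the idea: the placeholder string, the use of SHA-256, and the field names are assumptions, not the format AWS WAF actually writes.

```python
import hashlib
from typing import Dict


def protect_fields(record: Dict[str, str], rules: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of record with the listed fields substituted or hashed."""
    protected = dict(record)
    for field, mode in rules.items():
        if field not in protected:
            continue
        if mode == "SUBSTITUTION":
            # Static replacement: the value is hidden and cannot be correlated.
            protected[field] = "REDACTED"
        elif mode == "HASH":
            # Hashing: equal inputs give equal digests, so events can still be correlated.
            protected[field] = hashlib.sha256(protected[field].encode("utf-8")).hexdigest()
        else:
            raise ValueError(f"unknown protection mode: {mode}")
    return protected


# Hypothetical log fields and values.
log_record = {"uri": "/login", "password": "hunter2", "x-api-key": "abc123"}
rules = {"password": "SUBSTITUTION", "x-api-key": "HASH"}

print(protect_fields(log_record, rules))
```

Hashing suits fields such as API keys, where analysts need to see that the same key appeared in several requests, while substitution suits values that should never be comparable, such as passwords.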

AWS WAF Labels and Dynamic Label Interpolation

Labels are metadata added to web requests by matching rules, available for subsequent rules in the same web ACL.
Enable complex multi-rule logic without duplicating conditions.
Managed rule groups add labels to indicate match details (e.g., bot category, attack type).
Dynamic Label Interpolation (May 2026) enables forwarding WAF classification signals to origin servers:
- Use ${namespace:} syntax in custom request headers, response headers, and response bodies.
- Forward entire label namespaces at once.
- Eliminates need for multiple rules to pass different classification signals.
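
A rough sketch of multi-rule logic with labels follows: one rule adds a label without terminating evaluation, and a later rule in the same web ACL acts on that label. The structures follow the AWS WAF (v2) rule JSON; the label name, rule names, and paths are hypothetical, and the sketch assumes a label defined in the same web ACL can be matched by its short name.

```python
import json

SUSPICIOUS_LABEL = "example:suspicious-client"  # hypothetical label name
NO_TRANSFORM = [{"Priority": 0, "Type": "NONE"}]


def visibility(metric_name: str) -> dict:
    return {
        "SampledRequestsEnabled": True,
        "CloudWatchMetricsEnabled": True,
        "MetricName": metric_name,
    }


# Rule 1: count requests whose User-Agent contains "scanner" and label them.
labeling_rule = {
    "Name": "label-suspicious-clients",
    "Priority": 0,
    "Statement": {
        "ByteMatchStatement": {
            "SearchString": "scanner",
            "FieldToMatch": {"SingleHeader": {"Name": "user-agent"}},
            "TextTransformations": NO_TRANSFORM,
            "PositionalConstraint": "CONTAINS",
        }
    },
    "Action": {"Count": {}},  # non-terminating, so later rules still run
    "RuleLabels": [{"Name": SUSPICIOUS_LABEL}],
    "VisibilityConfig": visibility("LabelSuspiciousClients"),
}

# Rule 2: block labelled requests, but only when they target /admin.
label_match_rule = {
    "Name": "block-suspicious-admin-access",
    "Priority": 1,
    "Statement": {
        "AndStatement": {
            "Statements": [
                {"LabelMatchStatement": {"Scope": "LABEL", "Key": SUSPICIOUS_LABEL}},
                {
                    "ByteMatchStatement": {
                        "SearchString": "/admin",
                        "FieldToMatch": {"UriPath": {}},
                        "TextTransformations": NO_TRANSFORM,
                        "PositionalConstraint": "STARTS_WITH",
                    }
                },
            ]
        }
    },
    "Action": {"Block": {}},
    "VisibilityConfig": visibility("BlockSuspiciousAdminAccess"),
}

print(json.dumps([labeling_rule, label_match_rule], indent=2))
```

The second rule must have a higher priority number than the first, because a label is only visible to rules evaluated after the rule that added it.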

New Console Experience (June 2025)

Simplified console reduces web application security configuration steps by up to 80%.
Protection Packs – pre-configured rule packs for specific workloads:
- Recommended – enables recommended protections for selected application categories
- Essentials – enables essential protections
- You build it – select and customize from available options
Automated security recommendations based on AWS Threat Intelligence analysis of allowed traffic patterns.
Unified dashboard with Sankey visualization of protection activity to WAF actions.
Integrated log explorer with pre-built filters.
Direct AWS Marketplace integration for partner security solutions.
Available at no additional cost.

AWS WAF Architecture

AWS WAF integration with CloudFront and Lambda to dynamically update WAF rules
CloudFront receives requests on behalf of the web application and sends access logs, which contain detailed information about the requests, to an S3 bucket.
For every new access log stored in the S3 bucket, a Lambda function is triggered. The Lambda function parses the log files and looks for requests that resulted in error codes 400, 403, 404, and 405.
Lambda function then counts the number of bad requests and temporarily stores results in the S3 bucket
Lambda function updates AWS WAF rules to block the IP addresses for a period of time that you specify.
After this blocking period has expired, AWS WAF allows those IP addresses to access your application again, but continues to monitor the requests from those IP addresses.
Lambda function publishes execution metrics in CloudWatch, such as the number of requests analyzed and IP addresses blocked.
CloudWatch metrics can be integrated with SNS for notification
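
The log-parsing step in this architecture can be sketched as follows. It assumes the tab-separated CloudFront standard log layout, where the client IP is the fifth field and the HTTP status is the ninth; the threshold, function names, and sample rows are hypothetical, and the call that updates the WAF IP set is left out.

```python
from collections import Counter
from typing import Iterable, Set

BAD_STATUS = {"400", "403", "404", "405"}  # error codes listed above
C_IP, SC_STATUS = 4, 8  # zero-based field positions in a CloudFront standard log row


def offending_ips(log_lines: Iterable[str], threshold: int) -> Set[str]:
    """Return client IPs with at least `threshold` bad requests in the log."""
    bad_requests: Counter = Counter()
    for line in log_lines:
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and the '#Version' / '#Fields' header
        fields = line.rstrip("\n").split("\t")
        if len(fields) <= SC_STATUS:
            continue  # malformed row
        if fields[SC_STATUS] in BAD_STATUS:
            bad_requests[fields[C_IP]] += 1
    return {ip for ip, count in bad_requests.items() if count >= threshold}


def make_line(ip: str, status: str) -> str:
    """Hypothetical helper that builds a shortened sample log row."""
    return "\t".join([
        "2026-01-01", "00:00:00", "IAD89-C1", "512", ip,
        "GET", "d111111abcdef8.cloudfront.net", "/index.html", status,
    ])


sample_log = ["#Version: 1.0"]
sample_log += [make_line("203.0.113.7", "404")] * 3
sample_log += [make_line("198.51.100.9", "200"), make_line("198.51.100.9", "403")]

print(offending_ips(sample_log, threshold=3))
```

In the architecture above, the returned addresses would then be written to the IP set referenced by a blocking rule and removed again once the blocking period expires.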

Web Application Firewall Sandwich Architecture (Historical)

NOTE: This is from the older DDoS Resiliency Whitepaper. It uses third-party WAF software on EC2 instances, NOT AWS WAF. With the introduction of AWS WAF Anti-DDoS Managed Rule Group (June 2025), this pattern is largely superseded by native AWS WAF protections.

DDoS attacks at the application layer commonly target web applications with lower volumes of traffic compared to infrastructure attacks.
WAF can be included as part of the infrastructure to mitigate these types of attacks.
WAFs act as filters that apply a set of rules to web traffic, which cover exploits like XSS and SQL injection but can also help build resiliency against DDoS by mitigating HTTP GET or POST floods.
In the “WAF sandwich,” the EC2 instance running third-party WAF software (not the AWS WAF service) is included in an Auto Scaling group and placed between two ELB load balancers.
With WAF sandwich pattern, the instances can scale and add additional WAF EC2 instances should the traffic spike to elevated levels.
Once the traffic has been inspected and filtered, the WAF EC2 instance forwards traffic to the internal, backend load balancer which then distributes traffic across the application EC2 instances.
Modern Alternative: Use AWS WAF with the Anti-DDoS managed rule group attached to CloudFront or ALB for native Layer 7 DDoS protection without managing EC2-based WAF instances.

AWS Certification Exam Practice Questions

Questions are collected from the Internet, and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day, and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed, the question might not be updated.

Open to further feedback, discussion and correction.

The Web Application Development team is worried about malicious activity from 200 random IP addresses. Which action will ensure security and scalability from this type of threat?
1. Use inbound security group rules to block the IP addresses.
2. Use inbound network ACL rules to block the IP addresses.
3. Use AWS WAF to block the IP addresses.
4. Write iptables rules on the instance to block the IP addresses.
You’ve been hired to enhance the overall security posture for a very large e-commerce site. They have a well architected multi-tier application running in a VPC that uses ELBs in front of both the web and the app tier with static assets served directly from S3. They are using a combination of RDS and DynamoDB for their dynamic data and then archiving nightly into S3 for further processing with EMR. They are concerned because they found questionable log entries and suspect someone is attempting to gain unauthorized access. Which approach provides a cost effective scalable mitigation to this kind of attack? [Old Exam Question]
1. Recommend that they lease space at a DirectConnect partner location and establish a 1G DirectConnect connection to their VPC they would then establish Internet connectivity into their space, filter the traffic in hardware Web Application Firewall (WAF). And then pass the traffic through the DirectConnect connection into their application running in their VPC. (Not cost effective)
2. Add previously identified hostile source IPs as an explicit INBOUND DENY NACL to the web tier subnet. (does not protect against new sources)
3. Add a WAF tier by creating a new ELB and an AutoScaling group of EC2 Instances running a host-based WAF. They would redirect Route 53 to resolve to the new WAF tier ELB. The WAF tier would then pass the traffic to the current web tier. Web tier Security Groups would be updated to only allow traffic from the WAF tier Security Group
4. Remove all but TLS 1.2 from the web tier ELB and enable Advanced Protocol Filtering. This will enable the ELB itself to perform WAF functionality. (No advanced protocol filtering in ELB)
NOTE: This is an older exam question. In modern architectures, AWS WAF can be directly attached to CloudFront or ALB without needing EC2-based WAF instances.
A company’s web application is experiencing a high volume of automated bot traffic that is consuming resources and scraping proprietary content. The security team needs to implement bot management that can differentiate between legitimate users, verified search engine crawlers, and malicious bots. Which AWS WAF feature should they implement?
1. Rate-based rules with IP-based aggregation
2. AWS WAF Bot Control with Targeted protection level
3. Geographic match rules to block countries with high bot traffic
4. Custom regex rules to match bot User-Agent strings
A media company wants to allow verified AI search crawlers to access their content while blocking unverified AI data scrapers. Which combination of AWS WAF features provides this capability? (Select TWO)
1. AWS WAF Bot Control with AI bot category detection
2. Network ACL rules with IP deny lists
3. AI Activity Dashboard to identify and categorize AI bot traffic
4. AWS Shield Advanced automatic DDoS protection
5. AWS Firewall Manager centralized policy
An organization is experiencing a Layer 7 DDoS attack against their web application hosted behind an Application Load Balancer. They need automatic detection and mitigation without manual intervention. Which is the MOST effective solution?
1. Create a rate-based rule with a threshold of 100 requests per 5 minutes
2. Enable AWS Shield Advanced with automatic application layer mitigation
3. Add the AWS WAF Anti-DDoS Managed Rule Group (AWSManagedRulesAntiDDoSRuleSet) to the web ACL
4. Deploy EC2 instances running third-party WAF software in a WAF sandwich architecture
A security engineer needs to protect login pages from credential stuffing attacks and detect compromised credentials. Which AWS WAF feature should they enable?
1. AWS WAF Bot Control Common level
2. Rate-based rules with URI path scope-down
3. AWS WAF Fraud Control Account Takeover Prevention (ATP)
4. SQL injection rule group from AWS Managed Rules
A company needs to ensure sensitive data like API keys and passwords in web requests are not exposed in WAF logs while still maintaining full logging for security analysis. Which AWS WAF feature addresses this requirement?
1. CloudWatch Logs field-level encryption
2. S3 bucket encryption for WAF log storage
3. AWS WAF Data Protection with substitution or cryptographic hashing
4. Kinesis Data Firehose data transformation

AWS Identity Services Cheat Sheet

October 6, 2022 ~ Last updated on : June 21, 2026 ~ jayendrapatil

AWS Identity and Security Services

IAM – Identity & Access Management

securely control access to AWS services and resources
helps create and manage user identities and grant permissions for those users to access AWS resources
helps create groups for multiple users with similar permissions
not appropriate for application authentication
is Global and does not need to be migrated to a different region
helps define Policies
- in JSON format
- all permissions are implicitly denied by default
- an explicit deny always overrides any allow (most restrictive policy wins)
IAM Role
- helps grant and delegate access to users and services without the need to create permanent credentials
- IAM users or AWS services can assume a role to obtain temporary security credentials that can be used to make AWS API calls
- needs Trust policy to define who and Permission policy to define what the user or service can access
- used with Security Token Service (STS), a lightweight web service that provides temporary, limited privilege credentials for IAM users or for authenticated federated users
- IAM role scenarios
  - Service access for e.g. EC2 to access S3 or DynamoDB
  - Cross Account access for users
    - with user within the same account
    - with user within an AWS account owned by the same owner
    - with user from a Third Party AWS account with External ID for enhanced security
  - Identity Providers & Federation
    - AssumeRoleWithWebIdentity – Web Identity Federation, where the user can be authenticated using external Identity Providers like Amazon, Google, or any OpenID Connect (OIDC) IdP
    - AssumeRoleWithSAML – Identity Provider using SAML 2.0, where the user can be authenticated using on-premises Active Directory, OpenLDAP, or any SAML 2.0 compliant IdP
    - AssumeRole (recommended) or GetFederationToken – For other Identity Providers, use Identity Broker to authenticate and provide temporary Credentials
IAM MFA (Multi-Factor Authentication)
- AWS supports FIDO2 passkeys, virtual MFA devices (authenticator apps), and hardware MFA tokens
- SMS MFA has been discontinued – use FIDO2 passkeys or virtual/hardware MFA devices instead
- AWS enforces MFA for root users across all account types (rolled out 2024-2025)
- FIDO2 passkeys use public key cryptography for phishing-resistant authentication
- Up to 8 MFA devices can be registered per IAM user
IAM Best Practices
- Do not use Root account for anything other than billing
- Create Individual IAM users
- Use groups to assign permissions to IAM users
- Grant least privilege
- Use IAM roles for applications on EC2
- Delegate using roles instead of sharing credentials
- Rotate credentials regularly
- Use Policy conditions for increased granularity
- Use CloudTrail to keep a history of activity
- Enforce a strong IAM password policy for IAM users
- Remove all unused users and credentials
- Enable MFA for all users, especially root accounts – use FIDO2 passkeys for strongest protection
- Use IAM Access Analyzer to identify unused access and overly permissive policies
Increased IAM Quotas (May 2026)
- Roles per account: up to 10,000
- Managed policies per account: up to 10,000
- Role trust policy size: up to 8,192 characters

IAM Roles Anywhere

enables workloads running outside of AWS (on-premises, hybrid, multi-cloud) to access AWS resources using temporary credentials
eliminates the need for long-term AWS access keys for external workloads
uses X.509 certificates from your Certificate Authority (CA) for authentication
integrates with existing enterprise PKI infrastructure
key components:
- Trust Anchor – establishes trust between IAM Roles Anywhere and your CA
- Profile – specifies the IAM roles and session policies
- Credential Helper – tool that runs on the workload to obtain temporary credentials
supports workloads on-premises, in containers, or in other cloud providers
uses the same IAM policies and roles as AWS workloads for consistent access control

IAM Access Analyzer

helps identify resources shared with external entities and validate IAM policies
provides External Access Analysis – identifies resources accessible from outside your account or organization
provides Unused Access Analysis – continuously monitors for:
- Unused IAM roles
- Unused access keys for IAM users
- Unused passwords for IAM users
- Unused services and actions for active roles/users
supports Custom Policy Checks – validates policies before deployment against best practices
generates policy recommendations based on access activity (least privilege)
integrates with AWS Security Hub for centralized findings
zone of trust can be set at account or organization level

AWS Organizations

is an account management service that enables consolidating multiple AWS accounts into an organization that can be centrally managed.
includes consolidated billing and account management capabilities that help you better meet the budgetary, security, and compliance needs of your business.
As an administrator of an organization, you can create new accounts in the organization and invite existing accounts to join it.
enables you to
- Automate AWS account creation and management, and provision resources with AWS CloudFormation StackSets.
- Maintain a secure environment with policies and management of AWS security services
- Govern access to AWS services, resources, and regions
- Centrally manage policies across multiple AWS accounts
- Audit your environment for compliance
- View and manage costs with consolidated billing
- Configure AWS services across multiple accounts
supports Service Control Policies – SCPs
- offer central control over the maximum available permissions for all of the accounts in your organization, ensuring member accounts stay within the organization’s access control guidelines.
- are available only in an organization that has all features enabled, and aren’t available if the organization has enabled only the consolidated billing features.
- are NOT sufficient for granting access to the accounts in the organization.
- define guardrails for what actions accounts within the organization root or OU can perform, but IAM policies still need to be attached to the users and roles in the organization’s accounts to grant permissions to them.
- Effective permissions are the logical intersection between what is allowed by the SCP and what is allowed by the IAM and resource-based policies.
- with an SCP attached to member accounts, identity-based and resource-based policies grant permissions to entities only if those policies and the SCP allow the action
- don’t affect users or roles in the management account. They affect only the member accounts in your organization.
supports Resource Control Policies (RCPs) – launched Nov 2024
- a new authorization policy type that sets the maximum available permissions on resources within the organization
- complement SCPs – SCPs control what principals can do, RCPs control what can be done on resources
- help centrally restrict external access to AWS resources at scale (establish data perimeters)
- don’t affect resources in the management account – only affect resources in member accounts
- work alongside SCPs to provide comprehensive authorization guardrails
- supported by AWS Control Tower for managed preventive controls
supports Declarative Policies – launched Dec 2024 at re:Invent
- a new management policy type that declares and enforces desired configuration for AWS services at scale
- different from SCPs/RCPs – declarative policies enforce service configurations, not just permissions
- configuration is always maintained even when the service adds new features or APIs
- simplifies governance by defining durable intent for baseline service configurations
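
The SCP rule described above (effective permissions are the intersection of what the SCP allows and what IAM allows, with the management account exempt) can be modelled in a few lines. This is a simplified sketch: it treats permissions as plain sets of action names and ignores explicit denies, wildcards, resource-based policies, and RCPs. The function and parameter names are hypothetical.

```python
def effective_permissions(scp_allowed, iam_allowed, is_management_account=False):
    """Return the actions a principal can actually perform.

    scp_allowed: actions permitted by the SCPs that apply to the account
    iam_allowed: actions granted by the principal's IAM policies
    """
    if is_management_account:
        # SCPs do not affect users or roles in the management account.
        return set(iam_allowed)
    # An SCP never grants access by itself; it only caps what IAM can grant.
    return set(scp_allowed) & set(iam_allowed)


scp = {"s3:GetObject", "s3:PutObject", "ec2:DescribeInstances"}
iam = {"s3:GetObject", "iam:CreateUser"}

print(effective_permissions(scp, iam))
print(effective_permissions(scp, iam, is_management_account=True))
```

In the member account only `s3:GetObject` survives: `iam:CreateUser` is granted by IAM but not allowed by the SCP, and the remaining SCP actions were never granted by IAM.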

AWS Directory Services

gives applications in AWS access to Active Directory services
different from SAML + AD, where the access is granted to AWS services through Temporary Credentials
AWS Managed Microsoft AD
- fully managed Microsoft Active Directory powered by Windows Server
- available in Standard and Enterprise editions
- supports self-service API-driven edition upgrades (Standard to Enterprise) – Oct 2025
- supports dual-stack networking (IPv4 and IPv6) – Sep 2025
- includes Directory Service Data API for built-in object management (users, groups, attributes) – Sep 2024
- Hybrid Edition (Aug 2025) – extends your existing self-managed AD domain to AWS Managed Microsoft AD
  - automatically handles replication between on-premises AD and AWS
  - preserves existing identity and access infrastructure
  - simplifies migration of AD-dependent workloads to AWS
  - supports extending domains from on-premises, AWS, or multi-cloud
Simple AD
- least expensive but does not support Microsoft AD advanced features
- provides a Samba 4 Microsoft Active Directory compatible standalone directory service on AWS
- No single point of Authentication or Authorization, as a separate copy is maintained
- trust relationships cannot be set up between Simple AD and other Active Directory domains
- Don’t use it if the requirement is to leverage access and control through a centralized authentication service
AD Connector
- acts as a hosted proxy service for instances in AWS to connect to on-premises Active Directory
- enables consistent enforcement of existing security policies, such as password expiration, password history, and account lockouts, whether users are accessing resources on-premises or in the AWS cloud
- needs VPN connectivity (or Direct Connect)
- integrates with existing RADIUS-based MFA solutions to enable multi-factor authentication
- does not cache data which might lead to latency
Read-only Domain Controllers (RODCs)
- work as read-only Active Directory domain controllers
- hold a copy of the Active Directory Domain Services (AD DS) database and respond to authentication requests
- cannot be written to and are typically deployed in locations where physical security cannot be guaranteed
- help maintain a single point of authentication and authorization control, but need to be kept in sync
Writable Domain Controllers
- are expensive to set up
- operate in a multi-master model; changes can be made on any writable server in the forest, and those changes are replicated to servers throughout the entire forest

AWS IAM Identity Center (formerly AWS Single Sign-On)

is the recommended service for managing workforce access to AWS accounts and applications (formerly known as AWS SSO, renamed July 2022)
provides centralized SSO access to all AWS accounts and cloud applications
helps manage access and permissions to commonly used third-party software as a service (SaaS) applications, AWS-integrated applications as well as custom applications that support SAML 2.0.
includes a user portal where end-users can find and access all their assigned AWS accounts, cloud applications, and custom applications in one place.
supports connecting external identity providers (Okta, Microsoft Entra ID, Ping Identity) or using built-in directory
Trusted Identity Propagation
- enables administrators to grant permissions based on user attributes (user ID, group associations) across AWS service boundaries
- eliminates the need for service-specific identity mapping
- supports services like Amazon Redshift, Amazon Q Business, Amazon EMR, and more
Multi-Region Replication (Feb 2026)
- replicate identity configurations across multiple AWS Regions
- provides active access portal endpoints in multiple Regions for improved availability
- available for organization instances connected to external identity providers
- currently available in 17 enabled-by-default commercial AWS Regions
supports customer managed policies and permission boundaries in permission sets

Amazon Cognito

Amazon Cognito provides authentication, authorization, and user management for web and mobile apps.
Users can sign in directly with a username and password, or through a third party such as Facebook, Amazon, Google, or Apple.
Cognito has two main components.
- User pools are user directories that provide sign-up and sign-in options for the app users.
- Identity pools enable you to grant the users access to other AWS services.
Feature Tiers (Nov 2024) – User pools now offer three tiers:
- Lite – basic authentication features (existing user pools default to this)
- Essentials – includes Managed Login, passwordless authentication (passkeys, email, SMS), access token customization, password reuse prevention (new user pools default to this)
- Plus – adds advanced security features including adaptive authentication, threat protection, and compromised credentials detection
Managed Login (Nov 2024) – fully managed, hosted sign-in/sign-up experience with rich branding customization
Passwordless Authentication (Nov 2024)
- supports passkeys (FIDO standards, public key cryptography) for phishing-resistant sign-in
- supports email and SMS one-time passwords
- available in the Essentials tier
Refresh Token Rotation (Apr 2025) – enables automatic rotation of OAuth 2.0 refresh tokens for improved security
Client Secret Management (Feb 2026) – custom client secrets, on-demand rotation, up to two active secrets per app client
Multi-Region Replication (2026) – replicate user pools across Regions for business continuity and reduced latency
Customer-Managed Keys – full control over data encryption at rest using your own KMS keys
Cognito Sync – Note: AWS recommends using AWS AppSync instead of Cognito Sync for new implementations. AppSync provides similar data synchronization with additional real-time and offline capabilities.

Amazon Verified Permissions

a fully managed, fine-grained authorization service for applications (GA 2023)
uses Cedar, an open-source policy language purpose-built for authorization
externalizes authorization logic from application code for consistent access control
supports both Role-Based Access Control (RBAC) and Attribute-Based Access Control (ABAC)
key components:
- Policy Store – container for Cedar policies, logically isolated from other stores
- Policies and Templates – define who can do what on which resources
- Schema – defines entity types, actions, and their relationships
- Authorization Requests – real-time evaluation of user access against policies
integrates natively with Amazon Cognito for identity context
aligns with Zero Trust principles – least privilege and continuous verification
supports multi-tenant authorization with multiple identity providers
enables security teams to audit and analyze application-level access centrally

AWS Security Services Cheat Sheet

October 6, 2022 ~ Last updated on : June 26, 2026 ~ jayendrapatil

AWS IAM Identity Center (Successor to AWS SSO)

is a centralized workforce identity management service that provides single sign-on (SSO) access to multiple AWS accounts and business applications.
was renamed from AWS Single Sign-On (AWS SSO) in July 2022.
enables administrators to define, customize, and assign fine-grained access across AWS accounts and applications.
provides workforce users a portal to access AWS accounts and cloud applications assigned to them.
supports integration with external identity providers (IdPs) like Microsoft Active Directory, Okta, and Microsoft Entra ID (formerly Azure AD).
simplifies multi-account access management through AWS Organizations integration.
provides temporary credentials instead of long-term IAM user credentials.
supports attribute-based access control (ABAC) for fine-grained permissions.

Key Management Service – KMS

is a managed encryption service that allows the creation and control of encryption keys to enable data encryption.
provides a highly available key storage, management, and auditing solution to encrypt the data across AWS services & within applications.
uses hardware security modules (HSMs) that are FIPS 140-3 Security Level 3 certified (upgraded from FIPS 140-2 in May 2023).
seamlessly integrates with several AWS services to make encrypting data in those services easy.
supports multi-Region keys, which are AWS KMS keys in different AWS Regions that share the same key ID and key material and can be used interchangeably. Multi-Region keys are not global; each multi-Region key needs to be replicated and managed independently.
supports External Key Store (XKS) capability (November 2022) allowing customers to store and control encryption keys on-premises or outside AWS cloud while using AWS KMS.
provides three key store options: Default KMS key store, CloudHSM custom key store, and External key store (XKS).
supports on-demand key rotation (April 2024) allowing immediate rotation of symmetric encryption keys without waiting for automatic rotation schedules, with a maximum of 10 on-demand rotations per key.
offers flexible automatic rotation periods (90 days to 2560 days) instead of the previous fixed annual rotation.
supports post-quantum cryptography:
- ML-KEM hybrid post-quantum key exchange for TLS connections to KMS endpoints, protecting against “harvest now, decrypt later” attacks.
- ML-DSA (FIPS 204) post-quantum digital signatures (June 2025) for quantum-resistant signing operations within FIPS 140-3 Level 3 certified HSMs.
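
The rotation limits listed above (automatic rotation period of 90 to 2560 days, at most 10 on-demand rotations per key) can be captured as a small pre-flight check. This is only a sketch of the stated limits; the function names are hypothetical and no AWS API is called.

```python
MIN_ROTATION_DAYS = 90
MAX_ROTATION_DAYS = 2560
MAX_ON_DEMAND_ROTATIONS = 10


def validate_rotation_period(days: int) -> int:
    """Return days unchanged if it is a valid automatic rotation period."""
    if not MIN_ROTATION_DAYS <= days <= MAX_ROTATION_DAYS:
        raise ValueError(
            f"rotation period must be {MIN_ROTATION_DAYS}-{MAX_ROTATION_DAYS} days, got {days}"
        )
    return days


def can_rotate_on_demand(rotations_so_far: int) -> bool:
    """True while the key is still under the on-demand rotation limit."""
    return rotations_so_far < MAX_ON_DEMAND_ROTATIONS


print(validate_rotation_period(180))
print(can_rotate_on_demand(9), can_rotate_on_demand(10))
```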

CloudHSM

provides secure cryptographic key storage to customers by making hardware security modules (HSMs) available in the AWS cloud
helps manage your own encryption keys using FIPS 140-3 Level 3 validated HSMs (upgraded from FIPS 140-2).
single tenant, dedicated physical device to securely generate, store, and manage cryptographic keys used for data encryption
are inside the VPC (not EC2-classic) & isolated from the rest of the network
can use VPC peering to connect to CloudHSM from multiple VPCs
integrated with Amazon Redshift and Amazon RDS for Oracle
EBS volume encryption, S3 object encryption and key management can be done with CloudHSM but requires custom application scripting
a single HSM is NOT fault-tolerant; build a cluster with multiple HSMs, because if a standalone HSM fails, all of its keys are lost
enables quick scaling by adding and removing HSM capacity on-demand, with no up-front costs.
automatically load balances requests and securely duplicates keys stored in any HSM to all of the other HSMs in the cluster.
launched hsm2m.medium instance type (August 2024) with FIPS 140-3 Level 3 certification, increased key storage (16,666 keys), higher elliptic curve performance, mTLS support, and non-FIPS cluster mode option.
deprecated hsm1.medium instance type — no new hsm1 clusters can be created as of April 2025; customers must migrate to hsm2m.medium.
expensive, prefer AWS Key Management Service (KMS) if cost is a criterion.

AWS Payment Cryptography

is a managed service for payment processing cryptographic operations (launched June 2023).
provides payment-specific HSMs that replace on-premises payment hardware security modules.
helps meet PCI (Payment Card Industry) security requirements and compliance needs.
supports cryptographic operations like PIN generation, validation, and credit/debit card security code processing.
manages underlying physical HSM infrastructure and key management automatically.
integrates with AWS IAM for authorization and AWS CloudTrail for auditing.
enables payment processing workloads to move to the cloud securely.
provides elastic scaling for payment cryptography operations.

AWS Private Certificate Authority (Private CA)

is a managed private certificate authority service for issuing and managing private SSL/TLS certificates.
removes upfront investment and ongoing maintenance costs of operating your own private CA.
supports two operating modes: General-purpose mode (certificates with any validity period) and Short-lived certificate mode (certificates valid up to 7 days, launched February 2023).
integrates with AWS Certificate Manager (ACM) for automated certificate provisioning and renewal.
supports Private CA Connector for Active Directory (September 2023) enabling AWS Private CA as drop-in replacement for self-managed enterprise CAs without local agents.
supports post-quantum ML-DSA digital certificates (November 2025) for transitioning PKI toward post-quantum cryptography.
provides audit and compliance support through AWS CloudTrail integration.
enables certificate-based authentication for services like Amazon WorkSpaces.

AWS WAF

is a web application firewall that helps monitor the HTTP/HTTPS traffic and allows controlling access to the content.
helps protect web applications from attacks by letting you configure rules that allow, block, or monitor (count) web requests based on defined conditions. These conditions include IP addresses, HTTP headers, HTTP body, URI strings, SQL injection, and cross-site scripting.
helps define Web ACLs, which are combinations of Rules; each Rule combines Conditions with an Action to block, allow, or count
integrated with CloudFront, Application Load Balancer (ALB), API Gateway, Amazon Cognito, AWS App Runner, and AWS Verified Access.
supports custom origins outside of AWS, when integrated with CloudFront
provides AWS WAF Fraud Control with three capabilities:
- Account Takeover Prevention (ATP) – Protects login pages against credential stuffing attacks
- Account Creation Fraud Prevention (ACFP) – Detects and blocks automated bot-based account creation
- Bot Control – Detects and controls common bots and targeted bots with a catalog of 650+ unique bots including AI crawlers, AI data collectors, AI assistants, and LLM training crawlers
supports Challenge and CAPTCHA actions for bot mitigation.
provides AI Activity Dashboard (February 2026) for visibility into AI bot and agent traffic patterns.
launched AI Traffic Monetization (June 2026), a Bot Control capability that lets content providers price, meter, and collect payment from AI bots and agents accessing their content and APIs via HTTP 402 Payment Required responses.
AWS WAF Classic reached end of support on September 30, 2025. All customers must use AWS WAF (v2).
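
The Web ACL structure described above (rules made of conditions plus an action) can be sketched as a small evaluator. This is a simplified model, not the WAF engine: rules run in priority order, the first matching allow or block rule ends evaluation, count rules only record a match and continue, and the default action applies when nothing terminates. The rule names and request fields are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    name: str
    priority: int                      # lower numbers are evaluated first
    condition: Callable[[dict], bool]  # returns True when the request matches
    action: str                        # "allow", "block", or "count"


def evaluate(rules, request, default_action="allow"):
    """Return (final_action, names_of_count_rules_that_matched)."""
    counted = []
    for rule in sorted(rules, key=lambda r: r.priority):
        if rule.condition(request):
            if rule.action == "count":
                counted.append(rule.name)  # non-terminating: keep evaluating
                continue
            return rule.action, counted    # allow/block terminates evaluation
    return default_action, counted


rules = [
    Rule("count-curl", 0, lambda r: "curl" in r["user_agent"], "count"),
    Rule("block-bad-ip", 1, lambda r: r["ip"] == "203.0.113.7", "block"),
]

print(evaluate(rules, {"ip": "203.0.113.7", "user_agent": "curl/8.0"}))
print(evaluate(rules, {"ip": "198.51.100.1", "user_agent": "Mozilla/5.0"}))
```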

AWS Verified Access

provides VPN-less, secure access to corporate applications (GA April 2023).
implements Zero Trust security model for application access without traditional VPN.
validates each application request against identity and device security requirements before granting access.
integrates with identity providers (IdPs) and device management systems for authentication and authorization.
uses Cedar policy language for fine-grained access control policies.
supports AWS WAF integration for additional web application protection.
provides signed identity context to end applications for additional security.
simplifies remote access management and improves user experience compared to VPN.
eliminates VPN infrastructure management overhead.

Amazon Verified Permissions

is a fully managed fine-grained authorization service for custom applications (GA June 2023).
uses Cedar, an open-source policy language released May 2023, for defining authorization policies.
enables developers to externalize authorization logic from application code.
provides centralized policy management and administration.
offers millisecond-latency authorization decisions with provably correct results.
supports policy validation using automated reasoning to prevent misconfigurations.
integrates with identity providers for user and group information.
enables fine-grained permissions based on user attributes, resource attributes, and context.
provides policy versioning and audit capabilities.
follows “explicit permit” and “forbid overrides permit” principles.
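
The two principles in the last line can be sketched as a tiny decision function: access is denied unless at least one permit policy matches, and any matching forbid policy wins over every permit. This models only the decision logic in Python; it is not Cedar syntax, and the policy shape and request fields are hypothetical.

```python
def decide(policies, request):
    """Return "Allow" or "Deny" for a request.

    Each policy is a dict with an "effect" ("permit" or "forbid") and a
    "when" callable that returns True if the policy applies to the request.
    """
    matched = [p for p in policies if p["when"](request)]
    if any(p["effect"] == "forbid" for p in matched):
        return "Deny"   # forbid overrides permit
    if any(p["effect"] == "permit" for p in matched):
        return "Allow"  # explicit permit
    return "Deny"       # nothing matched: default deny


policies = [
    {"effect": "permit", "when": lambda r: r["role"] == "editor" and r["action"] == "edit"},
    {"effect": "forbid", "when": lambda r: r["resource_locked"]},
]

print(decide(policies, {"role": "editor", "action": "edit", "resource_locked": False}))
print(decide(policies, {"role": "editor", "action": "edit", "resource_locked": True}))
print(decide(policies, {"role": "viewer", "action": "edit", "resource_locked": False}))
```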

AWS Secrets Manager

helps protect secrets needed to access applications, services, and IT resources.
enables you to easily rotate, manage, and retrieve database credentials, API keys, and other secrets throughout their lifecycle.
secures secrets by encrypting them with encryption keys managed using AWS KMS.
offers native secret rotation with built-in integration for RDS, Redshift, and DocumentDB.
supports Lambda functions to extend secret rotation to other types of secrets, including API keys and OAuth tokens.
supports IAM and resource-based policies for fine-grained access control to secrets and centralized secret rotation audit for resources in the AWS Cloud, third-party services, and on-premises.
enables secret replication in multiple AWS regions to support multi-region applications and disaster recovery scenarios, automatically keeping replicas in sync including rotation.
launched Managed External Secrets (November 2025) — a new secret type enabling automatic rotation for third-party SaaS credentials (Salesforce, MongoDB Atlas, Confluent Cloud, Datadog, Snowflake) without custom Lambda rotation functions.
supports hybrid post-quantum TLS (ML-KEM) for protecting secrets against future quantum computing threats (April 2026).
supports private access using VPC Interface endpoints

AWS Shield

is a managed service that provides protection against Distributed Denial of Service (DDoS) attacks for applications running on AWS
provides protection for all AWS customers against common and most frequently occurring infrastructure (layer 3 and 4) attacks like SYN/UDP floods, reflection attacks, and others to support high availability of applications on AWS.
provides AWS Shield Advanced with additional protections against more sophisticated and larger attacks for applications running on EC2, ELB, CloudFront, AWS Global Accelerator, and Route 53.
Shield Advanced provides 24/7 access to AWS Shield Response Team (SRT) and cost protection against DDoS-related spikes.
AWS Shield Network Security Director (preview) performs analysis of resources to visualize network topology, identify configuration issues, and provide actionable remediation recommendations.

Amazon GuardDuty

offers threat detection that enables continuous monitoring and protects the AWS accounts and workloads.
is a Regional service
analyzes continuous streams of metadata generated from AWS accounts and network activity found in AWS CloudTrail Events, EKS audit logs, VPC Flow Logs, and DNS Logs.
integrated threat intelligence
combines machine learning, anomaly detection, network monitoring, and malicious file discovery, utilizing both AWS-developed and industry-leading third-party sources to help protect workloads and data on AWS
supports suppression rules, trusted IP lists, and threat lists. Now supports custom entity lists (September 2025) with domain-based threat intelligence in addition to IP-based lists.
provides Malware Protection to detect malicious files on EBS volumes and S3 objects (on-demand scanning API).
provides EKS Runtime Monitoring using fully managed EKS add-on for visibility into container runtime activities (file access, process execution, network connections).
provides RDS Protection for profiling and monitoring access activity to Amazon Aurora databases.
provides Lambda Protection for monitoring AWS Lambda function invocations and runtime behavior.
can identify specific containers within EKS clusters that are potentially compromised and detect privilege escalation attempts.
launched Extended Threat Detection (December 2024) — AI/ML-powered attack sequence identification that detects multi-stage attacks spanning multiple AWS data sources and resources, including EC2 instances and ECS clusters on Fargate.
offers flexible protection plan configuration — new accounts can inherit protection plans automatically, and plans can be enabled/disabled independently.
operates completely independently from the resources so there is no risk of performance or availability impacts on the workloads.

Amazon Inspector

is a vulnerability management service that continuously scans the AWS workloads for vulnerabilities
automatically discovers and scans EC2 instances and container images residing in Elastic Container Registry (ECR) for software vulnerabilities and unintended network exposure.
supports AWS Lambda function scanning for vulnerabilities in application code and dependencies.
provides CI/CD integration with open-source plugins for Jenkins, TeamCity, and other CI/CD tools to scan container images at build time.
provides code security capabilities including static application security testing (SAST), software composition analysis (SCA), and infrastructure as code (IaC) scanning via SCM tool connections.
supports agentless EC2 scanning (March 2026) with expanded detection coverage including Windows OS vulnerability scanning without requiring an agent.
launched Inspector VM Scanner (May 2026) for improved agent-based scanning with more granular package collection and reduced CPU utilization on EC2 instances.
creates a finding, when a software vulnerability or network issue is discovered, that describes the vulnerability, rates its severity, identifies the affected resource, and provides remediation guidance.
is a Regional service.
Amazon Inspector Classic reached end of support on May 20, 2026. All customers must use Amazon Inspector (v2).

Amazon Security Lake

is a fully managed security data lake service (GA November 2023).
automatically centralizes security data from AWS environments, SaaS providers, on-premises, and cloud sources into a purpose-built data lake.
normalizes security data into the Open Cybersecurity Schema Framework (OCSF) standard format.
aggregates data from AWS services like CloudTrail, VPC Flow Logs, Route 53 logs, and third-party sources.
enables comprehensive security data analysis across entire organization.
automatically collects data for existing and new accounts with multi-account support.
stores security data in customer’s own AWS account for data ownership and control.
integrates with analytics tools like Amazon Athena, Amazon OpenSearch, and third-party SIEM solutions.
supports cross-region data aggregation for centralized security monitoring.
pricing based on data ingestion volume and normalization (no charge for third-party or custom data).

Amazon Detective

helps analyze, investigate, and quickly identify the root cause of potential security issues or suspicious activities.
automatically collects log data from the AWS resources and uses machine learning, statistical analysis, and graph theory to build a linked set of data to easily conduct faster and more efficient security investigations.
enables customers to view summaries and analytical data associated with CloudTrail logs, EKS audit logs, VPC Flow Logs.
provides finding groups that let you examine multiple activities related to a potential security event, analyze root cause for high severity GuardDuty findings, and visualize entity connections.
provides detailed summaries, analysis, and visualizations of the behaviors and interactions amongst your AWS accounts, EC2 instances, AWS users, roles, and IP addresses.
supports automated investigation of IAM users and roles for indicators of compromise (IoC).
maintains up to a year of aggregated data
is a Regional service and needs to be enabled on a region-by-region basis.
is a multi-account service that aggregates data from monitored member accounts under a single administrative account within the same region.
integrates with Amazon Security Lake for lateral movement investigations.
has no impact on the performance or availability of the AWS infrastructure since it retrieves the log data and findings directly from the AWS services.

AWS Security Hub

is a unified cloud security solution that prioritizes critical security issues and helps respond at scale to protect cloud environments.
was completely re-imagined at re:Invent 2025 — now unifies AWS security services including Amazon GuardDuty, Amazon Inspector, and Amazon Macie into a single experience.
provides near real-time risk analytics (GA December 2025) with automated correlation, enrichment, and prioritization of security signals from multiple sources.
collects security data from across AWS accounts, services, and supported third-party partner products.
is Regional but supports cross-region aggregation of findings.
automatically runs continuous, account-level configuration and security checks based on AWS best practices and industry standards including CIS Foundations, PCI DSS, and NIST frameworks.
detects unused IAM permissions, roles, and credentials (May 2026) across the AWS organization for identity risk reduction.
offers Security Hub Extended plan (2026) providing full-stack enterprise security with 21+ curated partner solutions across 9 security categories (endpoint, identity, email, network, data, browser, cloud, AI, security operations).
supports integration with Amazon EventBridge for custom actions and automated remediation.
has multi-account management through AWS Organizations integration, which allows delegating an administrator account for the organization.
works with AWS Config to perform most of its security checks for controls.
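
As a sketch of the EventBridge integration mentioned above, the pattern below selects imported findings with HIGH or CRITICAL severity so that a rule could route them to a remediation target. It assumes the classic finding event shape (`source` of `aws.securityhub`, detail-type `Security Hub Findings - Imported`); the `matches` helper is a hypothetical, much-simplified stand-in for EventBridge's matching, included only to show what the pattern selects.

```python
event_pattern = {
    "source": ["aws.securityhub"],
    "detail-type": ["Security Hub Findings - Imported"],
    "detail": {"findings": {"Severity": {"Label": ["HIGH", "CRITICAL"]}}},
}


def matches(pattern, event):
    """Simplified matcher: lists are allowed values, dicts recurse, arrays match on any element."""
    for key, expected in pattern.items():
        if key not in event:
            return False
        actual = event[key]
        candidates = actual if isinstance(actual, list) else [actual]
        if isinstance(expected, dict):
            if not any(isinstance(c, dict) and matches(expected, c) for c in candidates):
                return False
        elif not any(c in expected for c in candidates):
            return False
    return True


def sample_event(label):
    # Minimal placeholder event containing only the fields the pattern inspects.
    return {
        "source": "aws.securityhub",
        "detail-type": "Security Hub Findings - Imported",
        "detail": {"findings": [{"Severity": {"Label": label}}]},
    }


print(matches(event_pattern, sample_event("HIGH")))
print(matches(event_pattern, sample_event("LOW")))
```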

Amazon Macie

Macie is a data security service that discovers sensitive data by using machine learning and pattern matching, provides visibility into data security risks, and enables automated protection against those risks.
provides an inventory of the S3 buckets and automatically evaluates and monitors the buckets for security and access control.
automates the discovery, classification, and reporting of sensitive data.
supports automated sensitive data discovery that continuously samples and analyzes S3 objects, builds an interactive data map, and provides a sensitivity score for each bucket.
generates a finding for you to review and remediate as necessary if it detects a potential issue with the security or privacy of the data, such as a bucket that becomes publicly accessible.
can analyze objects encrypted with dual-layer server-side encryption (DSSE-KMS).
provides multi-account support using AWS Organizations to enable Macie across all of the accounts.
is a Regional service that must be enabled on a region-by-region basis; findings can be viewed across all the accounts within each Region.
supports VPC Interface Endpoints to access Macie privately from a VPC without an internet gateway, NAT device, VPN connection, or AWS Direct Connect connection.

AWS Artifact

is a self-service audit artifact retrieval portal that provides customers with on-demand access to AWS’ compliance documentation and agreements.
can use AWS Artifact Reports to download AWS security and compliance documents, such as AWS ISO certifications, Payment Card Industry (PCI), and System and Organization Control (SOC) reports.
supports listCustomerAgreements API (November 2024) for programmatic tracking of active agreements across accounts.
provides SOC reports in machine-readable OSCAL format in addition to PDF.

AWS Security Services – Practice Questions

A company needs to manage encryption keys with FIPS 140-3 Level 3 compliance and wants AWS to handle the infrastructure. Which service should they use?
- A. AWS CloudHSM
- B. AWS KMS ✓
- C. AWS Secrets Manager
- D. AWS Certificate Manager
A financial institution needs to process payment card transactions in the cloud while meeting PCI compliance requirements. Which service should they use?
- A. AWS CloudHSM
- B. AWS KMS
- C. AWS Payment Cryptography ✓
- D. AWS Private CA
A company wants to provide secure access to corporate applications without using VPN. Which service implements Zero Trust access?
- A. AWS Client VPN
- B. AWS Verified Access ✓
- C. AWS Direct Connect
- D. AWS PrivateLink
A development team needs to externalize authorization logic from their application and use fine-grained permissions. Which service should they use?
- A. AWS IAM
- B. Amazon Cognito
- C. Amazon Verified Permissions ✓
- D. AWS IAM Identity Center
A company needs to centralize security data from multiple AWS accounts and third-party sources for analysis. Which service should they use?
- A. AWS Security Hub
- B. Amazon Security Lake ✓
- C. Amazon Detective
- D. AWS CloudTrail
Which AWS service uses AI/ML to detect multi-stage attack sequences spanning multiple data sources and resources?
- A. Amazon Inspector
- B. AWS Security Hub
- C. Amazon GuardDuty Extended Threat Detection ✓
- D. Amazon Detective
A company wants to scan EC2 instances for vulnerabilities without installing any agent. Which capability supports this?
- A. AWS Config Rules
- B. Amazon Inspector agentless scanning ✓
- C. AWS Security Hub
- D. Amazon GuardDuty
Which AWS WAF capability allows content providers to charge AI bots for accessing their content?
- A. AWS WAF Fraud Control
- B. AWS WAF Bot Control
- C. AWS WAF AI Traffic Monetization ✓
- D. AWS Shield Advanced
A company needs to automatically rotate third-party SaaS credentials without writing custom Lambda functions. Which feature supports this?
- A. AWS Secrets Manager Managed External Secrets ✓
- B. AWS Systems Manager Parameter Store
- C. AWS KMS automatic rotation
- D. AWS Config
A security team wants a unified view that correlates findings from GuardDuty, Inspector, and Macie with near real-time risk analytics. Which service provides this?
- A. Amazon Detective
- B. Amazon Security Lake
- C. AWS Security Hub ✓
- D. AWS CloudTrail Lake
An organization needs to protect their KMS encryption keys against future quantum computing threats. Which KMS feature should they use?
- A. External Key Store (XKS)
- B. Multi-Region keys
- C. ML-KEM hybrid post-quantum TLS ✓
- D. On-demand key rotation
Which service was renamed from AWS Single Sign-On (SSO) in July 2022?
- A. AWS IAM
- B. Amazon Cognito
- C. AWS IAM Identity Center ✓
- D. AWS Directory Service

Amazon Detective

October 4, 2022 ~ Last updated on : July 1, 2026 ~ jayendrapatil

Amazon Detective

Amazon Detective makes it easy to analyze, investigate, and quickly identify the root cause of potential security issues or suspicious activities.
automatically collects log data from the AWS resources and uses machine learning, statistical analysis, and graph theory to build a linked set of data to easily conduct faster and more efficient security investigations.
enables customers to view summaries and analytical data associated with CloudTrail logs, VPC Flow Logs, EKS audit logs, Amazon GuardDuty findings, and AWS Security Hub findings.
provides detailed summaries, analysis, and visualizations of the behaviors and interactions amongst your AWS accounts, EC2 instances, AWS users, roles, and IP addresses.
maintains up to a year of aggregated data and makes it easily available through a set of visualizations that shows changes in the type and volume of activity over a selected time window, and links those changes to security findings.
is a Regional service and needs to be enabled on a region-by-region basis. This ensures all data analyzed is regionally based and doesn’t cross AWS regional boundaries.
does not require Amazon GuardDuty to be enabled. As of Feb 2024, the requirement to have GuardDuty enabled for 48 hours before enabling Detective has been removed.
is a multi-account service that aggregates data from monitored member accounts under a single administrative account within the same region.
Multi-account monitoring deployments can be configured in the same way it is configured for administrative and member accounts in Amazon GuardDuty and AWS Security Hub.
is integrated with AWS Organizations. The organization management account designates a Detective administrator account for the organization.
has no impact on the performance or availability of the AWS infrastructure since it retrieves the log data and findings directly from the AWS services.
supports VPC endpoints via AWS PrivateLink, enabling secure API calls to Detective from within a VPC without requiring internet traversal.

Amazon Detective Data Sources

AWS CloudTrail logs – management events capturing API activity across your AWS accounts.
Amazon VPC Flow Logs – network traffic data for IP traffic going to and from network interfaces.
Amazon EKS Audit Logs – Kubernetes audit logs from EKS clusters for container security investigations.
Amazon GuardDuty findings – threat detection findings including runtime monitoring, malware protection, and extended threat detection.
AWS Security Hub findings – security posture findings from Security Hub and integrated services.
Other integrated AWS security services – including Amazon Inspector vulnerability findings.

Amazon Detective Finding Groups

Finding Groups automatically consolidate multiple related security findings into a single security event.
Detective detects patterns or relationships among multiple findings that suggest they are related to the same potential security incident.
Grouping helps in managing and investigating related findings more efficiently by reducing noise and prioritizing findings that present true risk.
Includes findings from GuardDuty, Security Hub, and Amazon Inspector vulnerability findings.
Provides interactive visualizations including radial layout and timeline layout views.
Supports severity-based filtering for findings to help prioritize critical issues.
Timeline layout includes play button functionality to understand event progression.
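The grouping idea can be illustrated with a minimal sketch. This is not Detective's actual algorithm, which is not documented here; it only assumes that findings sharing an entity (an IP address, an IAM role, an instance) belong together, and merges them with union-find. The function name `group_findings`, the finding IDs, and the entity labels are all hypothetical.

```python
from collections import defaultdict


def group_findings(findings):
    """Group findings that share at least one entity.

    findings: dict mapping a finding ID to the set of entity IDs it touches.
    Returns a list of sets of finding IDs, one set per group.
    """
    parent = {fid: fid for fid in findings}

    def find(x):
        # Walk up to the root, compressing the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    first_seen = {}  # entity ID -> first finding that mentioned it
    for fid, entities in findings.items():
        for entity in entities:
            if entity in first_seen:
                union(first_seen[entity], fid)
            else:
                first_seen[entity] = fid

    groups = defaultdict(set)
    for fid in findings:
        groups[find(fid)].add(fid)
    return sorted(groups.values(), key=lambda group: sorted(group))


example = {
    "finding-1": {"role/admin", "ip/198.51.100.7"},
    "finding-2": {"ip/198.51.100.7", "instance/i-0abc"},
    "finding-3": {"bucket/logs"},
}
print(group_findings(example))
```

Here `finding-1` and `finding-2` end up in one group because they share an IP address, while `finding-3` stays on its own.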

Finding Group Summaries (Generative AI)

Detective automatically generates finding group summaries powered by generative AI.
Analyzes relationships between findings and affected resources, and summarizes potential threats in natural language.
Provides a plain language title based on the analysis of the finding group with relevant summarized insights.
Describes the activity that initiated the event and its impact.
Accelerates security investigations by providing instant context without manual correlation.

Amazon Detective Investigations

Detective Investigations is a one-click investigation feature that automatically investigates IAM users and IAM roles for indicators of compromise (IoC).
Uses machine learning models and threat intelligence to analyze resources for potential security incidents.
Determines if IAM principals have potentially been compromised or involved in known tactics, techniques, and procedures (TTPs) from the MITRE ATT&CK framework.
Investigates attack tactics, impossible travel, flagged IP addresses, and finding groups.
Generates an investigation report highlighting anomalous behavior that indicates potential compromise.
Can generate up to 500 investigations per month in each AWS Region.
Detective recommends resources to investigate based on activity in findings and finding groups.
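As an illustration of the "impossible travel" indicator mentioned above, here is a rough sketch that flags two logins whose implied travel speed is higher than a commercial flight. It is not how Detective implements the check; the 900 km/h threshold, the function names, and the tuple layout are assumptions made for the example.

```python
import math
from datetime import datetime

EARTH_RADIUS_KM = 6371.0


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two coordinates, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def is_impossible_travel(login_a, login_b, max_speed_kmh=900.0):
    """Each login is (timestamp, latitude, longitude).

    Returns True when covering the distance between the two logins in the
    elapsed time would need a speed above max_speed_kmh (assumed threshold).
    """
    (t1, lat1, lon1), (t2, lat2, lon2) = sorted([login_a, login_b], key=lambda login: login[0])
    hours = (t2 - t1).total_seconds() / 3600
    distance = haversine_km(lat1, lon1, lat2, lon2)
    if hours == 0:
        return distance > 0
    return distance / hours > max_speed_kmh


london = (datetime(2024, 1, 1, 9, 0), 51.5074, -0.1278)
sydney = (datetime(2024, 1, 1, 10, 0), -33.8688, 151.2093)
print(is_impossible_travel(london, sydney))
```

A real detector would also need to account for VPNs and imprecise IP geolocation, which this sketch ignores.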

Amazon Detective and Security Lake Integration

Detective integrates with Amazon Security Lake to query and retrieve raw log data stored in Security Lake.
Enables deeper analysis with access to more detailed parameters as original evidence.
Supports log collection from CloudTrail management events, Amazon VPC Flow Logs, and Amazon EKS Audit Logs.
Supports both OCSF source version 1 (1.0.0-rc.2) and source version 2 (OCSF 1.1.0).
Allows querying log sources without having to craft queries or leave the Detective console.

Amazon Detective vs GuardDuty

Amazon GuardDuty is a threat detection service that continuously monitors malicious activity and unauthorized behavior to protect AWS accounts and workloads.
Amazon Detective simplifies the process of investigating security findings and identifying the root cause. It automatically creates a graph model and provides a unified, interactive view of your resources, users, and the interactions between them over time.
GuardDuty detects threats; Detective investigates those threats to determine root cause and scope.
Detective supports GuardDuty findings including Runtime Monitoring (ECS, EKS, EC2), Malware Protection for S3, Lambda Protection, RDS Protection, and Extended Threat Detection (attack sequences).

Amazon Detective Key Features

Graph Model – constructs a behavior graph using ML, statistical analysis, and graph theory to link security-related data for investigations.
Interactive Visualizations – provides geolocation-based login attempt views, API call volume analysis, and VPC flow volume tracking.
Seamless Integration – integrated with GuardDuty, Security Hub, Amazon Inspector, Amazon Security Lake, and AWS Partner security products.
AWS PrivateLink – supports VPC endpoints for private API access without internet traversal (added Sept 2025).
Simple Deployment – no software to deploy, agents to install, or data sources to enable manually.
Entity Profiles – provides profiles for AWS accounts, IAM users, IAM roles, EC2 instances, S3 buckets, EKS clusters, IP addresses, container images, and Kubernetes pods.
CSV Export – supports exporting data from Summary page and search results in CSV format.

AWS Certification Exam Practice Questions

Questions are collected from the Internet and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed, the question might not be updated.

Open to further feedback, discussion and correction.

A security team needs to investigate a potential security incident across multiple AWS accounts. They want a service that automatically correlates security findings and provides visualizations of related entities. Which AWS service should they use?
1. Amazon GuardDuty
2. AWS Security Hub
3. Amazon Detective
4. AWS CloudTrail
Answer: 3. Amazon Detective automatically creates a graph model that correlates findings across accounts and provides interactive visualizations for security investigations.
Which data sources does Amazon Detective automatically ingest? (Select THREE)
1. AWS CloudTrail logs
2. Amazon VPC Flow Logs
3. Amazon S3 access logs
4. Amazon EKS audit logs
5. AWS Config rules evaluations
Answer: 1, 2, 4. Amazon Detective automatically ingests CloudTrail logs, VPC Flow Logs, and EKS audit logs, along with GuardDuty and Security Hub findings.
A company uses Amazon Detective and wants to investigate whether an IAM role has been compromised. Which Detective feature provides automated investigation of IAM entities for indicators of compromise?
1. Finding Groups
2. Detective Investigations
3. Behavior Graph
4. Security Lake Integration
Answer: 2. Detective Investigations is a one-click feature that automatically investigates IAM users and roles for indicators of compromise (IoC) using the MITRE ATT&CK framework.
What is the purpose of Amazon Detective Finding Groups?
1. To group AWS accounts for multi-account monitoring
2. To consolidate related security findings that may belong to the same security incident
3. To organize VPC Flow Logs by security groups
4. To categorize CloudTrail events by service
Answer: 2. Finding Groups automatically consolidate multiple related security findings into a single security event, reducing noise and helping prioritize findings that present true risk.
Which statement about Amazon Detective is correct? (Select TWO)
1. It requires Amazon GuardDuty to be enabled for at least 48 hours before activation
2. It is a Regional service that does not cross AWS regional boundaries
3. It can maintain up to 5 years of aggregated data
4. It provides finding group summaries powered by generative AI
5. It requires manual configuration of data sources
Answer: 2, 4. Detective is regional and provides GenAI-powered finding group summaries. As of Feb 2024, GuardDuty is no longer required. Detective maintains up to 1 year (not 5) of data. No manual data source configuration is needed.
A security analyst wants to access raw log data during an investigation without leaving the Amazon Detective console. Which integration enables this capability?
1. AWS CloudTrail Lake
2. Amazon Security Lake
3. Amazon S3 Select
4. Amazon Athena
Answer: 2. Detective integrates with Amazon Security Lake, enabling analysts to query and retrieve raw log data stored in Security Lake directly from the Detective console.

AWS Certified Solutions Architect – Associate SAA-C03 Exam Learning Path

September 29, 2022 ~ Last updated on : July 11, 2026 ~ jayendrapatil

AWS Certified Solutions Architect – Associate SAA-C03 Exam Learning Path

I just cleared the AWS Solutions Architect – Associate SAA-C03 exam with a score of 914/1000.
AWS Solutions Architect – Associate SAA-C03 exam is the latest AWS exam released on 30th August 2022 and has replaced the previous AWS Solutions Architect – SAA-C02 certification exam.
The SAA-C03 exam continues to be the current version as of June 2026, with enhanced focus on modern AWS services, sustainability considerations, and advanced networking capabilities. Note: AWS announced the SAA-C04 revision rolling out in Q2-Q3 2026 with increased emphasis on resilient architecture design and cost optimization. Both SAA-C03 and SAA-C04 versions remain available until September 30, 2026 (grace period).

AWS Solutions Architect – Associate SAA-C03 Exam Content

It basically validates the ability to effectively demonstrate knowledge of how to design, architect, and deploy secure, cost-effective, and robust applications on AWS technologies
The exam also validates a candidate’s ability to complete the following tasks:
- Design solutions that incorporate AWS services to meet current business requirements and future projected needs
- Design architectures that are secure, resilient, high-performing, and cost-optimized
- Review existing solutions and determine improvements

Refer to the AWS Solutions Architect – Associate SAA-C03 Exam Guide

AWS Solutions Architect – Associate SAA-C03 Exam Summary

SAA-C03 exam consists of 65 questions in 130 minutes, and the time is more than sufficient if you are well-prepared.
SAA-C03 exam includes two types of questions, multiple-choice and multiple-response.
SAA-C03 has a scaled score between 100 and 1,000. The scaled score needed to pass the exam is 720.
Associate exams currently cost $150 + tax.
The exam includes 50 scored questions and 15 unscored questions (total 65 questions). The unscored questions are used by AWS to evaluate future exam content.
You can get an additional 30 minutes if English is your second language by requesting Exam Accommodations. It might not be needed for Associate exams but is helpful for Professional and Specialty ones.
AWS exams can be taken either in person at a test center or online. I prefer to take them online as it provides a lot of flexibility. Just make sure you have a proper place to take the exam with no disturbance and nothing around you.
Also, if you are taking the AWS online exam for the first time, try to join at least 30 minutes before the actual time, as I have had issues with long wait times with both PSI and Pearson.

🆕 SAA-C04 Exam Update (Announced April 2026)

AWS announced the SAA-C04 revision rolling out Q2-Q3 2026 with the following changes:

Increased emphasis on resilient architecture design (now 30% of exam content)
Enhanced cost optimization strategies coverage
AI/GenAI awareness – Generative AI competency embedded at Professional level; Associate remains focused on core architectural skills
Grace period: Both SAA-C03 and SAA-C04 versions active until September 30, 2026

Exam delivery updates (April 2026):

AI-assisted identity verification for remote proctoring
Score reporting reduced to under 24 hours (from 1-5 business days)
ESL exam duration extensions now automatically applied (no separate accommodation request needed in the US)

AWS Solutions Architect – Associate SAA-C03 Exam Resources

Online Courses
- Stephane Maarek – Ultimate AWS Certified Solutions Architect Associate SAA-C03
- Adrian Cantrill – AWS Certified Solutions Architect – Associate (SAA-C03)
- Adrian Cantrill – All Associate Bundle
- DolfinEd – AWS Certified Solutions Architect Associate – SAA-C03 (E-Study & Lab Guides Included)
- DolfinEd – AWS Certified Solutions Architect Associate (On-line, Instructor-Led – Private Group Bootcamp)
- Whizlabs – AWS Certified Solutions Architect Associate Course
- Coursera Exam Prep: AWS Certified Solutions Architect – Associate
Practice tests
- Braincert AWS Solutions Architect – Associate SAA-C03 Practice Exams, which are updated for SAA-C03
- Stephane Maarek – AWS Certified Solutions Architect Associate Practice Exams
- Whizlabs – AWS Certified Solutions Architect Associate Practice Tests
Sign up for an AWS Free Tier account, which lets you try many services for free within certain limits that are more than enough to get things going. Be sure to decommission anything that goes beyond the free limits to prevent any surprises 🙂
Also, use QwikLabs for introductory courses which are free
Read the FAQs at least for the important topics, as they cover important points and are good for quick review

AWS Solutions Architect – Associate SAA-C03 Exam Topics

SAA-C03 exam covers the design and architecture aspects in depth, so you must be able to visualize the architecture, even draw it out or prepare a mental picture, just to understand how it would work and how different services relate.
SAA-C03 exam concepts cover solutions that fall within the AWS Well-Architected Framework, with a focus on scalable, highly available, cost-effective, performant, and resilient designs.
If you had been preparing for the SAA-C02, SAA-C03 is pretty much similar except for the addition of some new services such as Aurora Serverless, AWS Global Accelerator, FSx for Windows, and FSx for Lustre.
New services and features added to exam scope include VPC Lattice, VPC IP Address Manager (IPAM), AWS Network Firewall, Amazon Verified Permissions, and enhanced focus on sustainability and cost optimization.

⚠️ IMPORTANT: AWS SERVICES DEPRECATED / MAINTENANCE MODE

Several AWS services have been deprecated or moved to maintenance mode (updated June 2026):

AWS App Mesh – End of support September 30, 2026. Migrate to Amazon VPC Lattice or ECS Service Connect
AWS App Runner – Moved to maintenance mode (April 30, 2026). No longer accepting new customers. Consider ECS Fargate, Lambda, or EKS
Amazon RDS Custom for Oracle – Entering sunset; end of support March 31, 2027. Migrate to Amazon RDS for Oracle or self-managed EC2
Amazon Cloud9 – No longer accepting new customers (July 2024). Use local IDEs with AWS Toolkit or AWS CloudShell
AWS CloudTrail Lake – Moved to maintenance mode (May 31, 2026). Use CloudWatch Logs Insights or S3 + Athena

Note: AWS CodeCommit was temporarily de-emphasized in July 2024 but returned to full General Availability in November 2025.

This post has been updated to reflect these changes and include migration guidance.

Networking

Virtual Private Cloud – VPC
- Create a VPC from scratch with public, private, and dedicated subnets with proper route tables, security groups, and NACLs.
- Understand what a CIDR is and address patterns.
- Subnets are public or private depending on whether they can route traffic directly through an Internet gateway
- Understand how communication happens between the Internet, Public subnets, Private subnets, NAT, Bastion, etc.
- Bastion (also referred to as a Jump server) can be used to securely access instances in the private subnets.
- Create two-tier architecture with application in public and database in private subnets
- Create three-tier architecture with web servers in public, application, and database servers in private. (hint: focus on security group configuration with least privilege)
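To get comfortable with CIDR blocks, it can help to work through the arithmetic. The sketch below uses Python's standard `ipaddress` module to carve a VPC range into /24 subnets; the `10.0.0.0/16` range and the public/private naming are just example choices. The only AWS-specific fact used is that AWS reserves five addresses in every subnet (the first four and the last).

```python
import ipaddress

# Example VPC range; any RFC 1918 block would work the same way.
vpc = ipaddress.ip_network("10.0.0.0/16")

# Split the /16 into /24 subnets and pick the first two.
subnets = list(vpc.subnets(new_prefix=24))
public_subnet, private_subnet = subnets[0], subnets[1]

# AWS reserves the first four addresses and the last one in each subnet.
AWS_RESERVED_PER_SUBNET = 5
usable_hosts = public_subnet.num_addresses - AWS_RESERVED_PER_SUBNET

print(f"VPC {vpc} holds {vpc.num_addresses} addresses")
print(f"It splits into {len(subnets)} /24 subnets")
print(f"Public subnet {public_subnet}: {usable_hosts} usable addresses")
print(f"Private subnet {private_subnet}")
print(ipaddress.ip_address("10.0.1.25") in private_subnet)
```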
NEW 2025: VPC IP Address Manager (IPAM)
- Centrally manage and monitor IP addresses across AWS accounts and regions
- Automate IP address assignments and prevent IP address conflicts
- Provides visibility into IP address utilization and helps with compliance
- Essential for large-scale, multi-account AWS deployments
Amazon VPC Lattice
- Application networking service that connects, secures, and monitors service-to-service communications
- Simplifies microservices connectivity across VPCs, accounts, and compute types
- Provides Layer 7 load balancing, service discovery, and traffic management
- Replaces complex service mesh configurations with managed service
- Ideal migration path from deprecated AWS App Mesh
AWS Network Firewall
- Managed, stateful firewall service for VPC protection
- Provides deep packet inspection (DPI) and intrusion prevention
- Supports custom rules and AWS managed threat intelligence
- Integrates with AWS Firewall Manager for centralized management
- Essential for compliance and advanced threat protection
Security Groups and NACLs
- Security Groups are stateful, while NACLs are stateless.
- Also, only NACLs provide the ability to deny or block IPs
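The stateful vs stateless difference is easier to see in a toy model. The sketch below is a simplification, not AWS's implementation: the NACL function evaluates numbered rules in ascending order with first match winning, and the security-group-like class only has allow rules but remembers connections so that replies pass automatically. All names, rule tuples, and IP addresses are hypothetical.

```python
import ipaddress


def evaluate_nacl(rules, remote_ip, port):
    """Stateless check against rules of (rule_number, cidr, (low, high), action).

    Rules are evaluated in ascending rule-number order and the first match
    wins; traffic that matches nothing is denied.
    """
    for _, cidr, (low, high), action in sorted(rules, key=lambda rule: rule[0]):
        in_cidr = ipaddress.ip_address(remote_ip) in ipaddress.ip_network(cidr)
        if in_cidr and low <= port <= high:
            return action
    return "deny"


class StatefulGroup:
    """Security-group-like filter: allow rules only, replies are tracked."""

    def __init__(self, allowed_inbound_ports):
        self.allowed_inbound_ports = set(allowed_inbound_ports)
        self.tracked = set()

    def inbound(self, remote_ip, remote_port, local_port):
        if local_port in self.allowed_inbound_ports:
            self.tracked.add((remote_ip, remote_port, local_port))
            return True
        return False  # there is no explicit deny rule, unmatched traffic is dropped

    def reply_allowed(self, remote_ip, remote_port, local_port):
        # Return traffic of a tracked connection needs no outbound rule.
        return (remote_ip, remote_port, local_port) in self.tracked


# NACL: block one range explicitly, allow HTTPS from everyone else.
nacl_inbound = [
    (90, "203.0.113.0/24", (0, 65535), "deny"),
    (100, "0.0.0.0/0", (443, 443), "allow"),
]
# Stateless, so replies need their own outbound rule for ephemeral ports.
nacl_outbound = [(100, "0.0.0.0/0", (1024, 65535), "allow")]

print(evaluate_nacl(nacl_inbound, "203.0.113.7", 443))
print(evaluate_nacl(nacl_inbound, "198.51.100.1", 443))
```

Note how the explicit deny for `203.0.113.0/24` is only expressible in the NACL; the stateful class has no way to write it.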
NAT Gateway or Instances
- help enable instances in a private subnet to connect to the Internet.
- Understand the difference between NAT Gateway & NAT Instance.
- NAT Gateway is AWS-managed and is scalable and highly available.
VPC endpoints
- enable the creation of a private connection between a VPC and supported AWS services or VPC endpoint services powered by PrivateLink, using private IP addresses and without needing an Internet or NAT Gateway.
- VPC Gateway Endpoints support S3 and DynamoDB.
- VPC Interface Endpoints (PrivateLink) support other services.
VPN and Direct Connect for on-premises to AWS connectivity
- VPN provides a quick, cost-effective, secure channel; however, it routes through the Internet and does not provide consistent throughput.
- Direct Connect provides consistent, dedicated throughput without traversing the Internet; however, it requires time to set up and is not cost-effective.
Understand Data Migration techniques at a high level
- VPN and Direct Connect for continuous, frequent data transfers.
- Snow Family is ideal for one-time, cost-effective huge data transfer.
- Choose a technique depending on the available bandwidth, data transfer needed, time available, encryption, one-time or continuous.
CloudFront
- fully managed, fast CDN service that speeds up the distribution of static, dynamic web, or streaming content to end-users
- S3 fronted by CloudFront provides a low-latency, performant experience for global users.
- provides static and dynamic caching for both AWS and on-premises origin.
Global Accelerator
- optimizes the path to applications to keep packet loss, jitter, and latency consistently low.
- helps improve the performance by lowering first-byte latency
- provides 2 static IP addresses
Know CloudFront vs Global Accelerator
Route 53
- highly available and scalable DNS web service.
- Health checks and failover routing help provide resilient, active-passive solutions
- Route 53 Routing Policies and their use cases (hint: focus on weighted, latency, geolocation, failover routing)
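For weighted routing, the share of traffic a record receives is its weight divided by the sum of all weights. The sketch below only illustrates that arithmetic with a local random pick; it does not call Route 53, and the record names and helper functions are hypothetical.

```python
import random


def traffic_share(weights):
    """Expected fraction of responses per record: weight / sum of weights."""
    total = sum(weights.values())
    return {name: weight / total for name, weight in weights.items()}


def pick_record(weights, rng=random):
    """Simulate one DNS answer by choosing a record in proportion to its weight."""
    names = list(weights)
    return rng.choices(names, weights=[weights[name] for name in names], k=1)[0]


# Example: send roughly 10% of traffic to a new "green" stack.
records = {"blue": 90, "green": 10}
print(traffic_share(records))
print(pick_record(records))
```

This is the usual basis for blue/green or canary rollouts: shift the weights gradually instead of switching all traffic at once.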
Elastic Load Balancer
- Focus on ALB and NLB
- Differences between ALB vs NLB
  - ALB is layer 7 vs NLB is layer 4
  - ALB provides content-based, host-based, path-based routing
  - ALB provides dynamic port mapping, which allows multiple copies of the same task to be hosted on the same ECS node
  - NLB provides low latency, the ability to scale rapidly, and a static IP address
  - ALB works with WAF while NLB does not.
- Gateway Load Balancer – GWLB
  - helps deploy, scale, and manage virtual appliances like firewalls, IDS/IPS, and deep packet inspection systems.

Security

Identity Access Management – IAM
- IAM role
  - provides permissions that are not associated with a particular user, group, or service and are intended to be assumable by anyone who needs it.
  - can be used for EC2 application access and Cross-account access
- IAM identity providers and federation and use cases – Although did not see much in SAA-C03
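For the cross-account use case, the role in the trusting account carries a trust policy that names who may assume it. Below is a minimal example of such a policy document built as a Python dictionary. The account ID `111122223333` is a placeholder, and a real setup would usually narrow the principal and add conditions.

```python
import json

# Placeholder ID of the account whose principals may assume the role.
TRUSTED_ACCOUNT_ID = "111122223333"

trust_policy = {
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            # The account root delegates the decision to that account's own IAM policies.
            "Principal": {"AWS": f"arn:aws:iam::{TRUSTED_ACCOUNT_ID}:root"},
            "Action": "sts:AssumeRole",
        }
    ],
}

print(json.dumps(trust_policy, indent=2))
```

The permissions the role grants are defined separately in its permissions policies; the trust policy only controls who can assume it.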
NEW 2025: Amazon Verified Permissions
- Centrally manage fine-grained permissions and authorization for applications
- Uses Cedar policy language for defining access control policies
- Provides scalable, consistent authorization across microservices
- Integrates with existing identity providers and AWS services
- Essential for zero-trust architecture implementations
Key Management Services – KMS encryption service
- for key management and envelope encryption
- S3 integration with SSE-S3, SSE-C, and SSE-KMS
- KMS Multi-region keys are AWS KMS keys in different AWS Regions that can be used interchangeably – as though having the same key in multiple Regions.
AWS WAF
- integrates with CloudFront and ALB to provide protection against cross-site scripting (XSS) and SQL injection attacks.
- provides IP blocking and geo-protection, rate limiting, etc.
AWS Shield
- managed DDoS protection service
- integrates with CloudFront, ALB, and Route 53
- Advanced provides additional detection and mitigation against large and sophisticated DDoS attacks, near real-time visibility into attacks
Amazon GuardDuty
- managed threat detection service and provides Malware protection
- Enhanced with machine learning-based threat detection and integration with Security Hub
Amazon Inspector
- is a vulnerability management service that continuously scans the AWS workloads for vulnerabilities
- Now includes container image scanning and enhanced software vulnerability detection
AWS Secrets Manager
- helps protect secrets needed to access applications, services, and IT resources.
- supports rotation of secrets, which Systems Manager Parameter Store does not support.
Disaster Recovery whitepaper
- Be sure you know the different recovery types with impact on RTO/RPO.
- Enhanced focus on cross-region disaster recovery and automated failover strategies

Storage

Understand various storage options S3, EBS, Instance store, EFS, Glacier, FSx, and what are the use cases and anti-patterns for each
Instance Store
- is physically attached to the EC2 instance and provides the lowest latency and highest IOPS
Elastic Block Store – EBS
- EBS volume types and their use cases in terms of IOPS and throughput. SSD for IOPS and HDD for throughput
- EBS Snapshots
  - Backups are automated, snapshots are manual
  - Can be used to encrypt an unencrypted EBS volume
- Multi-Attach EBS feature allows attaching an EBS volume to multiple instances within the same AZ only.
- EBS fast snapshot restore feature helps ensure that the EBS volumes created from a snapshot are fully-initialized at creation and instantly deliver all of their provisioned performance.
Simple Storage Service – S3
- S3 storage classes with lifecycle policies
  - Understand the difference between S3 Standard vs S3 Standard-IA vs S3 One Zone-IA in terms of cost and durability
  - New S3 Express One Zone storage class for high-performance workloads
- S3 Data Protection
  - S3 Client-side encryption encrypts data before storing it in S3
- S3 features including
  - S3 provides cost-effective static website hosting. However, the website endpoint does not support HTTPS. It can be integrated with CloudFront for HTTPS, caching, performance, and low-latency access.
  - S3 versioning provides protection against accidental overwrites and deletions. Used with MFA Delete feature.
  - S3 Pre-Signed URLs for both upload and download provide access without needing AWS credentials.
  - S3 CORS allows cross-domain calls
  - S3 Transfer Acceleration enables fast, easy, and secure transfers of files over long distances between your client and an S3 bucket.
  - S3 Event Notifications to trigger events on various S3 events like objects added or deleted. Supports SQS, SNS, and Lambda functions.
  - Integrates with Amazon Macie to detect PII data
  - Replication, both same-region and cross-region, requires versioning to be enabled.
  - Integrates with Athena to analyze data in S3 using standard SQL.
⚠️ NOTE: Amazon S3 Glacier (standalone vault-based API) has been superseded by S3 Glacier storage classes. Use S3 Glacier Instant Retrieval, S3 Glacier Flexible Retrieval, or S3 Glacier Deep Archive with S3 lifecycle policies for archival storage.
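A lifecycle policy that moves objects through the storage classes could look like the sketch below. The dictionary follows the shape S3 expects for a bucket lifecycle configuration; the rule ID, the `logs/` prefix, and the day counts are made-up example values. The small helper is hypothetical and only sanity-checks that transitions are listed in increasing order of days.

```python
lifecycle_configuration = {
    "Rules": [
        {
            "ID": "archive-logs",  # example rule name
            "Status": "Enabled",
            "Filter": {"Prefix": "logs/"},  # example prefix
            "Transitions": [
                {"Days": 30, "StorageClass": "STANDARD_IA"},
                {"Days": 90, "StorageClass": "GLACIER"},  # Glacier Flexible Retrieval
                {"Days": 365, "StorageClass": "DEEP_ARCHIVE"},
            ],
            "Expiration": {"Days": 2555},  # delete after roughly seven years
        }
    ]
}


def transition_days_are_increasing(rule):
    """Return True if each transition happens later than the one before it."""
    days = [transition["Days"] for transition in rule["Transitions"]]
    return all(earlier < later for earlier, later in zip(days, days[1:]))


for rule in lifecycle_configuration["Rules"]:
    print(rule["ID"], transition_days_are_increasing(rule))
```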
Storage gateway and its different types.
- Cached Volume Gateway provides access to frequently accessed data while using AWS as the actual storage
- Stored Volume gateway uses AWS as a backup, while the data is being stored on-premises as well
- File Gateway supports SMB protocol
FSx is easy and cost-effective to launch and run popular file systems.
- FSx provides two file systems to choose from:
- Amazon FSx for Windows File Server
  - works with both Linux and Windows
  - provides Windows File System features including integration with Active Directory.
- Amazon FSx for Lustre
  - for high-performance workloads
  - works with only Linux
- FSx for NetApp ONTAP and FSx for OpenZFS now available for additional file system options
Elastic File System – EFS
- simple, fully managed, scalable, serverless, and cost-optimized file storage for use with AWS Cloud and on-premises resources.
- provides shared volume across multiple EC2 instances, while EBS can be attached to a single instance within the same AZ or EBS Multi-Attach can be attached to multiple instances within the same AZ
- supports the NFS protocol, and is compatible with Linux-based AMIs
- supports cross-region replication, storage classes for cost.
AWS Transfer Family
- secure transfer service that helps transfer files into and out of AWS storage services using the FTP, SFTP, and FTPS protocols.
Difference between EBS vs S3 vs EFS
Difference between EBS vs Instance Store
Would recommend referring to the Storage Options whitepaper; although a bit dated, 90% of it still holds true.

Compute

Elastic Compute Cloud – EC2
Auto Scaling and ELB
- Auto Scaling provides the ability to ensure a correct number of EC2 instances are always running to handle the load of the application
- Elastic Load Balancer allows the incoming traffic to be distributed automatically across multiple healthy EC2 instances
Autoscaling & ELB
- work together to provide High Availability and Scalability.
- Span both ELB and Auto Scaling across Multi-AZs to provide High Availability
- Do not span across regions. Use Route 53 or Global Accelerator to route traffic across regions.
EC2 Instance Purchase Types – Reserved, Scheduled Reserved, On-demand, and Spot and their use cases
- Reserved instances provide cost benefits over On-demand instances for long-term requirements with a continuous, persistent load
- Scheduled Reserved Instances for load with a fixed schedule and time interval
- Spot instances provide cost benefits for temporary, fault-tolerant, spiky load
- Savings Plans now preferred over Reserved Instances for flexibility across instance families
EC2 Placement Groups
- Cluster placement groups provide low latency and high throughput communication
- Spread placement group provides high availability
- Partition placement groups for distributed workloads like Hadoop and Cassandra
Lambda and serverless architecture, its features, and use cases.
- Lambda integrated with API Gateway to provide a serverless, highly scalable, cost-effective architecture
- Enhanced with container image support and improved cold start performance
Elastic Container Service – ECS with its ability to deploy containers and microservices architecture.
- ECS role for tasks can be provided through taskRoleArn
- ALB provides dynamic port mapping to allow multiple copies of the same task on the same node.
- ECS Anywhere allows running containers on-premises
Elastic Kubernetes Service – EKS
- managed Kubernetes service to run Kubernetes in the AWS cloud and on-premises data centers
- ideal for migration of an existing workload on Kubernetes
- EKS Anywhere and EKS Distro for hybrid deployments
Elastic Beanstalk at a high level, what it provides, and its ability to get an application running quickly.

Databases

Understand relational and NoSQL data storage options which include RDS, DynamoDB, and Aurora with their use cases
Relational Database Service – RDS
- Read Replicas vs Multi-AZ
  - Read Replicas for scalability, Multi-AZ for High Availability
  - Multi-AZ deployments stay within a single region
  - Read Replicas can span across regions and can be used for disaster recovery
- Understand Automated Backups, underlying volume types (which are the same as EBS volume types)
- ~~RDS Custom for Oracle~~ – ⚠️ Entering sunset (end of support March 31, 2027). RDS Custom for SQL Server remains available. For Oracle with OS-level access, consider self-managed EC2 or standard RDS for Oracle.
Aurora
- provides multiple read replicas and replicates 6 copies of data across AZs
- Aurora Serverless
  - provides a highly scalable cost-effective database solution
  - automatically starts up, shuts down, and scales capacity up or down based on the application’s needs.
  - supports only MySQL and PostgreSQL
  - Aurora Serverless v2 with instant scaling and better cost optimization
- Aurora Global Database for cross-region disaster recovery
DynamoDB
- provides low latency performance, a key-value store
- is not a relational database
- DynamoDB DAX provides caching for DynamoDB
- DynamoDB TTL helps expire data in DynamoDB without any cost or consuming any write throughput.
- DynamoDB Standard-IA storage class for cost optimization
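DynamoDB TTL works off a Number attribute holding the expiry time as a Unix epoch timestamp in seconds. A small sketch of computing that value follows; the attribute name `expires_at`, the key names, and the 30-day retention are assumptions for illustration.

```python
from datetime import datetime, timedelta, timezone


def ttl_epoch_seconds(created_at, keep_for):
    """Return the expiry time as whole seconds since the Unix epoch."""
    return int((created_at + keep_for).timestamp())


created = datetime(2024, 1, 1, tzinfo=timezone.utc)

# Hypothetical item: "expires_at" would be the attribute configured as the TTL attribute.
item = {
    "pk": "session#123",
    "created_at": created.isoformat(),
    "expires_at": ttl_epoch_seconds(created, timedelta(days=30)),
}
print(item)
```

Using a timezone-aware `datetime` avoids the local-time ambiguity that naive datetimes introduce when converting to a timestamp.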
ElastiCache use cases, mainly for caching performance
- ElastiCache Redis vs Memcached
- ElastiCache Serverless for Redis with automatic scaling

Integration Tools

Simple Queue Service
- SQS as a message queuing service and SNS as a pub/sub notification service
- acts as a decoupling service and provides resiliency
- SQS features like visibility timeout, and long polling vs short polling
- provides scaling for the Auto Scaling group based on the SQS queue depth.
- SQS Standard vs SQS FIFO difference
  - FIFO provides exactly-once processing and strict ordering, but with lower throughput than Standard
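Scaling an Auto Scaling group on queue size is usually expressed as a "backlog per instance" calculation. The sketch below shows only the arithmetic. The function names and the sample numbers are assumptions for illustration; in practice the message count would come from the queue's ApproximateNumberOfMessages attribute.

```python
import math

def backlog_per_instance(visible_messages: int, running_instances: int) -> float:
    """Messages waiting in the queue per running instance."""
    return visible_messages / max(running_instances, 1)

def target_backlog(acceptable_latency_s: int, avg_processing_time_s: int) -> float:
    """Backlog one instance can absorb while staying within the latency goal."""
    return acceptable_latency_s / avg_processing_time_s

def desired_instances(visible_messages: int,
                      acceptable_latency_s: int,
                      avg_processing_time_s: int) -> int:
    """Instances needed so that each one stays at or below the target backlog."""
    per_instance = target_backlog(acceptable_latency_s, avg_processing_time_s)
    return max(1, math.ceil(visible_messages / per_instance))

# Illustrative numbers only: 1500 queued messages, 10 s acceptable latency,
# 2 s average processing time per message.
print(desired_instances(1500, 10, 2))
```

A target tracking policy on a custom backlog-per-instance metric would then keep the group near the target value.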
Simple Notification Service – SNS
- is a web service that coordinates and manages the delivery or sending of messages to subscribing endpoints or clients
- Fanout pattern can be used to push messages to multiple subscribers
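In the common SNS-to-SQS form of the fanout pattern, each subscribed queue needs an access policy that lets the topic deliver messages to it. The sketch below builds that policy document as JSON. The ARNs are hypothetical placeholders, and the helper name is an assumption for illustration.

```python
import json

def sqs_policy_for_sns(queue_arn: str, topic_arn: str) -> str:
    """Build a queue policy that allows one SNS topic to send messages."""
    policy = {
        "Version": "2012-10-17",
        "Statement": [{
            "Effect": "Allow",
            "Principal": {"Service": "sns.amazonaws.com"},
            "Action": "sqs:SendMessage",
            "Resource": queue_arn,
            # Restrict delivery to the specific topic being fanned out.
            "Condition": {"ArnEquals": {"aws:SourceArn": topic_arn}},
        }],
    }
    return json.dumps(policy)

# Hypothetical ARNs: one topic fanned out to two queues.
topic = "arn:aws:sns:us-east-1:111122223333:orders"
queues = [
    "arn:aws:sqs:us-east-1:111122223333:billing",
    "arn:aws:sqs:us-east-1:111122223333:shipping",
]
policies = {queue: sqs_policy_for_sns(queue, topic) for queue in queues}
```

Each policy string would be set as the `Policy` attribute of its queue before subscribing the queue to the topic.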
Amazon EventBridge for event-driven architectures and cross-service integration

Analytics

Redshift as a data warehouse for business intelligence workloads
- Redshift Serverless for automatic scaling and cost optimization
Kinesis
- for real-time data capture and analytics.
- Integrates with Lambda functions to perform transformations
AWS Glue
- fully-managed, ETL service that automates the time-consuming steps of data preparation for analytics
- AWS Glue for Ray for distributed data processing
Amazon OpenSearch Service (successor to Elasticsearch Service) for search and analytics

Management Tools

CloudWatch
- monitoring to provide operational transparency
- is extendable with custom metrics
- CloudWatch Logs -> (Subscription filter) -> Kinesis Data Firehose -> S3
- CloudWatch Application Insights for automated application monitoring
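The log streaming pipeline above hinges on a subscription filter attached to a log group. As a hedged sketch, the function below assembles the keyword arguments that the boto3 CloudWatch Logs `put_subscription_filter` call accepts. The log group name, filter name, and ARNs are hypothetical, and nothing is sent to AWS here.

```python
def subscription_filter_params(log_group: str,
                               firehose_arn: str,
                               role_arn: str) -> dict:
    """Arguments for streaming every event in a log group to Firehose."""
    return {
        "logGroupName": log_group,
        "filterName": "to-firehose",   # hypothetical filter name
        "filterPattern": "",           # empty pattern matches all log events
        "destinationArn": firehose_arn,
        # Role that CloudWatch Logs assumes to write to the delivery stream.
        "roleArn": role_arn,
    }

params = subscription_filter_params(
    "/app/web",  # hypothetical log group
    "arn:aws:firehose:us-east-1:111122223333:deliverystream/logs-to-s3",
    "arn:aws:iam::111122223333:role/cwl-to-firehose",
)
# With credentials configured, this could be applied as:
#   boto3.client("logs").put_subscription_filter(**params)
```

The Firehose delivery stream itself is what is configured with the S3 bucket as its destination.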
CloudTrail
- helps enable governance, compliance, and operational and risk auditing of the AWS account.
- helps to get a history of AWS API calls and related events for the AWS account.
CloudFormation
- easy way to create and manage a collection of related AWS resources, and provision and update them in an orderly and predictable fashion.
AWS Config
- fully managed service that provides AWS resource inventory, configuration history, and configuration change notifications to enable security, compliance, and governance.
AWS Systems Manager enhanced with better patch management and automation capabilities

NEW 2025: Sustainability and Cost Optimization

AWS Sustainability: Understanding the AWS commitment to net-zero carbon by 2040
- Carbon footprint tracking and optimization
- Sustainable architecture patterns
- Right-sizing resources for environmental impact
Enhanced Cost Optimization:
- AWS Cost Explorer and Cost Anomaly Detection
- Savings Plans vs Reserved Instances comparison
- Spot Instance best practices and interruption handling
- Resource tagging strategies for cost allocation

NEW 2025: Practice Questions for Updated Services

VPC Lattice Questions:
- Q: A company needs to connect microservices across multiple VPCs and AWS accounts with centralized security policies. Which service should they use?
  - A) VPC Peering
  - B) Transit Gateway
  - C) Amazon VPC Lattice ✓
  - D) AWS PrivateLink
Network Firewall Questions:
- Q: Which AWS service provides stateful firewall capabilities with deep packet inspection for VPC traffic?
  - A) Security Groups
  - B) Network ACLs
  - C) AWS WAF
  - D) AWS Network Firewall ✓
IPAM Questions:
- Q: A large enterprise needs to manage IP address allocation across 50+ AWS accounts. Which service provides centralized IP address management?
  - A) VPC DHCP Options
  - B) Amazon VPC IP Address Manager (IPAM) ✓
  - C) Route 53 Resolver
  - D) AWS Config
Verified Permissions Questions:
- Q: Which service provides fine-grained authorization using Cedar policy language?
  - A) AWS IAM
  - B) Amazon Cognito
  - C) Amazon Verified Permissions ✓
  - D) AWS Directory Service
Deprecated Services Questions:
- Q: AWS App Mesh reached end-of-life in September 2026. What is the recommended migration path?
  - A) AWS Service Mesh
  - B) Amazon VPC Lattice ✓
  - C) Application Load Balancer
  - D) AWS Transit Gateway
- Q: A company is using AWS App Runner to deploy containerized web applications. Given that App Runner moved to maintenance mode in April 2026, which service provides the most similar fully-managed container deployment experience?
  - A) Amazon EC2 with Auto Scaling
  - B) Amazon ECS with Fargate ✓
  - C) AWS Lambda
  - D) Amazon EKS with managed node groups

AWS Whitepapers & Cheatsheets

Architecting for the AWS Cloud: Best Practices
AWS Well-Architected Framework whitepaper
AWS Sustainability Pillar – New addition to Well-Architected Framework
AWS Storage & Content Delivery Services Cheatsheet
AWS Compute Services Cheat Sheet
AWS Database Services Cheat Sheet
New 2025: AWS Networking Services Cheat Sheet covering VPC Lattice, IPAM, and Network Firewall
New 2026: AWS Certification Coming Soon page – Track SAA-C04 rollout and exam guide updates

Important Migration Notes for Deprecated Services

Service Migration Guide (Updated June 2026)

AWS App Mesh → Amazon VPC Lattice / ECS Service Connect:
- VPC Lattice provides simpler service-to-service connectivity
- ECS Service Connect for ECS-native service mesh capabilities
- No need for sidecar proxies or complex mesh configuration
- Built-in security policies and observability
- Deadline: September 30, 2026
AWS App Runner → ECS Fargate / Lambda / EKS:
- ECS Fargate for containerized workloads with more control
- Lambda for event-driven, short-duration workloads
- EKS for Kubernetes-native deployments
- Status: Maintenance mode from April 30, 2026
Amazon RDS Custom for Oracle → RDS for Oracle / EC2:
- Standard RDS for Oracle if OS-level access not critical
- Self-managed Oracle on EC2 for full customization
- Deadline: March 31, 2027
AWS CloudTrail Lake → CloudWatch Logs Insights / S3 + Athena:
- CloudWatch Logs Insights for querying CloudTrail logs
- S3 with Athena for long-term log analysis at scale
- Status: No new customers from May 31, 2026
Amazon S3 Glacier (standalone) → S3 Glacier Storage Classes:
- Use S3 Glacier Instant Retrieval for frequent access
- Use S3 Glacier Flexible Retrieval for standard archival
- Use S3 Glacier Deep Archive for long-term archival

✅ AWS CodeCommit: Returned to full General Availability (November 2025) after being temporarily de-emphasized. Git LFS support coming Q1 2026, regional expansions Q3 2026.

SAA-C03 Architecture Patterns

On the Exam Day

Make sure you are relaxed and get a good night’s sleep. The exam is not tough if you are well-prepared.
If you are taking the AWS Online exam
- Try to join at least 30 minutes before the actual time as I have had issues with both PSI and Pearson with long wait times.
- The online verification process does take some time and usually, there are glitches.
- Remember, you will not be allowed to take the exam if you are late by more than 30 minutes.
- Make sure your desk is clear, with no wristwatches or external monitors, keep your phones away, and ensure nobody can enter the room.
Be prepared for scenario-based questions focusing on cost optimization, sustainability considerations, and modern networking architectures.
Key Focus Areas for 2026:
- Service-to-service connectivity patterns (VPC Lattice)
- Advanced security implementations (Verified Permissions, Network Firewall)
- Cost optimization strategies (Savings Plans, right-sizing)
- Sustainability considerations in architecture decisions
- Migration strategies for deprecated services (App Mesh, App Runner, RDS Custom for Oracle)
- Resilient architecture design (increased to 30% in SAA-C04)

Finally, All the Best 🙂

June 2026 Update Summary

This post has been updated to reflect the latest AWS certification and service changes. Key additions include: the SAA-C04 exam revision announcement (Q2-Q3 2026 rollout with grace period until Sept 30, 2026), AWS CodeCommit’s return to General Availability (Nov 2025), new service deprecations (App Runner maintenance mode, RDS Custom for Oracle sunset, CloudTrail Lake maintenance mode), and updated exam delivery improvements. The post continues to cover VPC Lattice, IPAM, Network Firewall, Verified Permissions, and essential migration guidance for deprecated services.

AWS EC2 Monitoring – CloudWatch Metrics & Alarms

September 20, 2022 ~ Last updated on : June 26, 2026 ~ jayendrapatil ~ 8 Comments

EC2 Monitoring

Status Checks

Status monitoring helps quickly determine whether EC2 has detected any problems that might prevent instances from running applications.
EC2 performs automated checks on every running EC2 instance to identify hardware and software issues.
Status checks are performed every minute and each returns a pass or a fail status.
If all checks pass, the overall status of the instance is OK.
If one or more checks fail, the overall status is Impaired.
Status checks are built into EC2, so they cannot be disabled or deleted.
There are three types of status checks:
- System status checks
- Instance status checks
- Attached EBS status checks
Status checks data augments the information that EC2 already provides about the intended state of each instance (such as pending, running, and stopping) as well as the utilization metrics that CloudWatch monitors (CPU utilization, network traffic, and disk activity).
Alarms that are triggered based on the result of the status checks can be created or deleted. For example, an alarm can be created to warn if status checks fail on a specific instance.
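The OK/Impaired rule described above can be sketched as a small function over a status record. The dictionary below is assumed to follow the shape of one entry returned by the EC2 `describe_instance_status` API. The sample values are made up, and other states such as "initializing" are deliberately ignored in this simplification.

```python
def overall_status(instance_status: dict) -> str:
    """Return 'Impaired' if any reported status check is impaired, else 'OK'."""
    keys = ("SystemStatus", "InstanceStatus", "AttachedEbsStatus")
    # AttachedEbsStatus may be absent, for example on non-Nitro instances.
    statuses = [instance_status[k]["Status"] for k in keys if k in instance_status]
    return "Impaired" if "impaired" in statuses else "OK"

# Hypothetical record: the instance status check is failing.
sample = {
    "InstanceId": "i-0123456789abcdef0",
    "SystemStatus": {"Status": "ok"},
    "InstanceStatus": {"Status": "impaired"},
    "AttachedEbsStatus": {"Status": "ok"},
}
print(overall_status(sample))  # prints "Impaired"
```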

System Status Checks

monitor the AWS systems, required to use the instance, to ensure they are working properly.
detect problems with the instance that require AWS involvement to repair.
System status check failures might be due to
- Loss of network connectivity
- Loss of system power
- Software issues on the physical host
- Hardware issues on the physical host
When a system status check fails, one can either
- check AWS Health Dashboard for any scheduled critical maintenance by AWS to the instance’s host.
- wait for AWS to fix the issue
- or resolve it by stopping and starting the instance (which moves an EBS-backed instance to a new host), or by terminating and replacing it

Instance Status Checks

monitor the software and network configuration of the individual instance
detect problems that require your involvement to repair.
Instance status checks failure might be due to
- Failed system status checks
- Misconfigured networking or startup configuration
- Exhausted memory
- Corrupted file system
- Incompatible kernel
When an instance status check fails, it can be resolved by either rebooting the instance or by making modifications to the operating system

Attached EBS Status Checks

monitor whether the EBS volumes attached to an instance are reachable and able to complete I/O operations.
available for Nitro-based instances only.
helps detect issues where the instance cannot communicate with one or more attached EBS volumes.
Attached EBS status check failure might be due to
- Hardware or software issues on the storage subsystem underlying the EBS volume
- Hardware issues on the physical host impacting reachability to EBS
The metric StatusCheckFailed_AttachedEBS is available at a 1-minute frequency at no additional charge.
Can be used with CloudWatch alarms and Auto Scaling health checks to replace instances with impaired EBS volumes.

EC2 Instance Recovery

Simplified Automatic Recovery
- enabled by default during instance launch on supported instances.
- automatically moves the instance from the impaired host to a different host when a system status check failure is detected.
- recovered instance is identical to the original (instance ID, private IP, Elastic IP, metadata, placement group).
- does not require a CloudWatch alarm to be configured.
- works only for system status check failures, not for instance status check failures.
- available for over 90% of deployed EC2 instances.
CloudWatch Action Based Recovery
- can be configured optionally after instance launch using CloudWatch alarms.
- provides the ability to set a recovery action on a CloudWatch alarm monitoring the StatusCheckFailed_System metric.
- provides more granular control over recovery conditions and notification.
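As a sketch of CloudWatch action based recovery, the function below assembles the arguments for a boto3 `put_metric_alarm` call that watches StatusCheckFailed_System and triggers the EC2 recover action. The alarm name, instance ID, and the choice of two 1-minute evaluation periods are assumptions for illustration, and the call itself is left commented out.

```python
def recovery_alarm_params(instance_id: str, region: str) -> dict:
    """Arguments for an alarm that recovers an instance on system check failure."""
    return {
        "AlarmName": f"recover-{instance_id}",  # hypothetical naming scheme
        "Namespace": "AWS/EC2",
        "MetricName": "StatusCheckFailed_System",
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Statistic": "Maximum",
        "Period": 60,                 # status check metrics arrive every minute
        "EvaluationPeriods": 2,       # assumed: two consecutive failed minutes
        "Threshold": 1,
        "ComparisonOperator": "GreaterThanOrEqualToThreshold",
        # Built-in EC2 recover action for the instance's Region.
        "AlarmActions": [f"arn:aws:automate:{region}:ec2:recover"],
    }

params = recovery_alarm_params("i-0123456789abcdef0", "us-east-1")
# With credentials configured:
#   boto3.client("cloudwatch").put_metric_alarm(**params)
```

An SNS topic ARN could be appended to `AlarmActions` to also send a notification when recovery is triggered.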

CloudWatch Monitoring

CloudWatch helps monitor EC2 instances by collecting and processing raw data from EC2 into readable, near real-time metrics.
CloudWatch retains metric data for up to 15 months, at coarser granularity for older data points, so that historical information can be accessed and used to gain a better perspective on how the application or service is performing.
By default, Basic monitoring is enabled and EC2 metric data is sent to CloudWatch in 5-minute periods automatically.
Detailed monitoring can be enabled on the EC2 instance, which sends data to CloudWatch in 1-minute periods.
Organization-wide Detailed Monitoring Enablement (2026)
- CloudWatch Ingestion enablement rules can automatically enable detailed monitoring for both existing and newly launched EC2 instances matching the rule scope.
- Ensures consistent 1-minute metrics collection across EC2 instances at the organization or account level.
Aggregating Statistics Across Instances/ASG/AMI ID
- Aggregate statistics are available for the instances that have detailed monitoring (at an additional charge) enabled, which provides data in 1-minute periods
- Instances that use basic monitoring are not included in the aggregates.
- CloudWatch does not aggregate data across Regions. Therefore, metrics are completely separate between regions.
- CloudWatch returns statistics for all dimensions in the AWS/EC2 namespace if no dimension is specified
- The technique for retrieving all dimensions across an AWS namespace does not work for custom namespaces published to CloudWatch.
- Statistics include Sum, Average, Minimum, Maximum, Data Samples
- With custom namespaces, you must specify the complete set of dimensions associated with a data point to retrieve statistics that include that data point.
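To illustrate aggregation by dimension, the sketch below builds the arguments for a boto3 `get_metric_statistics` call that averages CPUUtilization across an Auto Scaling group. The group name, the 14-day window, and the 1-hour period are assumptions for illustration. Passing an empty dimension list instead aggregates across all instances in the Region that have detailed monitoring enabled.

```python
from datetime import datetime, timedelta, timezone

def asg_cpu_query(asg_name: str, end: datetime, days: int = 14) -> dict:
    """Arguments for average CPU across every instance in one Auto Scaling group."""
    return {
        "Namespace": "AWS/EC2",
        "MetricName": "CPUUtilization",
        "Dimensions": [{"Name": "AutoScalingGroupName", "Value": asg_name}],
        "StartTime": end - timedelta(days=days),
        "EndTime": end,
        "Period": 3600,              # one data point per hour
        "Statistics": ["Average"],
    }

query = asg_cpu_query("web-asg", datetime.now(timezone.utc))  # hypothetical group
# With credentials configured:
#   boto3.client("cloudwatch").get_metric_statistics(**query)
```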
CloudWatch alarms
- can be created to monitor any one of the EC2 instance’s metrics.
- can be configured to automatically send you a notification when the metric reaches a specified threshold.
- can automatically stop, terminate, reboot, or recover EC2 instances
- can automatically recover an EC2 instance when the instance becomes impaired due to an underlying hardware failure or a problem that requires AWS involvement to repair
- can automatically stop or terminate the instances to save costs (EC2 instances that use an EBS volume as the root device can be stopped or terminated, whereas instances that use the instance store as the root device can only be terminated)
- can use EC2ActionsAccess IAM role, which enables AWS to perform stop, terminate, or reboot actions on EC2 instances
- If you have read/write permissions for CloudWatch but not for EC2, alarms can still be created but the stop or terminate actions won’t be performed on the EC2 instance
- Composite Alarms can combine multiple metric alarms into a single alarm for aggregated health, but cannot perform EC2 actions directly.
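A composite alarm is defined by a rule expression that references other alarms by name. The sketch below assembles such an expression as a string. The helper name and the alarm names are hypothetical, and only the AND and OR operators are handled.

```python
def alarm_rule(alarm_names: list, operator: str = "AND") -> str:
    """Combine child alarms into a composite alarm rule expression."""
    if operator not in ("AND", "OR"):
        raise ValueError("operator must be 'AND' or 'OR'")
    # ALARM("name") is true while the named alarm is in the ALARM state.
    return f" {operator} ".join(f'ALARM("{name}")' for name in alarm_names)

rule = alarm_rule(["cpu-high", "status-check-failed"])
print(rule)  # prints: ALARM("cpu-high") AND ALARM("status-check-failed")
```

The resulting string would be passed as the `AlarmRule` argument of a `put_composite_alarm` call, with any notification handled through an SNS action on the composite alarm.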

CloudWatch Agent

The unified CloudWatch agent collects system-level metrics and logs from EC2 instances that are not available through the default hypervisor-level metrics.
Key OS-level metrics collected by the agent include:
- Memory utilization (mem_used_percent)
- Disk usage (disk_used_percent)
- Swap usage
- Process-level metrics (procstat)
EC2 does NOT provide memory or disk usage metrics by default — these require the CloudWatch agent.
Can be installed and managed via AWS Systems Manager (SSM).
Configuration is stored in a JSON file or as an SSM Parameter Store parameter.
Metrics collected by the CloudWatch agent are billed as custom metrics.
In-Console Agent Management (2025/2026)
- CloudWatch provides visibility into agent status across the EC2 fleet directly in the console.
- Automatic detection of supported workloads and recommended monitoring configurations.
- Visual configuration editor for the agent eliminates the need to hand-edit JSON (April 2026).
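For reference, a minimal agent configuration that collects the memory, disk, and swap metrics listed above might look like the JSON generated below. This is a sketch under assumptions: the 60-second interval and the choice to monitor only the root mount point are illustrative, and a real configuration typically carries more settings.

```python
import json

def agent_config(interval_seconds: int = 60) -> str:
    """Build a minimal CloudWatch agent configuration as a JSON string."""
    config = {
        "agent": {"metrics_collection_interval": interval_seconds},
        "metrics": {
            # Tag each metric with the instance it came from.
            "append_dimensions": {"InstanceId": "${aws:InstanceId}"},
            "metrics_collected": {
                "mem": {"measurement": ["mem_used_percent"]},
                # "used_percent" under "disk" is published as disk_used_percent.
                "disk": {"measurement": ["used_percent"], "resources": ["/"]},
                "swap": {"measurement": ["swap_used_percent"]},
            },
        },
    }
    return json.dumps(config, indent=2)

config_json = agent_config()
```

The generated JSON could be saved to a file on the instance or stored as a Parameter Store parameter for the agent to load.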

EC2 Monitoring Metrics

Instance Metrics

CPUUtilization
- % of physical CPU time that EC2 uses to run the instance, including time spent running both user code and EC2 code.
- At a very high level, CPUUtilization is the sum of guest CPUUtilization and hypervisor CPUUtilization.
DiskReadOps
- Completed read operations from all instance store volumes available to the instance in a specified period of time.
- If there are no instance store volumes, the value is 0 or the metric is not reported.
DiskWriteOps
- Completed write operations to all instance store volumes available to the instance in a specified period of time.
- If there are no instance store volumes, the value is 0 or the metric is not reported.
DiskReadBytes
- Bytes read from all instance store volumes available to the instance.
- This metric is used to determine the volume of the data the application reads from the hard disk of the instance.
DiskWriteBytes
- Bytes written to all instance store volumes available to the instance.
- This metric is used to determine the volume of the data the application writes onto the hard disk of the instance.
MetadataNoToken
- The number of times the Instance Metadata Service (IMDS) was successfully accessed using a method that does not use a token (IMDSv1).
- Used to determine if there are any processes accessing instance metadata using IMDSv1, which is less secure than IMDSv2.
- If all requests use token-backed sessions (IMDSv2), the value is 0.
MetadataNoTokenRejected
- The number of times an IMDSv1 call was attempted after IMDSv1 was disabled on the instance.
- Indicates that software on the instance still attempts IMDSv1 calls and needs updating.
NetworkIn
- The number of bytes received on all network interfaces by the instance. This metric identifies the volume of incoming network traffic to an application on a single instance.
NetworkOut
- The number of bytes sent out on all network interfaces by the instance. This metric identifies the volume of outgoing network traffic from a single instance.
NetworkPacketsIn
- The number of packets received on all network interfaces by the instance.
- This metric is available for basic monitoring only (5-minute periods).
NetworkPacketsOut
- The number of packets sent out on all network interfaces by the instance.
- This metric is available for basic monitoring only (5-minute periods).
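Regarding the MetadataNoToken metric above, the difference between IMDSv1 and IMDSv2 is that IMDSv2 first obtains a session token with a PUT request and then sends it as a header on every metadata request. The sketch below only constructs the two requests with the standard library and does not send them, since the endpoint is reachable only from inside an instance. The helper names are assumptions for illustration.

```python
from urllib.request import Request

IMDS = "http://169.254.169.254"

def token_request(ttl_seconds: int = 21600) -> Request:
    """Step 1: PUT request that asks IMDSv2 for a session token."""
    return Request(
        f"{IMDS}/latest/api/token",
        method="PUT",
        headers={"X-aws-ec2-metadata-token-ttl-seconds": str(ttl_seconds)},
    )

def metadata_request(path: str, token: str) -> Request:
    """Step 2: GET request that presents the token, so it is not counted as IMDSv1."""
    return Request(
        f"{IMDS}/latest/meta-data/{path}",
        headers={"X-aws-ec2-metadata-token": token},
    )

# On an instance, each request would be sent with urllib.request.urlopen(...),
# using the body of the first response as the token for the second.
req = metadata_request("instance-id", "example-token")
```

Software that skips the first step and calls the metadata path directly is what increments MetadataNoToken.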

CPU Credit Metrics (Burstable Performance Instances)

Applicable to all burstable performance instances (T2, T3, T3a, T4g) — not just T2.
CPU Credit metrics are available at a 5-minute frequency only.
CPUCreditUsage
- The number of CPU credits spent by the instance for CPU utilization.
- One CPU credit equals one vCPU running at 100% utilization for one minute.
CPUCreditBalance
- The number of earned CPU credits that an instance has accrued since it was launched or started.
- For T2 Standard, also includes the number of launch credits accrued.
- When a T3/T3a instance stops, the CPUCreditBalance persists for seven days. When a T2 instance stops, credits are lost.
- Used to determine how long an instance can burst beyond its baseline performance level.
CPUSurplusCreditBalance (Unlimited mode only)
- The number of surplus credits spent when the CPUCreditBalance is zero.
- Surplus credits are paid down by earned CPU credits.
- If surplus credits exceed the maximum earnable in a 24-hour period, additional charges apply.
CPUSurplusCreditsCharged (Unlimited mode only)
- The number of surplus credits that are not paid down and incur an additional charge.
- Charged when surplus credits exceed 24-hour maximum, instance is stopped/terminated, or switched from unlimited to standard mode.
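Since one CPU credit equals one vCPU at 100% utilization for one minute, CPUCreditBalance can be turned into a rough burst-time estimate. The sketch below assumes a constant utilization and a per-vCPU baseline, and it ignores the maximum credit cap. The sample figures (2 vCPUs, 10% baseline, 288 credits) are illustrative rather than taken from this post.

```python
import math

def burst_minutes(credit_balance: float, vcpus: int,
                  utilization: float, baseline: float) -> float:
    """Estimate minutes an instance can sustain `utilization` before credits run out.

    utilization and baseline are per-vCPU fractions between 0 and 1.
    """
    if utilization <= baseline:
        # At or below baseline, credits are earned at least as fast as they are spent.
        return math.inf
    # Credits spent per minute minus credits earned per minute.
    drain_per_minute = vcpus * (utilization - baseline)
    return credit_balance / drain_per_minute

# Illustrative: 288 credits, 2 vCPUs at 100% with a 10% baseline.
print(round(burst_minutes(288, 2, 1.0, 0.10)))
```

In unlimited mode the instance keeps bursting after the balance reaches zero and accrues surplus credits instead, so this estimate applies to standard mode.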

Amazon EBS Metrics for Nitro-based Instances

Available for EBS volumes attached to Nitro-based instances (non-bare-metal).
EBSReadOps / EBSWriteOps – Completed read/write operations from all attached EBS volumes.
EBSReadBytes / EBSWriteBytes – Bytes read from/written to all attached EBS volumes.
EBSIOBalance%
- Percentage of I/O credits remaining in the burst bucket.
- Available for basic monitoring only.
- Available for some *.4xlarge and smaller instance sizes that burst to maximum performance for 30 minutes every 24 hours.
EBSByteBalance%
- Percentage of throughput credits remaining in the burst bucket.
- Available for basic monitoring only.
- Available for some *.4xlarge and smaller instance sizes that burst to maximum performance for 30 minutes every 24 hours.
InstanceEBSIOPSExceededCheck
- Reports whether the application attempted to drive IOPS exceeding the maximum EBS IOPS limits for the instance.
- Values: 0 (not exceeded) or 1 (exceeded).
InstanceEBSThroughputExceededCheck
- Reports whether the application attempted to drive throughput exceeding the maximum EBS throughput limits for the instance.
- Values: 0 (not exceeded) or 1 (exceeded).

Status Check Metrics

Available at a 1-minute frequency at no charge by default.
StatusCheckFailed
- Reports whether the instance has failed any of the status checks in the last minute.
- Values: 0 (passed) or 1 (failed).
StatusCheckFailed_Instance
- Reports whether the instance has passed the EC2 instance status check in the last minute.
- Values: 0 (passed) or 1 (failed).
StatusCheckFailed_System
- Reports whether the instance has passed the EC2 system status check in the last minute.
- Values: 0 (passed) or 1 (failed).
StatusCheckFailed_AttachedEBS
- Reports whether the instance has passed the attached EBS status check in the last minute.
- Values: 0 (passed) or 1 (failed).
- Available for Nitro-based instances only.

Accelerator Metrics

GPUPowerUtilization
- Active power usage as a percentage of maximum active power.
- Available for supported accelerated computing instances only.

CloudWatch Network Flow Monitor

Launched at re:Invent 2024 as part of CloudWatch Network Monitoring.
Provides near real-time visibility into network performance (packet loss and latency) for traffic between EC2 instances, EKS workloads, and AWS services (S3, DynamoDB).
Uses fully-managed agents installed on EC2 instances to collect TCP-based performance metrics.
Agents send aggregated metrics to the backend approximately every 30 seconds.
Top contributors feature identifies network flows with the highest retransmissions or latency to help pinpoint impairments.
Supports multi-account monitoring via AWS Organizations integration.

EC2 Metric Dimensions

InstanceId – Filters data for a specific instance.
InstanceType – Filters data for all instances of a specific type (requires Detailed Monitoring).
ImageId (AMI ID) – Filters data for all instances running a specific AMI (requires Detailed Monitoring).
AutoScalingGroupName – Filters data for all instances in a specified Auto Scaling group.

AWS Certification Exam Practice Questions

Questions are collected from the Internet and the answers are marked as per my knowledge and understanding (which might differ from yours).

AWS services are updated every day and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep pace with AWS updates, so even if the underlying feature has changed the question might not be updated.

Open to further feedback, discussion and correction.

In the basic monitoring package for EC2, Amazon CloudWatch provides the following metrics:
1. Web server visible metrics such as number of failed transaction requests
2. Operating system visible metrics such as memory utilization
3. Database visible metrics such as number of connections
4. Hypervisor visible metrics such as CPU utilization
Which of the following requires a custom CloudWatch metric to monitor?
1. Memory Utilization of an EC2 instance
2. CPU Utilization of an EC2 instance
3. Disk usage activity of an EC2 instance
4. Data transfer of an EC2 instance
A user has configured CloudWatch monitoring on an EBS backed EC2 instance. If the user has not attached any additional device, which of the below mentioned metrics will always show a 0 value?
1. DiskReadBytes
2. NetworkIn
3. NetworkOut
4. CPUUtilization
A user is running a batch process on EBS backed EC2 instances. The batch process starts a few instances to process Hadoop MapReduce jobs, which can run between 50 – 600 minutes or sometimes for more time. The user wants to configure that the instance gets terminated only when the process is completed. How can the user configure this with CloudWatch?
1. Setup the CloudWatch action to terminate the instance when the CPU utilization is less than 5%
2. Setup the CloudWatch with Auto Scaling to terminate all the instances
3. Setup a job which terminates all instances after 600 minutes
4. It is not possible to terminate instances automatically
An AWS account owner has set up multiple IAM users. One IAM user only has CloudWatch access. He has set up the alarm action, which stops the EC2 instances when the CPU utilization is below the threshold limit. What will happen in this case?
1. It is not possible to stop the instance using the CloudWatch alarm
2. CloudWatch will stop the instance when the action is executed
3. The user cannot set an alarm on EC2 since he does not have the permission
4. The user can set up the action but it will not be executed if the user does not have EC2 rights
A user has launched 10 instances from the same AMI ID using Auto Scaling. The user is trying to see the average CPU utilization across all instances of the last 2 weeks under the CloudWatch console. How can the user achieve this?
1. View the Auto Scaling CPU metrics (Refer AS Instance Monitoring)
2. Aggregate the data over the instance AMI ID (Works but needs detailed monitoring enabled)
3. The user has to use the CloudWatch analyser to find the average data across instances
4. It is not possible to see the average CPU utilization of the same AMI ID since the instance ID is different
Which EC2 status check type monitors whether the EBS volumes attached to a Nitro-based instance are reachable?
1. System status check
2. Instance status check
3. Attached EBS status check
4. Volume status check
An organization wants to monitor memory utilization of their EC2 instances. Which approach should they use?
1. Enable detailed monitoring on the instances
2. Install the unified CloudWatch agent and configure memory metrics
3. Use the default CloudWatch EC2 metrics
4. Enable enhanced monitoring on the instances
Which CloudWatch metric can help identify if an EC2 instance is still using the less secure IMDSv1 to access instance metadata?
1. StatusCheckFailed_Instance
2. MetadataNoToken
3. CPUCreditBalance
4. NetworkPacketsIn
A company wants to ensure all EC2 instances across their AWS Organization have detailed monitoring enabled. What is the most efficient approach? [Select 2]
1. Manually enable detailed monitoring on each instance
2. Create CloudWatch Ingestion enablement rules scoped to the organization
3. Use enablement rules to automatically enable detailed monitoring for existing and new instances
4. Use AWS Config rules to detect and auto-remediate