AWS Storage Options – Whitepaper – Certification

May 29, 2016 ~ Last updated on : June 24, 2026 ~ jayendrapatil

Table of Contents hide

📋 Whitepaper Archived

Overview

AWS Storage Services

Deprecated Services Referenced in Exam Questions

AWS Certification Exam Practice Questions

AWS Storage Options – Whitepaper – Certification

📋 Whitepaper Archived

The original AWS Storage Options whitepaper has been archived by AWS. AWS now recommends referring to the Storage section in the AWS Overview whitepaper or the AWS Cloud Storage page for current storage guidance.

This content is maintained and updated for certification exam preparation as the core storage concepts and service selection patterns remain highly relevant.

AWS Storage Options is one of the most important topics for AWS Solution Architect Professional Certification exam and covers a brief summary of each AWS storage option, their ideal usage patterns, anti-patterns, performance, durability and availability, scalability etc.

Overview

AWS offers multiple cloud-based storage options. Each has a unique combination of performance, durability, availability, cost, and interface, as well as other characteristics such as scalability and elasticity
All storage options are ideally suited for some use cases and there are certain Anti-Patterns which should be taken into account while making a storage choice
AWS storage services now span object storage, block storage, file storage, archival storage, hybrid storage, data transfer, and backup services

AWS Various Storage Options

AWS Storage Services

Amazon S3 & S3 Glacier Storage Classes

More Details @ AWS Storage Options – S3 & Glacier

Key Updates (2024-2026):

S3 Glacier is now three separate storage classes:
- S3 Glacier Instant Retrieval – millisecond retrieval for rarely accessed data
- S3 Glacier Flexible Retrieval (formerly S3 Glacier) – minutes to hours retrieval
- S3 Glacier Deep Archive – lowest cost, 12-48 hour retrieval
S3 Express One Zone (launched 2023) – up to 10x faster performance than S3 Standard, single-digit millisecond latency, designed for most frequently accessed data. Received up to 85% price reduction in 2025.

S3 Tables (launched Dec 2024) – fully managed Apache Iceberg tables optimized for analytics workloads with up to 3x faster query throughput
S3 Intelligent-Tiering – now includes Archive Instant Access, Archive Access, and Deep Archive Access tiers

Amazon Elastic Block Store (EBS) & Instance Store Volumes

More details @ AWS Storage Options – EBS & Instance Store

Amazon EFS (Elastic File System)

Fully managed, elastic NFS file system for Linux workloads
Supports machine learning, big data analytics, web serving, and content management
Scales automatically without provisioning or managing capacity
Offers Standard and Infrequent Access storage classes with lifecycle management

Amazon FSx Family

FSx for Windows File Server – fully managed Windows-native file system

FSx for Lustre – high-performance file system for compute-intensive workloads (new Elastic storage class launched 2025)
FSx for NetApp ONTAP – fully managed shared storage with NetApp ONTAP (2nd gen file systems in 2024)
FSx for OpenZFS – fully managed OpenZFS file system (Intelligent-Tiering storage class launched Dec 2024, saves up to 85%)

Key Updates:

Storage Gateway continues to provide S3 File Gateway, Tape Gateway, and Volume Gateway
FSx File Gateway is no longer available to new customers (effective October 28, 2024). Existing customers should migrate to direct Amazon FSx for Windows File Server access.

All Storage Gateway appliances must migrate from Amazon Linux 2 to AL2023 for continued updates

AWS Data Transfer & Migration Services

⚠️ AWS Import/Export & Snow Family Updates:

AWS Import/Export (original disk-shipping service) – deprecated long ago, replaced by Snow Family
AWS Snowmobile – Retired in March 2024. Service is no longer available.

AWS Snowcone – Discontinued effective November 12, 2024. Support ended November 12, 2025.
AWS Snowball Edge – Only available to existing customers as of November 7, 2025. Not available to new customers.

Recommended Replacements:

AWS DataSync – for online data transfers (now supports cross-cloud transfers to Google Cloud, Azure, Oracle Cloud as of 2025)

AWS Data Transfer Terminal (launched Dec 2024) – secure physical locations where you bring your storage devices and connect directly to the AWS network for high-speed uploads to S3, EFS, and other services
AWS Outposts – for edge computing use cases previously served by Snow devices
AWS Partner solutions – for specialized migration needs

AWS Backup

Fully managed, centralized backup service that automates data protection across AWS services and hybrid workloads

Supports EC2, EBS, RDS, DynamoDB, EFS, FSx, S3, Storage Gateway, and Amazon EKS (added 2025)
Provides ransomware detection and recovery capabilities
Supports cross-Region and cross-account backup with AWS Organizations integration
Logically air-gapped vaults for additional protection
Policy-based backup plans with configurable frequency and retention

Deprecated Services Referenced in Exam Questions

⚠️ Amazon Elastic Transcoder – EOL November 13, 2025

Amazon Elastic Transcoder has been discontinued. AWS Elemental MediaConvert is the recommended replacement, offering better performance, more features, and lower pricing. Questions referencing Elastic Transcoder still appear on older exam versions but the correct architectural pattern (S3 + transcoding + CloudFront) remains valid using MediaConvert.

⚠️ Amazon SWF (Simple Workflow Service) – Superseded by Step Functions

While SWF remains available, AWS recommends Step Functions for all new applications. SWF still appears in exam questions but new designs should use Step Functions for workflow orchestration.

AWS Certification Exam Practice Questions

Questions are collected from Internet and the answers are marked as per my knowledge and understanding (which might differ with yours).

AWS services are updated everyday and both the answers and questions might be outdated soon, so research accordingly.

AWS exam questions are not updated to keep up the pace with AWS updates, so even if the underlying feature has changed the question might not be updated

Open to further feedback, discussion and correction.

You are developing a highly available web application using stateless web servers. Which services are suitable for storing session state data? Choose 3 answers.
1. Elastic Load Balancing
2. Amazon Relational Database Service (RDS)
3. Amazon CloudWatch
4. Amazon ElastiCache
5. Amazon DynamoDB
6. AWS Storage Gateway
Your firm has uploaded a large amount of aerial image data to S3. In the past, in your on-premises environment, you used a dedicated group of servers to oaten process this data and used Rabbit MQ, an open source messaging system, to get job information to the servers. Once processed the data would go to tape and be shipped offsite. Your manager told you to stay with the current design, and leverage AWS archival storage and messaging services to minimize cost. Which is correct? [PROFESSIONAL]
1. Use SQS for passing job messages, use Cloud Watch alarms to terminate EC2 worker instances when they become idle. Once data is processed, change the storage class of the S3 objects to Reduced Redundancy Storage.
2. Setup Auto-Scaled workers triggered by queue depth that use spot instances to process messages in SQS. Once data is processed, change the storage class of the S3 objects to Reduced Redundancy Storage.
3. Setup Auto-Scaled workers triggered by queue depth that use spot instances to process messages in SQS. Once data is processed, change the storage class of the S3 objects to Glacier. (Now S3 Glacier Flexible Retrieval or S3 Glacier Deep Archive)
4. Use SNS to pass job messages use Cloud Watch alarms to terminate spot worker instances when they become idle. Once data is processed, change the storage class of the S3 object to Glacier.
You are developing a new mobile application and are considering storing user preferences in AWS, which would provide a more uniform cross-device experience to users using multiple mobile devices to access the application. The preference data for each user is estimated to be 50KB in size. Additionally 5 million customers are expected to use the application on a regular basis. The solution needs to be cost-effective, highly available, scalable and secure, how would you design a solution to meet the above requirements? [PROFESSIONAL]
1. Setup an RDS MySQL instance in 2 availability zones to store the user preference data. Deploy a public facing application on a server in front of the database to manage security and access credentials
2. Setup a DynamoDB table with an item for each user having the necessary attributes to hold the user preferences. The mobile application will query the user preferences directly from the DynamoDB table. Utilize STS. Web Identity Federation, and DynamoDB Fine Grained Access Control to authenticate and authorize access
3. Setup an RDS MySQL instance with multiple read replicas in 2 availability zones to store the user preference data .The mobile application will query the user preferences from the read replicas. Leverage the MySQL user management and access privilege system to manage security and access credentials.
4. Store the user preference data in S3 Setup a DynamoDB table with an item for each user and an item attribute pointing to the user’ S3 object. The mobile application will retrieve the S3 URL from DynamoDB and then access the S3 object directly utilize STS, Web identity Federation, and S3 ACLs to authenticate and authorize access.

A company is building a voting system for a popular TV show, viewers would watch the performances then visit the show’s website to vote for their favorite performer. It is expected that in a short period of time after the show has finished the site will receive millions of visitors. The visitors will first login to the site using their Amazon.com credentials and then submit their vote. After the voting is completed the page will display the vote totals. The company needs to build the site such that can handle the rapid influx of traffic while maintaining good performance but also wants to keep costs to a minimum. Which of the design patterns below should they use? [PROFESSIONAL]
1. Use CloudFront and an Elastic Load balancer in front of an auto-scaled set of web servers, the web servers will first can the Login With Amazon service to authenticate the user then process the users vote and store the result into a multi-AZ Relational Database Service instance.
2. Use CloudFront and the static website hosting feature of S3 with the Javascript SDK to call the Login With Amazon service to authenticate the user, use IAM Roles to gain permissions to a DynamoDB table to store the users vote.
3. Use CloudFront and an Elastic Load Balancer in front of an auto-scaled set of web servers, the web servers will first call the Login with Amazon service to authenticate the user, the web servers will process the users vote and store the result into a DynamoDB table using IAM Roles for EC2 instances to gain permissions to the DynamoDB table.
4. Use CloudFront and an Elastic Load Balancer in front of an auto-scaled set of web servers, the web servers will first call the Login With Amazon service to authenticate the user, the web servers would process the users vote and store the result into an SQS queue using IAM Roles for EC2 Instances to gain permissions to the SQS queue. A set of application servers will then retrieve the items from the queue and store the result into a DynamoDB table
A large real-estate brokerage is exploring the option to adding a cost-effective location-based alert to their existing mobile application. The application backend infrastructure currently runs on AWS. Users who opt in to this service will receive alerts on their mobile device regarding real-estate offers in proximity to their location. For the alerts to be relevant delivery time needs to be in the low minute count. The existing mobile app has 5 million users across the US. Which one of the following architectural suggestions would you make to the customer? [PROFESSIONAL]
1. Mobile application will submit its location to a web service endpoint utilizing Elastic Load Balancing and EC2 instances. DynamoDB will be used to store and retrieve relevant offers. EC2 instances will communicate with mobile carriers/device providers to push alerts back to mobile application.
2. Use AWS Direct Connect or VPN to establish connectivity with mobile carriers EC2 instances will receive the mobile applications location through carrier connection: RDS will be used to store and relevant offers. EC2 instances will communicate with mobile carriers to push alerts back to the mobile application
3. Mobile application will send device location using SQS. EC2 instances will retrieve the relevant offers from DynamoDB. AWS Mobile Push will be used to send offers to the mobile application (Note: Amazon SNS Mobile Push is now the terminology for mobile push notifications)
4. Mobile application will send device location using AWS Mobile Push. EC2 instances will retrieve the relevant offers from DynamoDB. EC2 instances will communicate with mobile carriers/device providers to push alerts back to the mobile application.
You are running a news website in the eu-west-1 region that updates every 15 minutes. The website has a worldwide audience and it uses an Auto Scaling group behind an Elastic Load Balancer and an Amazon RDS database. Static content resides on Amazon S3, and is distributed through Amazon CloudFront. Your Auto Scaling group is set to trigger a scale up event at 60% CPU utilization; you use an Amazon RDS extra-large DB instance with 10.000 Provisioned IOPS its CPU utilization is around 80%. While freeable memory is in the 2 GB range. Web analytics reports show that the average load time of your web pages is around 1.5 to 2 seconds, but your SEO consultant wants to bring down the average load time to under 0.5 seconds. How would you improve page load times for your users? (Choose 3 answers) [PROFESSIONAL]
1. Lower the scale up trigger of your Auto Scaling group to 30% so it scales more aggressively.
2. Add an Amazon ElastiCache caching layer to your application for storing sessions and frequent DB queries
3. Configure Amazon CloudFront dynamic content support to enable caching of re-usable content from your site
4. Switch Amazon RDS database to the high memory extra-large Instance type
5. Set up a second installation in another region, and use the Amazon Route 53 latency-based routing feature to select the right region.
A read only news reporting site with a combined web and application tier and a database tier that receives large and unpredictable traffic demands must be able to respond to these traffic fluctuations automatically. What AWS services should be used meet these requirements? [PROFESSIONAL]
1. Stateless instances for the web and application tier synchronized using ElastiCache Memcached in an autoscaling group monitored with CloudWatch. And RDS with read replicas.
2. Stateful instances for the web and application tier in an autoscaling group monitored with CloudWatch and RDS with read replicas
3. Stateful instances for the web and application tier in an autoscaling group monitored with CloudWatch. And multi-AZ RDS
4. Stateless instances for the web and application tier synchronized using ElastiCache Memcached in an autoscaling group monitored with CloudWatch and multi-AZ RDS

You have a periodic Image analysis application that gets some files as input, analyzes them and for each file writes some data in output to a ten file. The number of files in input per day is high and concentrated in a few hours of the day. Currently you have a server on EC2 with a large EBS volume that hosts the input data and the results it takes almost 20 hours per day to complete the process. What services could be used to reduce the elaboration time and improve the availability of the solution? [PROFESSIONAL]
1. S3 to store I/O files. SQS to distribute elaboration commands to a group of hosts working in parallel. Auto scaling to dynamically size the group of hosts depending on the length of the SQS queue
2. EBS with Provisioned IOPS (PIOPS) to store I/O files. SNS to distribute elaboration commands to a group of hosts working in parallel Auto Scaling to dynamically size the group of hosts depending on the number of SNS notifications
3. S3 to store I/O files, SNS to distribute evaporation commands to a group of hosts working in parallel. Auto scaling to dynamically size the group of hosts depending on the number of SNS notifications
4. EBS with Provisioned IOPS (PIOPS) to store I/O files SQS to distribute elaboration commands to a group of hosts working in parallel Auto Scaling to dynamically size the group to hosts depending on the length of the SQS queue.
A 3-tier e-commerce web application is current deployed on-premises and will be migrated to AWS for greater scalability and elasticity. The web server currently shares read-only data using a network distributed file system The app server tier uses a clustering mechanism for discovery and shared session state that depends on IP multicast The database tier uses shared-storage clustering to provide database fail over capability, and uses several read slaves for scaling. Data on all servers and the distributed file system directory is backed up weekly to off-site tapes. Which AWS storage and database architecture meets the requirements of the application? [PROFESSIONAL]
1. Web servers store read-only data in S3, and copy from S3 to root volume at boot time. App servers share state using a combination of DynamoDB and IP unicast. Database use RDS with multi-AZ deployment and one or more Read Replicas. Backup web and app servers backed up weekly via AMIs, database backed up via DB snapshots.
2. Web servers store read-only data in S3, and copy from S3 to root volume at boot time. App servers share state using a combination of DynamoDB and IP unicast. Database use RDS with multi-AZ deployment and one or more Read replicas. Backup web servers app servers, and database backed up weekly to Glacier using snapshots (Snapshots to Glacier don’t work directly with EBS snapshots)
3. Web servers store read-only data in S3 and copy from S3 to root volume at boot time. App servers share state using a combination of DynamoDB and IP unicast. Database use RDS with multi-AZ deployment. Backup web and app servers backed up weekly via AMIs. Database backed up via DB snapshots (Need Read replicas for scalability and elasticity)
4. Web servers, store read-only data in an EC2 NFS server, mount to each web server at boot time App servers share state using a combination of DynamoDB and IP multicast Database use RDS with multi-AZ deployment and one or more Read Replicas Backup web and app servers backed up weekly via AMIs database backed up via DB snapshots (IP multicast not available in AWS)
Our company is getting ready to do a major public announcement of a social media site on AWS. The website is running on EC2 instances deployed across multiple Availability Zones with a Multi-AZ RDS MySQL Extra Large DB Instance. The site performs a high number of small reads and writes per second and relies on an eventual consistency model. After comprehensive tests you discover that there is read contention on RDS MySQL. Which are the best approaches to meet these requirements? (Choose 2 answers) [PROFESSIONAL]
1. Deploy ElastiCache in-memory cache running in each availability zone
2. Implement sharding to distribute load to multiple RDS MySQL instances (Would distribute read write both, focus is on read contention)
3. Increase the RDS MySQL Instance size and Implement provisioned IOPS (Would distribute read write both, focus is on read contention)
4. Add an RDS MySQL read replica in each availability zone
Run 2-tier app with the following: an ELB, three web app server on EC2, and 1 MySQL RDS db. With grown load, db queries take longer and longer and slow down the overall response time for user request. What Options could speed up performance? (Choose 3) [PROFESSIONAL]
1. Create an RDS read-replica and redirect half of the database read request to it
2. Cache database queries in Amazon ElastiCache
3. Setup RDS in multi-availability zone mode.
4. Shard the database and distribute loads between shards.
5. Use Amazon CloudFront to cache database queries.
You have a web application leveraging an Elastic Load Balancer (ELB) In front of the web servers deployed using an Auto Scaling Group Your database is running on Relational Database Service (RDS) The application serves out technical articles and responses to them in general there are more views of an article than there are responses to the article. On occasion, an article on the site becomes extremely popular resulting in significant traffic Increases that causes the site to go down. What could you do to help alleviate the pressure on the infrastructure while maintaining availability during these events? Choose 3 answers [PROFESSIONAL]
1. Leverage CloudFront for the delivery of the articles.
2. Add RDS read-replicas for the read traffic going to your relational database
3. Leverage ElastiCache for caching the most frequently used data.
4. Use SQS to queue up the requests for the technical posts and deliver them out of the queue (does not process and would not be real time)
5. Use Route53 health checks to fail over to an S3 bucket for an error page (more of an error handling then availability)

Your website is serving on-demand training videos to your workforce. Videos are uploaded monthly in high resolution MP4 format. Your workforce is distributed globally often on the move and using company-provided tablets that require the HTTP Live Streaming (HLS) protocol to watch a video. Your company has no video transcoding expertise and it required you might need to pay for a consultant. How do you implement the most cost-efficient architecture without compromising high availability and quality of video delivery? [PROFESSIONAL]
1. AWS Elemental MediaConvert to transcode original high-resolution MP4 videos to HLS. S3 to host videos with Lifecycle Management to archive original files to S3 Glacier Flexible Retrieval after a few days. CloudFront to serve HLS transcoded videos from S3. (MediaConvert replaces Elastic Transcoder (EOL Nov 2025) for high quality transcoding. S3 to host videos cheaply, Glacier for archives and CloudFront for high availability)
2. A video transcoding pipeline running on EC2 using SQS to distribute tasks and Auto Scaling to adjust the number of nodes depending on the length of the queue S3 to host videos with Lifecycle Management to archive all files to Glacier after a few days CloudFront to serve HLS transcoding videos from Glacier
3. AWS Elemental MediaConvert to transcode original high-resolution MP4 videos to HLS EBS volumes to host videos and EBS snapshots to incrementally backup original files after a few days. CloudFront to serve HLS transcoded videos from EC2.
4. A video transcoding pipeline running on EC2 using SQS to distribute tasks and Auto Scaling to adjust the number of nodes depending on the length of the queue. EBS volumes to host videos and EBS snapshots to incrementally backup original files after a few days. CloudFront to serve HLS transcoded videos from EC2
Note: Original question referenced Elastic Transcoder which reached End of Life on November 13, 2025. AWS Elemental MediaConvert is the replacement service. The architectural pattern remains the same.
To meet regulatory requirements, a pharmaceuticals company needs to archive data after a drug trial test is concluded. Each drug trial test may generate up to several thousands of files, with compressed file sizes ranging from 1 byte to 100MB. Once archived, data rarely needs to be restored, and on the rare occasion when restoration is needed, the company has 24 hours to restore specific files that match certain metadata. Searches must be possible by numeric file ID, drug name, participant names, date ranges, and other metadata. Which is the most cost-effective architectural approach that can meet the requirements? [PROFESSIONAL]
1. Store individual files in Amazon S3 Glacier, using the file ID as the archive name. When restoring data, query the Amazon Glacier vault for files matching the search criteria. (Individual files are expensive and does not allow searching by participant names etc)
2. Store individual files in Amazon S3, and store search metadata in an Amazon Relational Database Service (RDS) multi-AZ database. Create a lifecycle rule to move the data to Amazon S3 Glacier after a certain number of days. When restoring data, query the Amazon RDS database for files matching the search criteria, and move the files matching the search criteria back to S3 Standard class. (As the data is not needed can be stored to Glacier directly and the data need not be moved back to S3 standard)
3. Store individual files in Amazon S3 Glacier, and store the search metadata in an Amazon RDS multi-AZ database. When restoring data, query the Amazon RDS database for files matching the search criteria, and retrieve the archive name that matches the file ID returned from the database query. (Individual files and Multi-AZ is expensive)
4. First, compress and then concatenate all files for a completed drug trial test into a single Amazon S3 Glacier archive. Store the associated byte ranges for the compressed files along with other search metadata in an Amazon RDS database with regular snapshotting. When restoring data, query the database for files that match the search criteria, and create restored files from the retrieved byte ranges.
5. Store individual compressed files and search metadata in Amazon Simple Storage Service (S3). Create a lifecycle rule to move the data to Amazon S3 Glacier, after a certain number of days. When restoring data, query the Amazon S3 bucket for files matching the search criteria, and retrieve the file to S3 reduced redundancy in order to move it back to S3 Standard class. (Once the data is moved from S3 to Glacier the metadata is lost, as Glacier does not have metadata and must be maintained externally. Also S3 Reduced Redundancy Storage is no longer recommended.)
A document storage company is deploying their application to AWS and changing their business model to support both free tier and premium tier users. The premium tier users will be allowed to store up to 200GB of data and free tier customers will be allowed to store only 5GB. The customer expects that billions of files will be stored. All users need to be alerted when approaching 75 percent quota utilization and again at 90 percent quota use. To support the free tier and premium tier users, how should they architect their application? [PROFESSIONAL]
1. The company should utilize an Amazon Simple Workflow Service activity worker that updates the users data counter in Amazon DynamoDB. The activity worker will use Simple Email Service to send an email if the counter increases above the appropriate thresholds. (Note: For new implementations, AWS Step Functions with DynamoDB and SES would be the modern approach)
2. The company should deploy an Amazon Relational Database Service relational database with a store objects table that has a row for each stored object along with size of each object. The upload server will query the aggregate consumption of the user in question by first determining the files stored by the user, and then querying the stored objects table for respective file sizes and send an email via Amazon Simple Email Service if the thresholds are breached.
3. The company should write both the content length and the username of the files owner as S3 metadata for the object. They should then create a file watcher to iterate over each object and aggregate the size for each user and send a notification via Amazon Simple Queue Service to an emailing service if the storage threshold is exceeded.
4. The company should create two separated Amazon Simple Storage Service buckets one for data storage for free tier users and another for data storage for premium tier users. An Amazon Simple Workflow Service activity worker will query all objects for a given user based on the bucket the data is stored
Your company has been contracted to develop and operate a website that tracks NBA basketball statistics. Statistical data to derive reports like “best game-winning shots from the regular season” and more frequently built reports like “top shots of the game” need to be stored durably for repeated lookup. Leveraging social media techniques, NBA fans submit and vote on new report types from the existing data set so the system needs to accommodate variability in data queries and new static reports must be generated and posted daily. Initial research in the design phase indicates that there will be over 3 million report queries on game day by end users and other applications that use this application as a data source. It is expected that this system will gain in popularity over time and reach peaks of 10-15 million report queries of the system on game days. Select the answer that will allow your application to best meet these requirements while minimizing costs. [PROFESSIONAL]
1. Launch a multi-AZ MySQL Amazon Relational Database Service (RDS) Read Replica connected to your multi AZ master database and generate reports by querying the Read Replica. Perform a daily table cleanup.
2. Implement a multi-AZ MySQL RDS deployment and have the application generate reports from Amazon ElastiCache for in-memory performance results. Utilize the default expire parameter for items in the cache.
3. Generate reports from a multi-AZ MySQL Amazon RDS deployment and have an offline task put reports in Amazon Simple Storage Service (S3) and use CloudFront to cache the content. Use a TTL to expire objects daily. (Offline task with S3 storage and CloudFront cache)
4. Query a multi-AZ MySQL RDS instance and store the results in a DynamoDB table. Generate reports from the DynamoDB table. Remove stale tables daily.

References

58 thoughts on “AWS Storage Options – Whitepaper – Certification”

Wade Watts says:

July 28, 2016 at 8:26 pm

Number 8 is wrong the answer should be “D”
1. jayendrapatil says:
  
  July 28, 2016 at 9:02 pm
  
  Any reasoning ?
  Would still go with B, as it provides S3 storage for read only instead of the NFS servers providing scalability and elasticity
  Also, the data is backed up in off site tapes where Glacier can easily replace the functionality.
  1. Hemachandran says:
    
    September 6, 2016 at 2:37 pm
    
    In D they said Multicast. Multicast is not directly supported in public cloud.
    1. Kevin Wong says:
      
      March 27, 2018 at 7:43 pm
      
      Hema, do you means Q9? and do you means A is the answer if AWS public cloud not support Multi-case? 🙂
  2. Vijay says:
    
    August 9, 2018 at 10:34 pm
    
    In the BrainCert Exam – They are conceding option D. Because as the updates are made on the single file (temp). S3 is not a storage and not a efficient solution to do updates to a single file. can you confirm?
    
    EBS with Provisioned IOPS (PIOPS) to store I/O files SQS to distribute elaboration commands to a group of hosts working in parallel Auto Scaling to dynamically size the group to hosts depending on the length of the SQS queue.
    1. Arunkumar says:
      
      September 21, 2018 at 7:18 pm
      
      Did you clear the exams ?
Dung says:

August 5, 2016 at 9:55 pm

Question number 3 the answer should be B. You can host static website on S3 with Javascript SDK and that java script can write data to DynamoDB.

Option D. is technically right but it’s not a “cost minimum” option since “a set of servers get the vote from SQS and write to DynamoDB” is a waste.
1. jayendrapatil says:
  
  August 6, 2016 at 3:01 pm
  
  #B is not scalable as it can’t handle the rapid influx of traffic and is making a direct call to dynamodb. I would always go with #D with the queuing mechanism and scaling as per the queue size which will help me control the cost.
  1. Dung says:
    
    August 6, 2016 at 3:49 pm
    
    Option D. Lack of info about instance type and autoscaling configuration for app layer after the SQS.
    
    S3 and DynamoDB designed to be able to auto scale based on user usage. Java script clientside will populate data directly to/from DynamoDB after loaded on client.
    1. jayendrapatil says:
      
      August 6, 2016 at 7:53 pm
      
      The issue here is will it able to handle so many concurrent requests simultaneous without losing any data, which i doubt with a javascript app with DynamoDb would be able to handle. Would prefer to put the messages in SQS ensuring One time delivery so that there is no loss. There are never so specific details in the options unless thats what the target of the question is.
Sherief Melik says:

October 20, 2016 at 1:20 am

Question (2)
how this answer gives high availability and scalability?
1. jayendrapatil says:
  
  October 20, 2016 at 6:28 am
  
  DynamoDB provides high availability as it synchronously replicates data across three facilities within an AWS Region and scalability as it is designed to scale its provisioned throughput up or down while still remaining available
  
  More @ https://aws.amazon.com/dynamodb/faqs/#scale_anchor
  1. Sherief Melik says:
    
    October 20, 2016 at 6:30 am
    
    thanks for detailed answer 🙂
Uma C says:

November 18, 2016 at 11:57 pm

For Question 5: b, c, e is not the answer? as it is already running on extra large DB?

**Adding notification for the response to this question
1. jayendrapatil says:
  
  November 19, 2016 at 8:02 am
  
  From my perspective,
  A is about scaling out as the threshold is too low it is wasteful.
  B to cache the DB queries and improve read only performance
  C Cloudfront to cache static content and can be used to cache dynamic content as well. With Edge locations, it should take care of latency worldwide
  D RDS to extra Memory improving performance – Scale up
  E would require the complete replication of site in other region but latency based routing would not guarantee performance worldwide from all regions.
  1. Saul says:
    
    August 16, 2018 at 11:11 pm
    
    I will not agree. Independent if latency will guarantee performance or not, it will be lowing the response time for the closest clients. Doesn’t matter if the region is closest the first one or in another continent like us-east-1, someone close of this new environment will performe better experience than before
Uma C says:

November 20, 2016 at 6:20 am

Thank you for the response.
RajMan says:

March 6, 2017 at 10:46 pm

Question 9: I think Answer is A and not B. B states to backup RDS to glacier directly. I don’t think that is possible. Please clarify.
1. jayendrapatil says:
  
  March 8, 2017 at 9:33 pm
  
  Thanks Raj, corrected the answer.
RajMan says:

March 6, 2017 at 10:49 pm

Further, here is the link which states about being unable to backup RDS snapshot to glacier. http://superuser.com/questions/555929/moving-ebs-snapshots-to-glacier
Sam T says:

March 9, 2017 at 2:25 am

Q5 c. “Mobile application will send device location using SQS” – does not sound right.
A – webservices endpoint would be better?

Thanks
1. jayendrapatil says:
  
  March 9, 2017 at 1:07 pm
  
  Key point here is push notifications. AWS Mobile Push is supported through SNS to push the offers.
  Also, its an Mobile application which can send the location to either SQS or web service endpoint. SQS is just scalable and async to support 5M customers.
Sam T says:

March 9, 2017 at 9:45 pm

Jay – I don’t know how a mobile app will send/interact with SQS – likely thru a webservice endpoint- that is my point. SQS likely can stand behind the webservice end point.
Thx
1. jayendrapatil says:
  
  March 9, 2017 at 9:49 pm
  
  Mobile application with javascript sdk can easily push to SQS directly. Or an application with any other sdk with a role can push to SQS.
  1. Sam T says:
    
    March 10, 2017 at 8:37 am
    
    Jay – OK, yes that would work.
    Also C (unlike A) uses mobile push notifications – so is the better answer anyway.
    
    Thanks
Atit says:

March 20, 2017 at 6:55 pm

Hello, I think the answer to Q11. should be a,b & c.
1. jayendrapatil says:
  
  March 20, 2017 at 7:02 pm
  
  Multi AZ is not a scaling option, but only an High Availability option.
VN says:

May 11, 2017 at 5:54 am

For Question 16, isn’t READ REPLICA required along with Cloud Front, since it has to serve millions of fans.
1. jayendrapatil says:
  
  May 11, 2017 at 1:40 pm
  
  You can generate the reports once and store in S3. Once generate, only the report needs to be served, which can be done using CloudFront with caching and origin as S3.
KT says:

July 25, 2017 at 4:23 am

Hey jayendra
Answer for Q3 should be D, as the size preference data is 50k it will be very expensive to store data in dynamoDB. As per the docs “A unit of write capacity represents one write per second for items as large as 1 KB.If your items are larger than 1 KB in size, you will need to round the item size up to the next 1 KB boundary. For example, if your items are 1.5 KB and you want to do 10 writes per second, then you would need to provision 10 (writes per second) × 2 (1.5 KB rounded up to the next whole number) = 20 write capacity units.”

it will be cheaper to store data on S3.
Let me know what do you think?
1. jayendrapatil says:
  
  July 25, 2017 at 10:19 am
  
  Couple of things do not work in Option D’s favour.
  1. Access control, the Mobile Application needs to utilize STS, Web Identity Federation to access even DynamoDB. Which it mentions directly.
  2. Control of Authentication using S3 ACLs
  3. For the Cost, if you end up storing objects in S3 and data in DynamoDB. Remember you reduce DynamoDB cost, but pay for S3 storage, PUTs and GETs as well. So you are almost paying twice for same request. Also, with S3, it would not make sense if the data is changing and not just read only.
VSR says:

August 30, 2017 at 6:03 pm

Hi Jayendra,

for Q.6 –
Why isn’t this an answer ?
– Set up a second installation in another region, and use the Amazon Route 53 latency-based routing feature to select the right region.
1. jayendrapatil says:
  
  August 30, 2017 at 7:46 pm
  
  For E, even if you setup two regions the traffic for this two regions would improve and it would improve slightly for worldwide audience depending upon the region, which is not specified in the question. This solution also increases cost as you would need to have a identical setup.
  CloudFront can help in handling static and dynamic caching of resources, which would improve the times as well as the load on RDS is heaving so caching can improve the performance.
  1. Arthanari says:
    
    March 1, 2018 at 8:20 am
    
    Can you brief on why D is a suitable answer for question 6.
    1. jayendrapatil says:
      
      March 6, 2018 at 12:47 pm
      
      Its more of scale out or scale up. Other 2 options do not really have any benefit and would be costly.
UDP says:

March 2, 2018 at 11:16 pm

Jayendra,
For Q4, why is below answer not correct?

( B) Use CloudFront and the static website hosting feature of S3 with the Javascript SDK to call the Login With Amazon service to authenticate the user, use IAM Roles to gain permissions to a DynamoDB table to store the users vote.
1. jayendrapatil says:
  
  March 6, 2018 at 11:30 am
  
  The cost. If you have direct insertion into DynamoDB, the provisioned throughput required will be very high and hence the cost. SQS can help reduce the cost while providing loose coupling.
  1. Lakhan says:
    
    April 14, 2018 at 5:29 pm
    
    But in the Option D it is launching additional EC2 instances to do the ingestion job. Isn’t that adding to the cost?
    1. jayendrapatil says:
      
      April 16, 2018 at 7:29 pm
      
      that is scaled as per the load, so the costs are still minimal, which cannot be done with DynamoDB you pay for the provisioned throughput all the way and it can be quiet expensive.
Tanya Atanasova says:

March 13, 2018 at 9:11 pm

I think the answer to the question 15 is not a) but rather c). SWF will do this but there is no explanation of how total space taken by user will be recorded and aggregated. Your answer just simple states that email will be dispatched.

C) explains how the total file space will be kept up to date.
1. jayendrapatil says:
  
  March 14, 2018 at 10:44 pm
  
  Iterating over S3 is not a good option, as it does not return all the records and needs to be paginated. You need to inline increment the counter and define thresholds to trigger emails.
  1. Tanya Atanasova says:
    
    March 14, 2018 at 11:55 pm
    
    But a) explains none of that. S3 does return all the records with pagination. I politely disagree.
Mohd Akram says:

April 1, 2018 at 2:41 pm

Hello Sir,
In the last of some questions, you wrote professional, I just want to know that, these questions only for professional level or also for executive level?
1. jayendrapatil says:
  
  April 1, 2018 at 7:53 pm
  
  they are mainly professional level, with longer prose and long answer options. They are good for preparation, however if they seem too tough you can ignore.
  1. Mohd Akram says:
    
    April 1, 2018 at 9:20 pm
    
    Thank you so much
Rashmi Vijay says:

April 8, 2018 at 11:04 pm

Hi Jayendra, for Qtn 5 why dont we consider answer D

A large real-estate brokerage is exploring the option to adding a cost-effective location-based alert to their existing mobile application. The application backend infrastructure currently runs on AWS. Users who opt in to this service will receive alerts on their mobile device regarding real-estate offers in proximity to their location. For the alerts to be relevant delivery time needs to be in the low minute count. The existing mobile app has 5 million users across the US. Which one of the following architectural suggestions would you make to the customer? [PROFESSIONAL]
1. jayendrapatil says:
  
  April 11, 2018 at 8:30 pm
  
  You can send location using Mobile push, but use it to push to Mobile. Hence D is not an valid option.
Hussein says:

July 9, 2018 at 2:28 pm

Hi Jayendra, why do you think it’s not A?
1. jayendrapatil says:
  
  July 10, 2018 at 1:43 pm
  
  With A this is a cost involved with ELB and scaling and makes it a request/response model. SQS helps buffer the requests. Also AWS mobile push would be a better cost effective options then the third party provides.
  If available, always prefer an AWS service over the external services.
Hussein says:

July 9, 2018 at 2:29 pm

sorry .. this is for Qtn 5
Vijay says:

August 9, 2018 at 9:22 am

Q8) EBS would be preferred over S3 as the updated are made to single temp file. S3 is not a storage and not a efficient solution to do updates to a single file. So Option should be D ?
1. jayendrapatil says:
  
  August 14, 2018 at 5:59 pm
  
  The question is actually poorly framed or copied. S3 would not be suitable choice, However, the point is with EBS volumes the single file cannot be shared. It would be different file.
Ash_win says:

October 11, 2018 at 1:17 am

Hi – Quick thing, the reference to whitepaper link is broken. Could you please fix it.
1. jayendrapatil says:
  
  October 15, 2018 at 6:47 pm
  
  Thanks, changed the linked to active reference now.
Pasle Choix says:

December 20, 2019 at 8:11 am

Hi Jayendra:

For Q6:

(a) Lower the scale up trigger of your Auto Scaling group to 30% so it scales more aggressively.

Wouldn’t this adds more computing power to improve the page load?

Thank you.
1. jayendrapatil says:
  
  February 9, 2020 at 6:28 am
  
  The DB is short of resources and need to be improved. Instead of increasing EC2 capacity, it would be better to use CloudFront for worldwide users.
parzian parzian.com says:

July 3, 2020 at 4:36 am

Hey there ,How should I copy a wordpress theme that someone is using on their wordpress blog? gracias
1. jayendrapatil says:
  
  July 3, 2020 at 8:16 am
  
  you need to check the theme, and use that theme.