AWS RDS Replication – Multi-AZ vs Read Replica

RDS Multi-AZ vs Read Replica

RDS DB instances replicas can be created in two ways Multi-AZ & Read Replica, which provide high availability, durability, and scalability to RDS.

RDS Multi-AZ vs Read Replica

Purpose

  • Multi-AZ DB Instance deployments provide high availability, durability, and automatic failover support.
  • Read replicas enable increased scalability and database availability in the case of an AZ failure. Read Replicas allow elastic scaling beyond the capacity constraints of a single DB instance for read-heavy database workloads

RDS Read Replicas vs Multi-AZ

Region & Availability Zones

  • RDS Multi-AZ deployment automatically provisions and manages a standby instance in a different AZ (independent infrastructure in a physically separate location) within the same AWS region.
  • RDS Read Replicas can be provisioned within the same AZ, Cross-AZ or even as a Cross-Region replica.

Replication Mode

  • RDS Multi-AZ deployment manages a synchronous standby instance in a different AZ
  • RDS Read Replicas has the data replicated asynchronously from the Primary instance to the read replicas

Standby Instance can Accept Reads

  • Multi-AZ DB instance deployment is a high-availability solution and the standby instance does not support requests.
  • Read Replica deployment provides readable instances to increase application read-throughput.

Automatic Failover & Failover Time

  • Multi-AZ DB instance deployment performs an automatic failover to the standby instance without administrative intervention, and the failover time can be up to 120 seconds based on the crash recovery.
    • Planned database maintenance
    • Software patching
    • Rebooting the Primary instance with failover
    • Primary DB instance connectivity or host failure, or an
    • Availability Zone failure
  • RDS maintains the same endpoint for the DB Instance after a failover, so the application can resume database operation without the need for manual administrative intervention.
  • Read Replica deployment does not provide automatic failover. Read Replica instance needs to be manually promoted to a Standalone instance.

Upgrades

  • For a Multi-AZ deployment, Database engine version upgrades happen on the Primary instance.
  • For Read Replicas, the Database engine version upgrade is independent of the Primary instance.

Automated Backups

  • Multi-AZ deployment has the Automated Backups taken from the Standby instance
  • Read Replicas do not have any backups configured, by default.

AWS Certification Exam Practice Questions

  • Questions are collected from Internet and the answers are marked as per my knowledge and understanding (which might differ with yours).
  • AWS services are updated everyday and both the answers and questions might be outdated soon, so research accordingly.
  • AWS exam questions are not updated to keep up the pace with AWS updates, so even if the underlying feature has changed the question might not be updated
  • Open to further feedback, discussion and correction.
  1. You are running a successful multi-tier web application on AWS and your marketing department has asked you to add a reporting tier to the application. The reporting tier will aggregate and publish status reports every 30 minutes from user-generated information that is being stored in your web applications database. You are currently running a Multi-AZ RDS MySQL instance for the database tier. You also have implemented ElastiCache as a database caching layer between the application tier and database tier. Please select the answer that will allow you to successfully implement the reporting tier with as little impact as possible to your database.
    1. Continually send transaction logs from your master database to an S3 bucket and generate the reports of the S3 bucket using S3 byte range requests.
    2. Generate the reports by querying the synchronously replicated standby RDS MySQL instance maintained through Multi-AZ (Standby instance cannot be used as a scaling solution)
    3. Launch an RDS Read Replica connected to your Multi-AZ master database and generate reports by querying the Read Replica.
    4. Generate the reports by querying the ElastiCache database caching tier. (ElasticCache does not maintain full data and is simply a caching solution)
  2. A company is deploying a new two-tier web application in AWS. The company has limited staff and requires high availability, and the application requires complex queries and table joins. Which configuration provides the solution for the company’s requirements?
    1. MySQL Installed on two Amazon EC2 Instances in a single Availability Zone (does not provide High Availability out of the box)
    2. Amazon RDS for MySQL with Multi-AZ
    3. Amazon ElastiCache (Just a caching solution)
    4. Amazon DynamoDB (Not suitable for complex queries and joins)
  3. Your company is getting ready to do a major public announcement of a social media site on AWS. The website is running on EC2 instances deployed across multiple Availability Zones with a Multi-AZ RDS MySQL Extra Large DB Instance. The site performs a high number of small reads and writes per second and relies on an eventual consistency model. After comprehensive tests you discover that there is read contention on RDS MySQL. Which are the best approaches to meet these requirements? (Choose 2 answers)
    1. Deploy ElastiCache in-memory cache running in each availability zone
    2. Implement sharding to distribute load to multiple RDS MySQL instances (this is only a read contention, the writes work fine)
    3. Increase the RDS MySQL Instance size and Implement provisioned IOPS (not scalable, this is only a read contention, the writes work fine)
    4. Add an RDS MySQL read replica in each availability zone
  4. Your company has HQ in Tokyo and branch offices all over the world and is using logistics software with a multi-regional deployment on AWS in Japan, Europe and US. The logistic software has a 3-tier architecture and currently uses MySQL 5.6 for data persistence. Each region has deployed its own database. In the HQ region you run an hourly batch process reading data from every region to compute cross-regional reports that are sent by email to all offices this batch process must be completed as fast as possible to quickly optimize logistics. How do you build the database architecture in order to meet the requirements?
    1. For each regional deployment, use RDS MySQL with a master in the region and a read replica in the HQ region
    2. For each regional deployment, use MySQL on EC2 with a master in the region and send hourly EBS snapshots to the HQ region
    3. For each regional deployment, use RDS MySQL with a master in the region and send hourly RDS snapshots to the HQ region
    4. For each regional deployment, use MySQL on EC2 with a master in the region and use S3 to copy data files hourly to the HQ region
    5. Use Direct Connect to connect all regional MySQL deployments to the HQ region and reduce network latency for the batch process
  5. What would happen to an RDS (Relational Database Service) Multi-Availability Zone deployment if the primary DB instance fails?
    1. IP of the primary DB Instance is switched to the standby DB Instance.
    2. A new DB instance is created in the standby availability zone.
    3. The canonical name record (CNAME) is changed from primary to standby.
    4. The RDS (Relational Database Service) DB instance reboots.
  6. Your business is building a new application that will store its entire customer database on a RDS MySQL database, and will have various applications and users that will query that data for different purposes. Large analytics jobs on the database are likely to cause other applications to not be able to get the query results they need to, before time out. Also, as your data grows, these analytics jobs will start to take more time, increasing the negative effect on the other applications. How do you solve the contention issues between these different workloads on the same data?
    1. Enable Multi-AZ mode on the RDS instance
    2. Use ElastiCache to offload the analytics job data
    3. Create RDS Read-Replicas for the analytics work
    4. Run the RDS instance on the largest size possible
  7. Will my standby RDS instance be in the same Availability Zone as my primary?
    1. Only for Oracle RDS types
    2. Yes
    3. Only if configured at launch
    4. No
  8. Is creating a Read Replica of another Read Replica supported?
    1. Only in certain regions
    2. Only with MySQL based RDS
    3. Only for Oracle RDS types
    4. No
  9. A user is planning to set up the Multi-AZ feature of RDS. Which of the below mentioned conditions won’t take advantage of the Multi-AZ feature?
    1. Availability zone outage
    2. A manual failover of the DB instance using Reboot with failover option
    3. Region outage
    4. When the user changes the DB instance’s server type
  10. When you run a DB Instance as a Multi-AZ deployment, the “_____” serves database writes and reads
    1. secondary
    2. backup
    3. stand by
    4. primary
  11. When running my DB Instance as a Multi-AZ deployment, can I use the standby for read or write operations?
    1. Yes
    2. Only with MSSQL based RDS
    3. Only for Oracle RDS instances
    4. No
  12. Read Replicas require a transactional storage engine and are only supported for the _________ storage engine
    1. OracleISAM
    2. MSSQLDB
    3. InnoDB
    4. MyISAM
  13. A user is configuring the Multi-AZ feature of an RDS DB. The user came to know that this RDS DB does not use the AWS technology, but uses server mirroring to achieve replication. Which DB is the user using right now?
    1. MySQL
    2. Oracle
    3. MS SQL
    4. PostgreSQL
  14. If I have multiple Read Replicas for my master DB Instance and I promote one of them, what happens to the rest of the Read Replicas?
    1. The remaining Read Replicas will still replicate from the older master DB Instance
    2. The remaining Read Replicas will be deleted
    3. The remaining Read Replicas will be combined to one read replica
  15. If you have chosen Multi-AZ deployment, in the event of a planned or unplanned outage of your primary DB Instance, Amazon RDS automatically switches to the standby replica. The automatic failover mechanism simply changes the ______ record of the main DB Instance to point to the standby DB Instance.
    1. DNAME
    2. CNAME
    3. TXT
    4. MX
  16. When automatic failover occurs, Amazon RDS will emit a DB Instance event to inform you that automatic failover occurred. You can use the _____ to return information about events related to your DB Instance
    1. FetchFailure
    2. DescriveFailure
    3. DescribeEvents
    4. FetchEvents
  17. The new DB Instance that is created when you promote a Read Replica retains the backup window period.
    1. TRUE
    2. FALSE
  18. Will I be alerted when automatic failover occurs?
    1. Only if SNS configured
    2. No
    3. Yes
    4. Only if Cloudwatch configured
  19. Can I initiate a “forced failover” for my MySQL Multi-AZ DB Instance deployment?
    1. Only in certain regions
    2. Only in VPC
    3. Yes
    4. No
  20. A user is accessing RDS from an application. The user has enabled the Multi-AZ feature with the MS SQL RDS DB. During a planned outage how will AWS ensure that a switch from DB to a standby replica will not affect access to the application?
    1. RDS will have an internal IP which will redirect all requests to the new DB
    2. RDS uses DNS to switch over to standby replica for seamless transition
    3. The switch over changes Hardware so RDS does not need to worry about access
    4. RDS will have both the DBs running independently and the user has to manually switch over
  21. Which of the following is part of the failover process for a Multi-AZ Amazon Relational Database Service (RDS) instance?
    1. The failed RDS DB instance reboots.
    2. The IP of the primary DB instance is switched to the standby DB instance.
    3. The DNS record for the RDS endpoint is changed from primary to standby.
    4. A new DB instance is created in the standby availability zone.
  22. Which of these is not a reason a Multi-AZ RDS instance will failover?
    1. An Availability Zone outage
    2. A manual failover of the DB instance was initiated using Reboot with failover
    3. To autoscale to a higher instance class (Refer link)
    4. Master database corruption occurs
    5. The primary DB instance fails
  23. You need to scale an RDS deployment. You are operating at 10% writes and 90% reads, based on your logging. How best can you scale this in a simple way?
    1. Create a second master RDS instance and peer the RDS groups.
    2. Cache all the database responses on the read side with CloudFront.
    3. Create read replicas for RDS since the load is mostly reads.
    4. Create a Multi-AZ RDS installs and route read traffic to standby.
  24. How does Amazon RDS multi Availability Zone model work?
    1. A second, standby database is deployed and maintained in a different availability zone from master, using synchronous replication. (Refer link)
    2. A second, standby database is deployed and maintained in a different availability zone from master using asynchronous replication.
    3. A second, standby database is deployed and maintained in a different region from master using asynchronous replication.
    4. A second, standby database is deployed and maintained in a different region from master using synchronous replication.
  25. A customer is running an application in US-West (Northern California) region and wants to setup disaster recovery failover to the Asian Pacific (Singapore) region. The customer is interested in achieving a low Recovery Point Objective (RPO) for an Amazon RDS multi-AZ MySQL database instance. Which approach is best suited to this need?
    1. Synchronous replication
    2. Asynchronous replication
    3. Route53 health checks
    4. Copying of RDS incremental snapshots
  26. A user is using a small MySQL RDS DB. The user is experiencing high latency due to the Multi AZ feature. Which of the below mentioned options may not help the user in this situation?
    1. Schedule the automated back up in non-working hours
    2. Use a large or higher size instance
    3. Use PIOPS
    4. Take a snapshot from standby Replica
  27. Are Reserved Instances available for Multi-AZ Deployments?
    1. Only for Cluster Compute instances
    2. Yes for all instance types
    3. Only for M3 instance types
  28. My Read Replica appears “stuck” after a Multi-AZ failover and is unable to obtain or apply updates from the source DB Instance. What do I do?
    1. You will need to delete the Read Replica and create a new one to replace it.
    2. You will need to disassociate the DB Engine and re-associate it.
    3. The instance should be deployed to Single AZ and then moved to Multi-AZ once again
    4. You will need to delete the DB Instance and create a new one to replace it.
  29. What is the charge for the data transfer incurred in replicating data between your primary and standby?
    1. No charge. It is free.
    2. Double the standard data transfer charge
    3. Same as the standard data transfer charge
    4. Half of the standard data transfer charge
  30. A user has enabled the Multi-AZ feature with the MS SQL RDS database server. Which of the below mentioned statements will help the user understand the Multi-AZ feature better?
    1. In a Multi-AZ, AWS runs two DBs in parallel and copies the data asynchronously to the replica copy
    2. In a Multi-AZ, AWS runs two DBs in parallel and copies the data synchronously to the replica copy
    3. In a Multi-AZ, AWS runs just one DB but copies the data synchronously to the standby replica
    4. AWS MS SQL does not support the Multi-AZ feature
  31. A company is running a batch analysis every hour on their main transactional DB running on an RDS MySQL instance to populate their central Data Warehouse running on Redshift. During the execution of the batch their transactional applications are very slow. When the batch completes they need to update the top management dashboard with the new data. The dashboard is produced by another system running on-premises that is currently started when a manually sent email notifies that an update is required The on-premises system cannot be modified because is managed by another team. How would you optimize this scenario to solve performance issues and automate the process as much as possible?
    1. Replace RDS with Redshift for the batch analysis and SNS to notify the on-premises system to update the dashboard
    2. Replace RDS with Redshift for the batch analysis and SQS to send a message to the on-premises system to update the dashboard
    3. Create an RDS Read Replica for the batch analysis and SNS to notify me on-premises system to update the dashboard
    4. Create an RDS Read Replica for the batch analysis and SQS to send a message to the on-premises system to update the dashboard.

39 thoughts on “AWS RDS Replication – Multi-AZ vs Read Replica

  1. Question (2)
    i am not sure about the answer cause Multi-AZ only without specifying that it will be a read replica will not help, i thought (c. Amazon ElastiCache) would be better answer , what do you think ?

    1. Key point in the questions is
      1. limited staff and requires high availability,
      2. application requires complex queries and table joins

      #A is ruled out cause of 1, as it would need manual failover
      #C is rules out cause its not a DB but a caching solution
      #D is rules out cause of 2
      #B is the correct option as Multi AZ provides complex queries and table joins with ability to failover without manual intervention

  2. Excellent article with real-life scenarios and explainations right to the point. It sums up more than a whole semester in our colleges these days (Germany). Thank you for this high-quality information, I will use it in the next few weeks when I’ll be building a data-warehouse & a report-section for one of my bigger clients. It was a pleasure to read – thank you very much!

    1. Elastic cache is for mere caching of the query results for improving performance by not hitting the database.
      Whereas the Analytics job would require actual data to perform the analysis on but mostly in read only mode

  3. Postgres now supporting cross region read replicas, as of June 2016. But … when will exams reflect this?

    1. Thanks for update, will update the answer. However, the exam is slow to reflect any updates.

  4. FROM : https://aws.amazon.com/rds/faqs/

    Q: Will I be alerted when automatic failover occurs?

    Yes, Amazon RDS will emit a DB Instance event to inform you that automatic failover occurred. You can use the DescribeEvents to return information about events related to your DB Instance, or click the “DB Events” section of the AWS Management Console.

    1. Automatic failover alerts are possible, however need to setup SNS, SMS or email for notification

  5. For question (3), I believe that answers are A and C. How does read replica helps for Read and Writes? Thx.

  6. Hey..

    I have a postgres in an ec2 on AWS . Now we are planing to move on RDS . & we do not want any downtime . We were planning on postgres replication between the post gres on ec2 & RDS postgres . Is this possible ? If yes can you give me pointers. ?

    1. I would prefer to create a database backup and import it directly to the RDS database. Is the database a production database ? Whats the size ? as the operation can be performed with minimal downtime given both the resources are within AWS.

      1. Yes its a prod database . The size is around 500gb . for that i will have to take incremental backups..which i don’t want to.

        AWS migration tools is there vut the source version must be 9.4 or above we r on 9.3

  7. Can someone explain me Q 25

    RPO diff between asynchronous replication or DB snapshot ?

    We can also minimizes RPO by frequent snapshots as replication also has some lag depend upon various factor like network available etc.

    1. As per my undetstanding,
      With asynchronous replication, the RPO is very very low, it should, in fact RDS should start the asynchronous process almost instantaneously when master transaction is complete.
      Where as with frequent snapshots as you mentioned, you can only reduce the RPO, but in practise asynchronous replication would win over frequent snapshots.

  8. #29 is marked with two (2) answers and I am thinking it should just be A as Amazon doesn’t charge for replication bandwidth…was there a typo or did i miss something?

    Thanks,
    Prince

    1. Thanks Prince, yup its by error. Its A as it the data transfer between primary and standby is free.

  9. Jayendra – A stupendous effort on your part to maintain this blog. Hats off to you.

    #25. For multi-AZ scenario, the replication mode would be Synchronous and asynchronous is typically associated with Read Replicas. The answer should be a and not b.
    Am i missing something here?

    Regards
    Anand

    1. Multi-AZ does not work across Region, so you have to setup up Read Replica across Region, which in this would need asynchronous replication.

      1. Hello Jayendra,

        Q says: ‘The customer is interested in achieving a low Recovery Point Objective (RPO) for an Amazon RDS multi-AZ MySQL database instance.’ Not cross-region. So, shouldn’t the answer be synchronous here?

        If we are to consider 1rst statement which talks about cross-region and ignore multi-az, then wouldn’t read replica (being asynchronous in nature) fall behind because we have different regions. So, in this case ‘snapshots’ might make more sense.

        The wording is a little confusing. Can you please explain your approach to this q?

        Thanks,
        Manisha

        1. Read Replica can be created across region for MySQL, even though it can fall behind but it would still have the lowest RPO. Also, Read Replica can be easily converted to Master/Primary database and be ready to use.
          Snapshots are taken only daily, during the backup window which would need to be copied over. Also, they are supplemented by transaction logs for point in time recovery, which would not work across region.

  10. Hello Jayendra,

    Can you please explain for Q 4, why is A the right answer.

    Here is my thought process. I’m trying to eliminate options.

    B and D gets eliminated because you always to use managed services db over creating your own DB on EBS.

    A – Read Replica are asynchronous by nature. So over time, replicas can lag from the original db and thus you cannot guarantee the accuracy of the reports.
    So A gets eliminated.

    C – If you send snapshots, there is time spent in extracting data from snapshots (which may be minimal). So if I have to compare C and E, I would choose between C and E.

    I think I’m overthinking the problem. Anyway, hope to hear back from you.

    Regards,
    Manisha

    1. The key point here is to reduce the time. Hence A as you can create a Cross Region Read replica in the Tokyo HQ for DB in each region and use them to compute reports as the Read Replica will always almost have the latest data.
      Option B, C & D are wrong as it would take time for data, snapshots to be transferred and restored to perform the batch analysis
      Option E is wrong as there Direct Connect would be expensive and take time to setup.

      1. Thanks. Well I think setup time cost should not be the point here, what is asking here is “this batch process must be completed as fast as possible to quickly optimize logistics.”, so it is not about the initial setup, it is about how to implement the right solution to deliver the “as fast as possible” service AFTER the setup

        I am open to discussion but the answer is absolutely E in my understanding.

        1. Direct Connect is usually to connect your data center to AWS, usually within the same region. In this case, the setup is already available in AWS but across multiple regions, so there is no need for Direct Connect. Also, it does not mention if the instances in other regions are read replicas or active-active.

    1. Thanks Amit, that right. Vertical scaling does result in automatic failover although the scaling is first performed on the standby and once successful the failover happens. Have corrected the question.

  11. Hello jayendrapatil,

    If we are migrating a 4 X DB server to RDS (1 master, 3 salves, each with 100GB storage and there will be 1 TB analytics data), what will be the best architechture to migrate to RDS? Does the below make sense to you?

    1 Multi-AZ DB instance with 100 GB storage. (db.m5.2xlarge)

    1 Single-AZ DB read replica in the same AZ as the master DB (upgrade the storage to 1TB, db.m5.2xlarge)

    2 Single-AZ DB rea replica with 100GB (in different regions)

    Thank you.

    Nick HE

  12. great resource, keep up the good work, you ae to be commended, the subject matter is huge, so any coverage and to this depth too, is an great help

  13. please update your content as read replica is available for SQL SERVER and AMAZON AURORA as well.

  14. Hi sir, pls answer this..

    A company wants to increase the availability and durability of a critical business application. The application currently uses a MySQL database running on anAmazon EC2 instance. The company wants to minimize application changes. How should the company meet these requirements?
    * A. Shut down the EC2 instance. Enable multi-AZ replication within the EC2 instance, then restart the instance.
    * B. Launch a secondary EC2 instance running MySQL. Configure a cron job that backs up the database on the primary EC2 instance and copies it to the secondary instance every 30 minutes.
    * C. Migrate the database to an Amazon RDS Aurora DB instance and create a Read Replica in another Availability Zone.
    * D. Create an Amazon RDS Microsoft SQL DB instance and enable multi-AZ replication. Back up the existing data and import it into the new database.

    1. Option C would work fine with any application changes and increasing the availability and durability.

Comments are closed.