REL 1: How do you manage service quotas and constraints?

For cloud-based workload architectures, there are service quotas (which are also referred to as service limits). These quotas exist to prevent accidentally provisioning more resources than you need and to limit request rates on API operations so as to protect services from abuse. There are also resource constraints, for example, the rate that you can push bits down a fiber-optic cable, or the amount of storage on a physical disk.

Resources

AWS Live re:Inforce 2019 - Service Quotas
What Is Service Quotas?
AWS Service Quotas (formerly referred to as service limits)
Amazon EC2 Service Limits
AWS Trusted Advisor Best Practice Checks (see the Service Limits section)
AWS limit monitor on AWS answers
AWS Marketplace: CMDB products that help track limits
APN Partner: partners that can help with configuration management

Best Practices:

Improvement Plan

Aware of service quotas and constraints

  • Identify service quotas across all relevant accounts, Regions, and Availability Zones: The limits are scoped to account and Region.
    What is Service Quotas?
  • Manage service quotas across accounts and regions

  • Identify service quotas across all relevant accounts, Regions, and Availability Zones: The limits are scoped to account and Region.
    What is Service Quotas?
  • Accommodate fixed service quotas and constraints through architecture

  • Be aware of fixed service quotas: Be aware of fixed service quotas and constraints and architect around these.
    AWS Service Quotas
  • Monitor and manage quotas

  • Monitor and manage your quotas: Evaluate your potential usage on AWS, increase your regional service quotas appropriately, and allow planned growth in usage.
  • Automate quota management

  • Set up automated monitoring: Implement tools using SDKs to alert you when thresholds are being approached.
  • Ensure that a sufficient gap exists between the current quotas and the maximum usage to accommodate failover

  • Ensure that there is a sufficient gap between your service quota and your maximum usage to accommodate for a failover