Control: AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures?
Distributed systems rely on communications networks to interconnect components (such as servers or services). Your workload must operate reliably despite data loss or latency in these networks. Each component of the distributed system must operate in a way that does not negatively impact other components or the workload. These best practices enable workloads to withstand stresses or failures, recover from them more quickly, and mitigate the impact of such impairments. The result is improved mean time to recovery (MTTR).
See "REL 05. How do you design interactions in a distributed system to mitigate or withstand failures?" in the AWS Well-Architected Framework for more information.
Resource Types
This control targets the following resource types:
Policies
The following policies can be used to configure this control:
This control type relies on these other policies when running actions:
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Implement graceful degradation to transform applicable hard dependencies into soft dependencies
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Throttle requests
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Control and limit retry calls
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Fail fast and limit queues
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Set client timeouts
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Make services stateless where possible
- AWS > Well-Architected Tool > AWS Well-Architected Framework > Reliability > REL 05. How do you design interactions in a distributed system to mitigate or withstand failures? > Implement emergency levers
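Several of the practices named in the policies above (control and limit retry calls, fail fast, set client timeouts) can be illustrated with a short client-side sketch. The Python below is only an illustration under stated assumptions: `call_with_retries`, `RetryBudgetExceeded`, and every default value are hypothetical names chosen for this example, not part of the Turbot mod or of any AWS API. It caps the number of attempts, uses capped exponential backoff with full jitter, and gives up early once an overall deadline would be exceeded.

```python
import random
import time


class RetryBudgetExceeded(Exception):
    """Raised when every attempt failed or the overall deadline would be exceeded."""


def call_with_retries(operation, max_attempts=3, base_delay=0.1, max_delay=2.0,
                      deadline_seconds=5.0, sleep=time.sleep, clock=time.monotonic):
    """Call `operation` with a bounded number of retries.

    Hypothetical helper: names and defaults are assumptions for illustration.
    """
    start = clock()
    last_error = None
    attempts = 0
    while attempts < max_attempts:
        attempts += 1
        try:
            return operation()
        except Exception as error:  # real code should catch only retryable errors
            last_error = error
        if attempts == max_attempts:
            break  # retry limit reached
        # Capped exponential backoff with full jitter: pick a random delay
        # between 0 and min(max_delay, base_delay * 2**(attempts - 1)).
        delay = random.uniform(0, min(max_delay, base_delay * 2 ** (attempts - 1)))
        # Fail fast: do not wait if the wait would push past the deadline.
        if clock() - start + delay > deadline_seconds:
            break
        sleep(delay)
    raise RetryBudgetExceeded(f"gave up after {attempts} attempt(s)") from last_error
```

A caller would wrap a single remote call, for example `call_with_retries(lambda: fetch_profile(user_id))`, where `fetch_profile` is a placeholder for whatever request the workload makes. Limiting attempts and adding jitter is one common way to keep retries from amplifying load on a dependency that is already impaired.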
Category
In Your Workspace
Developers
- Control type URI: tmod:@turbot/aws-wellarchitected-framework#/control/types/rel05
- Category URI: tmod:@turbot/turbot#/control/categories/cmdb
- CLI: turbot graphql controls --filter "controlTypeId:tmod:@turbot/aws-wellarchitected-framework#/control/types/rel05"
Get Controls