

# Resources
<a name="resources-rel"></a>

## Change management
<a name="resources-rel-chg-mgmt"></a>

 **Hands-on labs:** 
+  [One Observability Workshop](https://catalog.workshops.aws/observability/en-US) 
+  [Viewing Amazon CloudWatch metrics with Amazon Managed Service for Prometheus and Amazon Managed Grafana](https://aws.amazon.com/blogs/mt/viewing-amazon-cloudwatch-metrics-with-amazon-managed-service-for-prometheus-and-amazon-managed-grafana/) 

 **Reference architecture:** 
+  [Guidance for Deep Application Observability on AWS](https://d1.awsstatic.com/solutions/guidance/architecture-diagrams/deep-application-observability-on-AWS.pdf) 

 **Videos:** 
+  [AWS Summit SF 2022 - Full-stack observability and application monitoring with AWS](https://www.youtube.com/watch?v=or7uFFyHIX0) 
+  [AWS Cloud Operations - How to](https://www.youtube.com/playlist?list=PLhr1KZpdzukdbisTs-Eskg4xsfLFOki1T) 
+  [Publishing custom metrics](https://docs.aws.amazon.com/AmazonCloudWatch/latest/monitoring/publishingMetrics.html) 
+  [Comprehensive observability for Amazon EKS](https://aws.amazon.com/blogs/mt/announcing-aws-observability-accelerator-to-configure-comprehensive-observability-for-amazon-eks/) 

 **Monitoring tools:** 
+  [AWS Observability Services](https://docs.aws.amazon.com/wellarchitected/latest/management-and-governance-guide/aws-observability-tools.html) 
+  [AWS observability repositories](https://github.com/aws-observability) 

 **Workload load and stress testing tools:** 
+  [Taurus](https://gettaurus.org/) 
+  [Apache JMeter](https://jmeter.apache.org/) 

 **Blogs:** 
+  [Ensure Optimal Application Performance with Distributed Load Testing on AWS](https://aws.amazon.com/blogs/architecture/ensure-optimal-application-performance-with-distributed-load-testing-on-aws/) 

 **Guides:** 
+  [Amazon EC2 Auto Scaling now gives recommendations about activating predictive scaling policy](https://aws.amazon.com/about-aws/whats-new/2023/01/amazon-ec2-auto-scaling-activating-predictive-scaling-policy/) 
+  [Step and simple scaling policies for Amazon EC2 Auto Scaling](https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-scaling-simple-step.html) 
+  [Target tracking scaling policies for Amazon EC2 Auto Scaling](https://docs.aws.amazon.com/autoscaling/ec2/userguide/as-scaling-target-tracking.html) 
+  [Scheduled scaling for Amazon EC2 Auto Scaling](https://docs.aws.amazon.com/autoscaling/ec2/userguide/ec2-auto-scaling-scheduled-scaling.html) 
+  [AWS services that you can use with Application Auto Scaling](https://docs.aws.amazon.com/autoscaling/application/userguide/integrated-services-list.html) 
+  [Using AWS Config managed rules with Audit Manager](https://docs.aws.amazon.com/audit-manager/latest/userguide/control-data-sources-config.html#aws-config-managed-rules) 
+  [Using AWS Config custom rules with Audit Manager](https://docs.aws.amazon.com/audit-manager/latest/userguide/control-data-sources-config.html#aws-config-custom-rules) 
+  [Troubleshooting AWS Config integration with Audit Manager](https://docs.aws.amazon.com/audit-manager/latest/userguide/control-data-sources-config.html#aws-config-rules-troubleshoot) 
+  [Distributed Load Testing on AWS](https://aws.amazon.com/solutions/implementations/distributed-load-testing-on-aws/) 

 **Amazon Builders' Library:** 
+  [Ensuring rollback safety during deployments](https://aws.amazon.com/builders-library/ensuring-rollback-safety-during-deployments/) 

## Failure management
<a name="resources-rel-failure-mgmt"></a>

 **Reference architecture:** 
+  [Data Protection Reference Architecture with AWS Backup](https://d1.awsstatic.com/architecture-diagrams/ArchitectureDiagrams/data-protection-with-aws-backup-ra.pdf?stod_bck4) 

 **Blogs:** 
+  [Best practices for data lake protection with AWS Backup](https://aws.amazon.com/blogs/storage/best-practices-for-data-lake-protection-with-aws-backup/) 

 **Guides:** 
+  [Automating backups with Backup plan](https://docs.aws.amazon.com/aws-backup/latest/devguide/creating-a-backup-plan.html) 
+  [Point-in-time recovery for DynamoDB](https://docs.aws.amazon.com/amazondynamodb/latest/developerguide/PointInTimeRecovery.html) 
+  [Feature availability by resource](https://docs.aws.amazon.com/aws-backup/latest/devguide/whatisbackup.html#features-by-resource) 
+  [Feature availability by AWS Region](https://docs.aws.amazon.com/aws-backup/latest/devguide/whatisbackup.html#features-by-region) 
+  [Security best practices in IAM](https://docs.aws.amazon.com/IAM/latest/UserGuide/best-practices.html) 
+  [Legal hold](https://docs.aws.amazon.com/aws-backup/latest/devguide/legalhold.html) 
+  [AWS Backup Vault Lock](https://docs.aws.amazon.com/aws-backup/latest/devguide/vault-lock.html) 
+  [Audit backups and create reports with AWS Backup Audit Manager](https://docs.aws.amazon.com/aws-backup/latest/devguide/aws-backup-audit-manager.html) 
+  [What Is AWS Backup?](https://docs.aws.amazon.com/aws-backup/latest/devguide/whatisbackup.html)
+  [Data protection in AWS Backup](https://docs.aws.amazon.com/aws-backup/latest/devguide/data-protection.html) 
+  [Encryption for backups in AWS Backup](https://docs.aws.amazon.com/aws-backup/latest/devguide/encryption.html) 
+  [Reliability Pillar: AWS Well-Architected](https://docs.aws.amazon.com/wellarchitected/latest/reliability-pillar/welcome.html) 

 **Infrastructure map:** 
+  [AWS Global Infrastructure](https://aws.amazon.com/about-aws/global-infrastructure/) 

 **Videos:** 
+  [AWS Summit ANZ 2021 - Everything fails, all the time: Designing for resilience](https://www.youtube.com/watch?v=wUzSeSfu1XA) 

 **Whitepapers:** 
+  [AWS Fault Isolation Boundaries](https://docs.aws.amazon.com/whitepapers/latest/aws-fault-isolation-boundaries/abstract-and-introduction.html) 
+  [Choose the right Amazon RDS deployment option: Single-AZ instance, Multi-AZ instance, or Multi-AZ database cluster](https://aws.amazon.com/blogs/database/choose-the-right-amazon-rds-deployment-option-single-az-instance-multi-az-instance-or-multi-az-database-cluster/) 
+  [AWS Auto Scaling: How Scaling Plans Work](https://docs.aws.amazon.com/autoscaling/plans/userguide/how-it-works.html) 
+  [Implementing Microservices on AWS](https://docs.aws.amazon.com/whitepapers/latest/microservices-on-aws/introduction.html) 
+  [Disaster Recovery of Workloads on AWS: Recovery in the Cloud](https://docs.aws.amazon.com/whitepapers/latest/disaster-recovery-workloads-on-aws/disaster-recovery-workloads-on-aws.html) 

 **Workshops:** 
+  [AWS Well-Architected Reliability Labs](https://wellarchitectedlabs.com/Reliability/) 
+  [Advanced Multi-AZ Resilience Patterns](https://catalog.workshops.aws/multi-az-gray-failures/en-US/workshop-overview) 
+  [Level 200: Testing Backup and Restore of Data](https://wellarchitectedlabs.com/reliability/200_labs/200_testing_backup_and_restore_of_data/) 

 **Amazon Builders' Library:** 
+  [The Amazon Builders' Library: How Amazon builds and operates software](https://aws.amazon.com/builders-library/) 

 **Books:** 
+  Robert S. Hanmer, *[Patterns for Fault Tolerant Software](https://www.amazon.com/Patterns-Fault-Tolerant-Software-Wiley-ebook/dp/B00DXK33SK/)* 
+  Andrew Tanenbaum and Maarten van Steen, *[Distributed Systems: Principles and Paradigms](https://www.amazon.com/Distributed-Systems-Principles-Paradigms-2nd/dp/0132392275/)* 