Resilient Banking Systems: Applying SRE, Cloud Automation, and Observability at Scale
DOI:
https://doi.org/10.47392/Eclearnix.2026.B005Abstract
Resilient Banking Systems: Applying SRE, Cloud Automation, and
Observability at Scale
Objectives:
1. Understand the core principles of cloud-native architecture and their role in building scalable
and resilient banking systems.
2. Explore how Site Reliability Engineering (SRE) enhances reliability, availability, and
operational excellence in financial platforms.
3. Learn how cloud automation and Infrastructure as Code (IaC) improve efficiency,
consistency, and disaster recovery.
4. Examine observability practices that enable proactive monitoring, faster incident resolution,
and performance optimization.
5. Analyze integrated strategies combining SRE, automation, and observability to achieve end-
to-end resilience and compliance in banking systems.
Table of Contents
CHAPTER 1 The Fragility of Legacy Banking IT
CHAPTER 2 Defining Resilience in a Regulated World
CHAPTER 3 The Unified Framework: SRE, Cloud, and Observability
CHAPTER 4 SRE Fundamentals for Financial Services
CHAPTER 5 Structuring SRE Teams for Success in a Bank
CHAPTER 6 The Cloud Foundation: Architecture and Security
CHAPTER 7 Advanced Automation: CI/CD, Incident Response, and GameDays
CHAPTER 8 Data Resilience and Disaster Recovery
CHAPTER 9 Integrating Security into Resilience (DevSecOps)
CHAPTER 10 The Three Pillars of Observability
CHAPTER 11 From Data to Insight: Alerting, Dashboards, and AIOps
CHAPTER 12 Proactive Resilience Validation: Chaos Engineering
Downloads
Published
Issue
Section
License
Copyright (c) 2026 International Research Journal on Advanced Engineering and Management (IRJAEM)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
.