devops 13
- Incident Management and Escalation Handling: Keeping Systems Reliable
- Monitoring CPU Usage with Python for System Reliability
- Managing Infrastructure with Ansible
- Automating a simple CI/CD Pipeline with GitHub Actions
- A Simple AWS Setup with Terraform
- Understanding File Systems in Modern Infrastructure: Beyond Symlinks and Hardlinks
- SRE Foundations to Production: Securing Postgres on K8s (Part 7)
- SRE Foundations to Production: Advanced Monitoring Setup (Part 6)
- SRE Foundations to Production: Scaling and Load Testing (Part 5)
- SRE Foundations to Production: SQLite, Loki, and Dashboards (Part 4)
- SRE Foundations to Production: Alerting and EC2 Deployment (Part 3)
- SRE Foundations to Production: Grafana Metrics (Part 2)
- SRE Foundations to Production: Monitorable Flask App Setup (Part 1)