In the world of observability and monitoring, choosing the right tool can make or break your infrastructure management strategy. Whether you’re an SRE, DevOps engineer, or a cloud enthusiast, monitoring
AWS Monitoring/Remediation with CloudWatch, EventBridge and SSM RunbooksAWS Monitoring/Remediation with CloudWatch, EventBridge and SSM Runbooks
Introduction In any cloud environment, particularly one with mission-critical workloads, monitoring and logging are essential. These processes ensure system health, optimize performance, and automate responses to events. The focus of
Observability: Using Prometheus and GrafanaObservability: Using Prometheus and Grafana
Introduction The Importance of Observability in Modern Applications: In today’s digital landscape, applications are expected to perform seamlessly at all times. Downtime or performance degradation can lead to a loss
Ansible Automation to mitigate Crowdstrike kind of outagesAnsible Automation to mitigate Crowdstrike kind of outages
We all are very well aware of the major incident/outage due to Crowdstrike update which resulted in the unpleasant Blue Screen of Death (BSOD) problem on Windows machines. During this
Streamlining CI/CD Pipeline with AWS CodeCommit, CodePipelineStreamlining CI/CD Pipeline with AWS CodeCommit, CodePipeline
In our previous efforts, we utilized shell scripts to automate the creation of various AWS resources necessary for our web application deployment. While shell scripts served us well, we realized
Web application deployment using AWS PAAS & BeanstalkWeb application deployment using AWS PAAS & Beanstalk
Introduction Welcome to the first article in my DevOps & SRE learning series. Here, I’ll share my experiences and solutions using the STAR format (Situation, Task, Action, Result). Today, we’ll