How to Elevate From Basic to Advanced Infrastructure Monitoring

Times are changing fast, and technology continues to advance at an unrelenting pace. An explosion of systems and devices, complex architectures, pressure to deploy faster, and demand for optimal performance…

Learning from Failures: Better Crash Reporting for Better Incident Response

Crash events are one of the more serious problems that can occur when operating a service. Crashing components often cause cascading failures and service outages. To reveal the magnitude of…

Five Signs Your Monitoring Solution is Failing You

In a recent post, I talked about the strain placed on IT infrastructure by the current surge in demand for online services driven by the COVID-19 pandemic. I…

COVID-19 is Placing Tremendous Strain on Online Services, Making Analytics More Important than Ever in Driving Business Success

The Challenge: COVID-19 is impacting nearly every company around the world. While the pandemic is affecting companies in different ways and to different degrees, a commonality many are experiencing is…

Circonus Spring 2020 Release Includes Kubernetes Monitoring Solution

This week, we announced the availability of our Spring 2020 release. The highlight of the release is our Kubernetes monitoring solution, which provides health-based alerting and horizontal pod auto-scaling. Additional…

Monitoring Latency SLOs with Histograms and CAQL

Latency SLOs help us quantify the performance of an API endpoint over a period of time. A typical latency SLO reads as follows: The proportion of valid* requests served over…