The Uptime Institute recently released its Annual Outage Analysis 2023 report. Overall, the report highlights the increasing costs, frequency, and duration of outages, the prominent role of cloud and digital services in outages, the shortcomings […]
IT outage times are rapidly increasing as businesses modernize to meet the needs of remote workers, accelerate their digitalization transformations, and adopt new microservices-based architectures and platforms. Research shows that mean time to recovery (MTTR) […]
As the cost of computerization and connectivity continues to plummet, the number of computers, servers, devices, and sensors continues to rapidly proliferate — and they are generating an unfathomable amount of telemetry data. Telemetry data […]
As you are likely already aware, on December 9, 2021, Apache disclosed that Log4j contains a critical vulnerability allowing for unauthenticated remote code execution. This vulnerability – CVE-2021-44228 – is also known as Log4Shell or […]
Latency measurements have become an important part of IT infrastructure and application monitoring. The latencies of a wide variety of events like requests, function calls, garbage collection, disk IO, system-call, CPU scheduling, etc. are of […]
One year ago this month, I wrote a post about how the COVID-19 pandemic was going to greatly accelerate the pace of global, digital transformation. How literally overnight we were being forced to find new […]
Introduction We recently surveyed 200 Kubernetes operators about their Kubernetes deployments, including their top challenges and goals as it relates to Kubernetes overall as well as monitoring specifically. Why? We wanted to better understand what […]
In the next decade, the world will produce an unfathomable amount of machine data — metrics, measurements, and telemetry data that is emitted from everything from servers to robots and satellites. And the pace of […]