DevOps Newsletter 258
In case you missed it. Here’s our new Ops companion app. It’s very useful. It’s very free. In case you missed it. Here’s our new Ops companion app. It’s very useful. It’s very free. We switched to...
View ArticleThe Hardest Hire: Technical Support
The problem with articulating a compelling vision for technical support, is that nobody believes you anymore. “Your call is important to us?” No, it’s not. Most support experiences remind us just how...
View ArticleDevOps Newsletter 259
How much is Spotify paying @googlecloud ? Splunk vs ELK: The Log Management Tools Decision Making Guide SSD reliability in the real world: Google’s experience Machine Learning: The High-Interest...
View ArticleHow to Monitor Nagios itself for free
So, you run your own monitoring? Nice! But how do you monitor your monitoring? Who will alert you when your monitoring service is down? We wanted to encourage all Nagios and Icinga administrators out...
View ArticleMonitoring meta-Monitoring
Our rallying cry to server monitoring users out there is deceptively simple. Spend more time with your customers. Spend more time building your business. Spend more time with family. Leave server...
View ArticleDevOps Newsletter 260
How to monitor #Nagios itself for free. #Icinga too! How Badoo saved one million dollars switching to PHP7 The absolute horror of WiFi light switches Managing two million web servers Google Blocking...
View ArticleDevOps Newsletter 261
Lessons Learned From A Year Of Elasticsearch In Production Facebook’s new front-end server design delivers on performance without sucking up power JPMorgan Algorithm Knows You’re a Rogue Employee...
View ArticleOps love APIs: Here is Why
Inspired . . . Delighted . . . Hooked. We always run out of superlatives when we talk about APIs. Wait a minute, you say, haven’t APIs been around for awhile? Can’t you pick any of the newer...
View ArticleDevOps Newsletter 262
An introduction to Tmux d’Oh My Zsh – How I unexpectedly built a monster of an open source project CSC’s suffered the largest unplanned outage in years but 1.7PB & 850 million files were recovered...
View ArticleWhat we learned at IncontroDevOps
Last Friday I attended IncontroDevOps, one of the most prominent DevOps events in Italy. It was a great opportunity to vocalise our thoughts about the human side of operations (checklists, incidents,...
View ArticleIPv6 Support is Here
The phone numbers of the Internet are changing. Slowly but surely the interweb is moving away from the old Internet Protocol, IPv4, to the newer one, IPv6. Operative word: slowly. At a rate of 4% a...
View ArticleDevOps Newsletter 263
Micro-services for performance Almost everyone is doing the API economy wrong AWS Networking, Environments and You Moore’s law really is dead this time Lessons from Building a Node App in Docker...
View ArticleBeyond servers: How we monitor energy consumption
Running an office is very similar to running a SaaS infrastructure. We need uninterrupted power supply so we can get things done, but we also want the smallest possible carbon footprint so we can be...
View ArticleDevOps Newsletter 264
How Buffer Saved $132k a Year With an IT Infrastructure Audit Immutability is not enough Containers in Production: Case Studies When you run your own monitoring, who is monitoring your monitoring?...
View Article5 Tips to Being a Better Writer
Less talking please. Why? What’s wrong with talking? There’s nothing inherently wrong with it. Humans need to talk. That’s how we understand each other. The best predictor [so far] of empathic accuracy...
View ArticleDevOps Newsletter 265
Why aren’t we using SSH for everything? My approach at making AWS EC2 affordable: Automatic replacement of Autoscaling nodes with equivalent spot instances Challenges of micro-service deployments...
View ArticleReal Cloud Monitoring
In 2012, a storm took down one of AWS datacenters in Virginia. The ELB service stopped working and, as a result, the likes of Netflix, Pinterest and Instagram went offline. In the same year, hurricane...
View ArticleHumanOps: Making Operations Human
What is the number one sysadmin skill? The ability to problem solve, right? We’re not talking about sudoku and crosswords here. Errors and delays can cost millions. With scale comes complexity, and an...
View ArticleLaunching Sparklines for iOS
Want people to understand you? Tell them a story. Stories are narratives. They take data (events) and interpret them by placing them into context (timeline). Unlike computers, most humans aren’t great...
View ArticleDevOps Newsletter 266
Kafka Ecosystem at LinkedIn Open Source Storage Server: 60 Hard Drives 480TB Storage Do GPU optimized databases threaten the hegemony of Oracle, Splunk and Hadoop? In case you missed it, here is how...
View Article