One of the core tenets of Site Reliability Engineering (SRE) is that blameless postmortems / retrospectives should be held for oncall incidents. Its part of the continuous improvement process where we learn from what went wrong and try and create processes to ensure it doesn’t happen again. Very explicitly it is not about blaming anyone…
Tag: sre
Folsom Dev Summit sessions
I thought I should write up the dev summit sessions I am hosting now that the program is starting to look solid. This is mostly for my own benefit, so I have a solid understanding of where to start these sessions off. Both are short brainstorm sessions, so I am not intending to produce slide…
Reflecting on Essex
This post is kind of long, and a little self indulgent. However, I really wanted to spend some time thinking about what I did for the Essex release cycle, and what I want to do for the Folsom release. I spent Essex mostly hacking on things in isolation, except for when Padraig Brady and I…
Further adventures with base images in OpenStack
I was bored over the New Years weekend, so I figured I’d have a go at implementing image cache management as discussed previously. I actually have an implementation of about 75% of that blueprint now, but its not ready for prime time yet. The point of this post is more to document some stuff I…
Openstack compute node cleanup
I’ve never used openstack before, which I imagine is similar to many other people out there. Its actually pretty cool, although I encountered a problem the other day that I think is worthy of some more documentation. Openstack runs virtual machines for users, in much the same manner as Amazon’s EC2 system. These instances are…
Managing MySQL the Slack Way: How Google Deploys New MySQL Servers
Thanks to everyone who came along to Grant and my talk this morning about slack and how we bootstrap MySQL servers. Here is the slide deck as promised.
MySQL Users Conference
Well, they’re definitely thinking about getting started. Like last year I caught the VTA down — it’s hard to beat a $1.75 trip without having to worry about traffic. Registraton wasn’t as smooth this year as last, for example I didn’t get my free book (there didn’t seem to be any attempt to hand those…
Managing MySQL the Slack Way: How Google Deploys New MySQL Servers
I’ll be presenting about Slack (the open sourced tool kit we use for deployment software configuration) at the MySQL user’s conference in Santa Clara in late April. The talk will focus on the interesting aspects of Slack as it relates to MySQL and should be fun. A DBA mate of mine is gonna present with…
Thoughts on the first day of the MySQL user’s conference
So, I attended the first day of the MySQL user’s conference yesterday, which was the tutorial day. Overall I was fairly impressed. Registration was easy, the actual rooms presentations are given in are comfortable, the PA system seemed to work after some initial problems in the morning tutorial I attended. The conference center seems to…
Links from Rasmus’ PHP talk
The talk, and the demo are both very cool.