Filling the Gaps in My Knowledge

I started working as a sysadmin just as cloud really took off. I wasn’t really exposed to a lot of the networking minutiae. That was over 9 years ago. I never had to deal with something complicated in the world of networking. I stuck in my Linux lane and never wandered over to the networking lane. I knew some of the basics, but nothing further. It’s been 8 years and I’ve realized that it’s held me back a bit. One of the changes I’ve made ever since I read the Google SRE book is how I approach technical problems. I’m no longer happy to stop at, “Look, I got it working” or “The bug is not in my code, it’s in the library or a layer above”. I want to figure out the root cause.

Recently, I read Julia Evans’ post abut learning skills and it reminded me that networking is something I don’t know very well yet. I’ve looked at books that explain some basics, but I haven’t really gone in depth to understand how the pieces fit together. I don’t have the pressure of learning to pass a competitive exam. I just want to learn so I can fill in the gaps in my knowledge. Just in time, LinkedIn had offered me free premium for a month which also gives me access to LinkedIn Learning. I spent some time looking for a reasonably good course on networking. It’s been a great watch! The course is actually for CompTIA’s Network+ exam, which I have no intention of writing at the moment. However, it presented a good explanation of networking and TCP/IP. I knew some of the topics individually, but I couldn’t tie all of my knowledge together yet. The few days of watching Networking videos has been great. I don’t understand everything in great depth, but I know most of it and I know where to look for more details.

I’ve been reading and listening a great deal about growth mindset and deliberate practice. I’ve been in the tech industry for the last 9 years. I don’t have a degree yet, and even when I finish my current degree, it will not be in computers. It was humbling at first to sit down and learn something from scratch. At the same time, it’s very relieving. I’m more confident that I can understand networks better. I had most of the networking debugging skills I needed, but now I understand the theory better as I debug problems. Similarly, as a python programmer, I’ve barely ever looked deeply into the Unix kernel. However, as a sysadmin, when I debug problems, I would need a more in-depth grasp of what goes on behind the scenes. In the later year, I spent some time reading Advanced Programming in the Unix Environment. I don’t have it committed to memory, but I’ve read it broadly enough to understand where to look. I’ve understood a lot about what happens for IO/Networking/Process Management in Unixes. It helps me appreciate what goes on in Gluster better. It also helps debug some of the more weirder errors that I might run into.

I write this out as a note to myself. There is no shame in sitting down to learn something that you don’t know. It’s not going to be easy, but I’m going to be grateful for it.

Writing Golang as a Python Dev

I’ve gone through the Golang tutorial once before but in the last month or so, I fully dove into it. I started by writing a simple hello world web application. I found the implementation of the webserver so neat that most of the uses I’d have for a framework is redundant. The in-built libraries already take care of handling most of the use-cases I have. I did a couple of views and a couple of templates. It seems to be working well.

As someone coming from Python, I keep tripping over types. I started my professional career with PHP and then moved to Python. Both of these languages aren’t very strongly typed by default. So it’s been fun to find errors and fix them. I learn more and more that I can’t be lazy.

The other thing that’s tripped me over in go is the use of tab vs spaces. Python’s standard has been my second nature for years, so it’s fun to find out that’s not how you do things in Go. I’ve downloaded vim-go and that’s helped make most of it second nature.

All of this started in a fun way. I wrote a tiny flask app to try out kubernetes on Google Cloud. It’s a simple hello world app with some templates. I’ve figured out some k8s basics and now I see why having one binary helps. The next step in that learning process is to plug in my Go binary rather that copy in multiple files.

I’ve played around with the date libraries in Go now, and jeez, that’s a fun one 🙂 I’m used to specifying a specific format. The golang method is to tell you to specify the layout with a particular date instead. That took me quite a long time to figure out. I’ll admit, it’s definitely easier, but damn, it tripped me over for a while since I couldn’t believe what I was seeing.

One of the things I’ve done quite a lot is write web scrapers. I used colly briefly to do some scraping. It’s a very cool API. I did some decent scraping. It was quite easy to write. The bit I found amusing is that I found a bug in a dependent library and sent in a pull request for a fix.

I didn’t have to wait for it to be merged. I could compile my binary with my fixes myself. Of course, the side effect is every time one of my dependencies have a security bug fixed, I have to send a bug fix myself. That kinda sucks, but in the brave new world of containers, that’s the sort of thing I expect to happen.

This tiny experiment is slowly increasing in scope as I figure out what’s the next bit of new knowledge I need to gain. My current goal is to replace my python hello world web application in my kubernetes cluster with a golang one.

At some point in this process, I can also inject figuring out multi-stage builds. That should make the process of pushing out a new build to my hello-world app easier.

The other thing I’m thinking of is converting the salary converter backend to golang. If it works well, I might move it out to be hosted on a Kubernetes cluster rather than on a full-sized server. That should help split out some of the applications I run to be hosted in a shared but highly-available clusters that I don’t have to manage uptime.

Testing Ansible With Molecule

My colleague was recently assigned a task to create tests for an ansible role that she works on. She pinged me for help and we got started in figuring out what to do.

My colleague was recently assigned a task to create tests for an ansible role that she works on. She pinged me for help and we got started in figuring out what to do.

The first thing we attempted was to run tests inside docker using Ansible following the instructions in an Ansible.com blog post. The idea was we would run the role we wanted to test. Then run a second test playbook that would do a couple of asserts. I was stuck here for a bit for various reasons. The containers that are used in the blog post have not been updated in over a year. And we ran into some trouble trying to find a container with systemd running inside that was also public. The right way to do that would be to generate the container using a Dockerfile on the fly and run tests inside them. That was okay with me, but it added more complexity.

For two days or so, I briefly looked at the idea of doing this in VMs generated on the fly, but it added way too much overhead. Michael, my colleague, pointed me to molecule. His team has been using it regularly, though he himself hasn’t looked at it.

Molecule is an interesting project. It seems to do what I need, but there isn’t spectacular documentation on how to use it for a project that already exists. There are ascii videos, but I’m a fan of reading more than watching. Getting molecule to work on Fedora 28 was a bit of a pain. Ansible needs libselinux-python to work with Docker on a host that has selinux enabled. Now, you can’t installed libselinux-python from pip. It has to be installed from packages. I tried installing it in a virtualenv with site packages and installing molecule from packages, both of them failed in interesting ways that I’m yet to debug.

Eventually, I gave up and created a Centos 7 VM for this. A virtualenv with site packages actually worked inside my Centos 7 VM. This is great news, because this is the sort environment I expect to run molecule. The bit I really like about molecule is that it takes care of the harness and I can write asserts in Python. The tests will actually look like what Python tests look like. The bit I don’t like is that its documentation isn’t as thorough as I’d like. I plan to submit a pull request for the docs for a full flow on how to write tests with molecule. I found various blog posts on the internet that were far helpful. It took some guess work to realize that testinfra is it’s own python module and I should be looking at the module for documentation on how to write my own asserts. This is still a work in progress, but I expect a lot of our ansible pieces will end up being better tested now that we have this in place.

Dublin Adventures: Dublin Pen Corner

I’ve been wanting to buy myself a new Lamy pen for quite some time. I was looking at either getting the black Safari, AL-star, or Vista. I’d looked up pen stores in Dublin…

I’ve been wanting to buy myself a new Lamy pen for quite some time. I was looking at either getting the black Safari, AL-star, or Vista. I’d looked up pen stores in Dublin before my trip and narrowed down to discovered Pen Corner. I’m happy that I didn’t go entirely broke shopping there. I got myself the black Safari. It’s a beauty to write with and two Rhodia notebooks. I’ve heard a lot of good things about Rhodia and it does handle fountain pens really well. I have a problem with hoarding stationery – especially notebooks. I’m finally using the paper I got from Tokyo last year. I’ve been using the meeting book though. It helps a lot to sit down after a meeting and write down notes and action items in pen and paper. It makes it more final for me rather than using a Google doc alone.

 

Moving from pyrax to libcloud: A story in 3 parts

Softserve is a service that lets our community loan machines to debug test failures. They create cloud VMs based on the image that we use for our test machines. We originally…

Softserve is a service that lets our community loan machines to debug test failures. They create cloud VMs based on the image that we use for our test machines. We originally used pyrax to create and delete VMs. I spent some time trying to re-do that with libcloud.

Part 1: Writing libcloud code

I started by writing a simple script that created a VM with libcloud. Then, I modified it to do an SSHKeyDeployment, and further re-wrote that code to work with MultiStepDeployment with two keys. Once I got that working, all I had left was to delete the server. All went well, I plugged in the code and pushed it. Have you seen the bug yet?

Part 2: Deepshikha tries to deploy it

Because I’m an idiot, I didn’t test my code with our application. Our tests don’t actually go and create a cloud server. We ran into bugs. We rolled back and I went about fixing them. The first one we ran into was installing dependencies. Turns out that the process of installing dependencies for libcloud was slightly more complicated for some reason (more on that later!). We needed to pull in a few new devel packages. I sat down and actually fixed all bugs I could trigger. Turns out, there were plenty.

Part 3: I find bugs

Now I ran into subtle installation bugs. Pip would throw up some weird error. The default Python on Centos 7 is pretty old. I upgraded pip and setuptools inside my virtualenv to see if that would solve the pip errors and it did. I suspect some newer packages depend on newer setuptools and pip features and it fails quite badly when those are older.

After that, I ran into an bug that was incredibly satisfying to debug. The logs had a traceback that said I wasn’t passing a string as pubkey. I couldn’t reproduce that bug locally. On my local setup, the type was str, so I had to debug on that server with a few print statements. It turns out that the variable had type unicode. Well, that’s weird. I don’t know why that’s happening – unicode sounds right, something is broken on my local setup. A check for “strings” in python2 should check for str and unicode. The code does the following check which returns False when pubkey is a unicode:

isinstance(pubkey, basestring) 

On a first glance, that looked right. On python2, str and unicode are instances of basestring. A bit of sleuthing later, I discovered that libcloud has their own overridden basestring. This does not consider unicode to be an instance of basestring. I found this definition of basestring for python2:

basestring = unicode = str 

That doesn’t work as I expect on Python 2. How did this ever work? Is it that almost everyone passes strings and never runs into this bug? I have a bug filed. When I figure out how to fix this correctly, I’ll send a patch. My first instinct is to replace it with a line that just says

basestring 

In python2, that should Just Work™. Link to code in case anyone is curious. One part of me is screaming to replace all of this with a library like six that will handle edge cases way better.