Calliope Sounds

Hilary Gridley and building feedback tools

This morning YouTube added the following interview to my feed. Embarrassingly, I realized that I have spent all of my time understanding how to use AI as a coder or as a student/researcher. I had little understanding of how other professionals use it every day. Hilary Gridley, the guest, showed a fascinating example of how she uses ChatGPT to build out a feedback tool (a custom "GPT") for herself and for her staff. The feedback is on slide decks, but it is essentially what my employer is doing for PRs without all the custom coding. I found the whole interview fascinating and now want to see more of its kind.

How custom GPTs can make you a better manager | Hilary Gridley (Head of Core Product at Whoop)
https://www.youtube.com/watch?v=xDMkkOC-EhI

SAGA and AWI

After showing my dark age miniatures I decided that perhaps I would enjoy a solo game of SAGA. After buying a nice mat and setting up the game table I discovered I no longer had the faction dice. I had forgotten I had given them away. So I followed this advice and retrofitted a load of D6 dice and the Viking and Welsh faction boards.

I played a few games. I didn't really enjoy them. I'm not an experienced solo gamer and so that was a factor. What really bothered me was the scale, ie the sweep of the game. The 28mm figures and terrain on a 6'x4' table were visually congested, and the game play had no satisfying build-up or peak. Most faction units were in melee within a few turns and the outcome was obvious. It felt flat. Perhaps if I had been able to extract a story from the encounters and ensuing battles I would have enjoyed them more. (Bloodbaths are not stories.)

I have recently been reading Henry Hyde's Shot, Steel, and Stone rules, and listening to his podcast and that of the Yarkshire gamer. Their advocacy for the large wargame has definitely affected me. Next year is the 250th anniversary of the American War of Independence. The battles are smaller than Napoleon's so it seems practical to have enough figures and landscape to make it feel authentic and not gamey. Gaming the 165ish significant battles, some quite local, on a large table full of 10mm units seems wonderful.

Side-Effects are the Complexity Iceberg

I liked Kris Jenkins's talk Side-Effects are the Complexity Iceberg. Three takeaways for me were

His description of side-effects as "every function has two sets of inputs and two sets of outputs" nicely encapsulates the idea of parameters, results, and before and after states.
The urge to rewrite the system becomes more pressing as ever fewer people remain who understand why the system is as it is. That is, it is an institutional problem and not a technical problem.
If you really hate someone, teach them to recognize bad kerning.
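
The "two sets of inputs and two sets of outputs" idea from the first takeaway can be sketched in a few lines. This is only an illustration, not code from the talk; the function names and the counter are hypothetical.

```python
# Hypothetical illustration: none of these names come from the talk.
counter = {"calls": 0}  # hidden state, the "second" input and output


def add_impure(a, b):
    """Visible input: a, b. Hidden input: counter (before state).
    Visible output: the sum. Hidden output: counter (after state)."""
    counter["calls"] += 1
    return a + b


def add_pure(a, b, calls):
    """The same work with both sets made visible: the before state
    goes in as a parameter and the after state comes out as a result."""
    return a + b, calls + 1


total = add_impure(1, 2)          # quietly changes counter
total2, calls = add_pure(1, 2, 0)  # nothing changes behind the caller's back
```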

"A flower in the mind is better than a bee in the pocket"

I have amused myself today asking AIs to explain fictitious parables and idioms. For example, 'Explain the idiom "A flower in the mind is better than a bee in the pocket."' The explanations are often eye-opening.

Wednesday Gamers

A decade ago I was invited to join the "Wednesday Gamers." This was a small group who had been playing miniature wargames together every Wednesday for decades. It was a rare gift to be in the company of these older men who had gone through life's joys and troubles, remained friends, patient with each other's foibles, and continued to have a genuine enthusiasm for the hobby. Within a few years, as each retired, their situations fundamentally changed and the Wednesday gaming sessions stopped. I will forever be grateful to Al, Maurice, Leo, Kim, and Kevin.

Board game UX

I don't know if the game Star Wars: Outer Rim is enjoyable, but the game's physical design is amazing. Which reminds me I have What the Tech World Can Learn from Video Game UX lined up to watch.

Update: The video was about using gamification to enhance learning to use a handheld ultrasound scanner. Nothing new.

Inter-company event sourcing

I was in a client's meeting today and they were discussing how one of their batch processes has a dependency on data generated by a supplier's batch process. The problem is that the client does not know when the supplier's process is complete so they can start their process. (The supplier's data is accessed via an API rather than a bulk file delivery.) The discussion went along the lines of

We know their process starts around 10 PM and takes about an hour to complete. So let's give it a couple of hours to finish, just in case it's slow that day, and we will start our process at, say, midnight.

I scream into the void. This kind of inter-company data processing coordination is quite common. Common enough that I am still surprised there is no standard, or industry-specific, solution we use.
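
Short of a real standard, the usual alternative to "start at midnight and hope" is to wait on a completion signal. The sketch below assumes the supplier exposes some way to ask "is tonight's run finished?", which the client in this story did not have; `is_complete` is a hypothetical callable standing in for that check, and the intervals are placeholders.

```python
import time
from typing import Callable


def wait_until_complete(
    is_complete: Callable[[], bool],  # hypothetical: asks the supplier if its batch is done
    poll_seconds: float,
    timeout_seconds: float,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> bool:
    """Poll until the supplier reports completion or the timeout passes.

    Returns True when it is safe to start our own batch, and False when
    we gave up waiting and someone should be told.
    """
    deadline = clock() + timeout_seconds
    while True:
        if is_complete():
            return True
        if clock() >= deadline:
            return False
        sleep(poll_seconds)


# Simulated supplier that finishes on the third check.
answers = iter([False, False, True])
started = wait_until_complete(lambda: next(answers), 0, 60, sleep=lambda s: None)
```

Polling is still a workaround. An event published by the supplier when its run finishes would remove the guessing entirely, which is the gap the post is complaining about.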

South Kingstown & Narragansett restaurants

If you are looking for good restaurants in South Kingstown or Narragansett try Purslane, Duck Press, Tsunama, Agave Social Cocina Mexicana, and Pasquale's Pizzeria Napoletana.

Recent history of education funding in Rhode Island

“For decades, Rhode Island’s policymakers have operated under the myth that cutting taxes for businesses and high earners would spur economic growth. Yet the data is clear..."

SEEDS UNPLANTED - The Recent History of Education Funding in Rhode Island

A pair of formal diagram explanations. One for software and one for buildings.

The C4 set of diagrams is directly relevant to my work as a software engineer. Seeing the architecture profession's sets of diagrams is a useful reminder -- and an immediately obvious one for anyone who has lived in a house! -- that different diagrams are needed for different purposes (ie, contexts).

The C4 Model – Misconceptions, Misuses & Mistakes • Simon Brown • GOTO 2024

What's in my set of architectural documents? Sharing everything: drawings, schedules, + specs.

Why I don't want color coded logs

Most developers laugh at me when I say I don't want color coded logs. They rarely ask why. Logs with color coded structures provide no actionable information. They actually obscure actionable information by forcing a distinction without a difference. What I am looking for are clues in the logs. Those clues are often easily overlooked small tokens. I use color or inversion to distinguish them so that their occurrence immediately stands out from the background noise. The highlight script is a simple tool for this.
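
For readers who want the idea without the script, here is a minimal sketch of that kind of filter. It is not the highlight script mentioned above; the file name `highlight.py` and the function name are assumptions made for illustration. It wraps every match of the given regular expressions in the ANSI reverse-video code and leaves everything else untouched.

```python
import re
import sys

INVERT = "\033[7m"  # ANSI reverse video
RESET = "\033[0m"   # ANSI reset


def highlight(line: str, patterns: list[str]) -> str:
    """Return line with every match of any pattern shown in reverse video."""
    if not patterns:
        return line
    combined = re.compile("|".join(f"(?:{p})" for p in patterns))
    return combined.sub(lambda m: f"{INVERT}{m.group(0)}{RESET}", line)


# Act as a stdin-to-stdout filter only when at least one pattern is given.
if __name__ == "__main__" and len(sys.argv) > 1:
    for raw in sys.stdin:
        sys.stdout.write(highlight(raw, sys.argv[1:]))
```

Saved as the hypothetical `highlight.py`, it could sit at the end of a pipe, for example `kubectl logs -f POD -c CONTAINER | python3 highlight.py 'private key' ERROR`.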

Unnecessary frustration and toil

I spent a good part of yesterday tracking down a problem with the staging deployment of a feature I first started back in October of last year. (That it has taken this long to get it to staging has everything to do with how this organization manages work.) When you have such an extended period between implementation and deployment you rarely retain the feature's context and even more rarely an environment within which to debug problems. It took me some time to regain that context and environment. (I should have left better notes for my future self.) Once I had that it became obvious that the feature worked and that the problem lay in the deployment.

The deployment is a small Kubernetes cluster. Each service has 2 or 3 containers in several pods. I figured out the pod ids and container names (why are they all different!) and opened a terminal window for each and streamed the logs. I used a small script to highlight text that I was expecting to find. I then used the feature and discovered the problem was due to a corrupted private key stored in the deployment's database.

The organization uses Honeybadger.io to record exceptions and Elasticsearch to aggregate logs. These tools are intended to improve access to the details needed to debug issues. Each tool has its own user interface and mechanisms for accessing and searching. To use them you obviously need to understand these mechanisms and, more significantly, you need to know how the organization has configured them. That is, no two organizations use the same data model for how they record exceptions and log details.

The developer needs documentation about the configuration and there was none. Well, that is not quite true. This organization has thousands of incomplete, unmaintained, and contradictory Confluence pages. The "information" available to the developer is actually worse than none at all as they will waste time trying to piece together some semblance of a coherent (partial) picture. What I eventually concluded was that it could not be done and my best path forward was to look at the raw container logs.

I understand that at this organization I am a contractor and so just developer meat. But what I have seen is that this global, financial, highly profitable organization does not do any better for their developer employees. Perhaps all industries are like this. I have only experienced the software development industry and here they are mostly the same. It makes me sad and mad to see and experience such unnecessary frustration and toil.

Transactions and some concurrency problems

A group of us are reading Kleppmann's Designing Data-Intensive Applications. Chapter 7 is on transactions and especially the different approaches used to address concurrency problems, ie the Isolation in ACID. What becomes clear is that transaction isolation levels can only mitigate some problems. It is your application's data model design and use that are mostly responsible for avoiding them. Here are the concurrency problems raised in this chapter:

Lost Updates. These occur when a process reads some records, modifies them, and writes them back. If the updated records had been modified by another process after the read then those updates would be lost.

Read Skew. This is a variation of Lost Updates due to the delays between steps in a multiple step operation. Processes A and B are interacting with the same records. Process A reads records X and Y (two steps). Process B updates records X and Y (two steps). Due to the delay between A's and B's steps, process A has the original X value but the updated Y value.

Write Skew. This occurs when process A reads some records to make a decision and then updates other records appropriate to the decision. While the decision is being made process B changes some records that would alter process A's decision. Process A is unaware of process B's changes and continues to make its updates which invalidates the data model.

Phantoms. This is a variation of Write Skew. Process A queries for records to make a decision. Process B inserts records that would have been included in process A's query results. Unaware of these inserts, process A makes its updates which invalidates the data model. The "phantoms" are the records not included in process A's query result.
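
Of the four, Lost Updates is the easiest to show in code. The sketch below is a single-threaded simulation that interleaves two "processes" by hand, using a version check (compare-and-set) as one possible remedy. The `VersionedStore` class is hypothetical and stands in for a database row carrying a version column; it is not thread-safe and is not taken from the book.

```python
class VersionedStore:
    """Hypothetical in-memory stand-in for rows that carry a version number."""

    def __init__(self):
        self._rows = {}  # key -> (version, value)

    def read(self, key):
        """Return (version, value); a missing key reads as version 0."""
        return self._rows.get(key, (0, None))

    def write_if_unchanged(self, key, expected_version, value):
        """Write only if nobody else has written since we read."""
        current_version, _ = self._rows.get(key, (0, None))
        if current_version != expected_version:
            return False  # our read is stale, so the caller must re-read and retry
        self._rows[key] = (current_version + 1, value)
        return True


store = VersionedStore()
store.write_if_unchanged("counter", 0, 10)

# Processes A and B both read the same record before either writes.
version_a, value_a = store.read("counter")
version_b, value_b = store.read("counter")

a_ok = store.write_if_unchanged("counter", version_a, value_a + 1)
b_ok = store.write_if_unchanged("counter", version_b, value_b + 1)  # rejected, not lost

# B re-reads and retries, so both increments survive.
version_b, value_b = store.read("counter")
b_retry_ok = store.write_if_unchanged("counter", version_b, value_b + 1)
```

Without the version check, B's write would have silently replaced A's and the counter would end at 11 instead of 12.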

Costs of helpful data flexibility.

I'm having a discussion currently with a young developer who has only ever worked in Ruby and JavaScript. I noticed that the developer had chained "symbolize_keys" to the end of a method call. In Ruby this converts a hash's keys from strings to symbols, ie { "a" => 123 } becomes { :a => 123 }. Their reason for this was to offer flexibility to the called method as to how it returned the result. They thought this provided flexibility and robustness. I countered that it did the exact opposite.

When a function can be given parameters and return results in multiple formats then robustness is only had when the function handles all formats equally. To do that the function needs to be tested with all formats. This can be done, but in practice, and I've seen this across many organizations, it is not. Not only does the function need to be tested with the multiple formats, but the callers and the called need to be tested too. It's a combinatorial explosion of testing.

The other detriment to this flexibility is that since no function is sure of the format every function converts the data to its preferred format even if the data is already in the preferred format. This conversion adds to the function's code size and has a runtime cost (CPU and memory) on every invocation. The cost of a single use might be small, but our applications work in a world with thousands of concurrent sessions each with deep call chains, and expect microsecond responses. Those single uses add up.

My recommendation to the developer was to require one format as part of its contract and add validation that runs at least during testing. (I'd like to just tell them to use a typed language where this wouldn't even be an issue!)
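
A sketch of that recommendation follows, written in Python rather than the developer's Ruby. Python has no symbols, so a dict with string keys stands in for the hash; the key names and the `fetch_user` and `check_user` functions are hypothetical. The contract is a single format, and the check runs only when `__debug__` is true, which is the default and is switched off by `python -O`.

```python
REQUIRED_KEYS = {"id", "name"}  # hypothetical contract: exactly one key format


def check_user(result):
    """Raise if result breaks the contract; return it unchanged otherwise."""
    if not isinstance(result, dict):
        raise TypeError(f"expected dict, got {type(result).__name__}")
    non_string = [key for key in result if not isinstance(key, str)]
    if non_string:
        raise TypeError(f"keys must be str, got {non_string!r}")
    missing = REQUIRED_KEYS - result.keys()
    if missing:
        raise KeyError(f"missing keys: {sorted(missing)}")
    return result


def fetch_user():
    """The called function returns one format and never converts on the way out."""
    result = {"id": 7, "name": "Ada"}
    if __debug__:  # checked in development and tests, skipped under `python -O`
        check_user(result)
    return result


user = fetch_user()
```

Callers can then rely on the format instead of converting defensively on every invocation.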

I mentioned that the developer's experience is in Ruby and JavaScript. I have found that it is common for such developers to not expect data to be in a specific format or type. I assume some of this comes from never being trained to always validate and convert data coming from the outside before using it inside. (Eg, directly passing around an INPUT element's value or a database's column value.) Once inside, you can be assured of its correctness. Instead, data is passed around without any function knowing a priori that it is correct.

I am unsure if I will convince this developer to not use "symbolize_keys". I am rowing against the tide.

Update: Not only did I not convince the developer, but the system's architect rejected it also.

RI DOT

A long time ago I organized a study group to read the whole RI state budget. We were lucky to get Tom Sgouros to guide us through this massive document. At the time there was no online version so we got printed copies. I remember struggling to carry the weight of multiple copies of its multiple volumes as I walked to my car. One of the things we learned was that DOT has almost no debt service. How can a $981M department that is responsible for roads, bridges, etc with lots of bond money projects have only $330K of debt service? It achieves this by hiding it within the Department of Administration. Most of the DOA's $211M debt service is actually DOT's. DOT costs Rhode Islanders well over a billion dollars a year. I honestly don't know if this cost is outrageous, or if it is money well spent. But it is useful to know the scale of the effort to build and maintain the road infrastructure.

FY 2025 Budget

SSL terminating tunnel using ghostunnel

From time to time a simple SSL terminating tunnel is needed. This is used to enable the browser to use an HTTPS connection to an HTTP server. It is common to use a proxy server, but I was curious if there was something simpler. I was able to create an SSL tunnel using ghostunnel

https://github.com/ghostunnel/ghostunnel

To build it for macOS 14.7 I needed to update the go.mod to use toolchain go1.22.7 (instead of toolchain go1.22.4).

Create the cert and key

openssl req \
  -x509 \
  -newkey rsa:4096 \
  -keyout key.pem \
  -out cert.pem \
  -sha256 \
  -days 3650 \
  -nodes \
  -subj "/C=US/ST=RI/L=Providence/O=MojoTech/OU=Labs/CN=clientsite.com"

Add the client's domain name to /etc/hosts

127.0.0.1 clientsite.com

Run the tunnel

sudo ghostunnel server \
  --listen clientsite.com:443 \
  --target localhost:3000 \
  --cert cert.pem \
  --key key.pem \
  --disable-authentication

Run Python's file directory serving http server

python3 -m http.server 3000

And finally, open https://clientsite.com in the browser or with curl

curl -k https://clientsite.com

I think since this is Go and executables are statically linked, you could share the ghostunnel executable and PEMs with other developers.

"His train goes to a different station" is the best description of eccentricity I have heard in a long time.

Bye little Linode VM

The website https://andrewgilmatin.com/ is no more. I wasn't using the little Linode VM for much of anything anymore. If I were to keep it running I really needed to move it off of the discontinued CentOS 7. I would have to transition content, old code, and figure out security. Much has changed since I last needed to do that. I was not up for that marathon again.

Sensitive side of pure evil

I am reading Lord of the Rings for the first time. Yes, reading LotR is a rite of passage for geeks, but I'm really only a geek by circumstances rather than by anything deeper. (I have watched Peter Jackson's movies several times, if that helps.) I am enjoying the books, having started with the Hobbit. But several times I have wondered how a young reader today, one not raised in bucolic Devon, responds to Tolkien's beautifully rendered landscapes? Those landscapes are integral to the book and, for me, a sustaining attraction.

I did try watching the first season of the Rings of Power, but quickly gave up. Others have well explained its many, many failures. It is now in its second season and, apparently, has very strange things to say about the sensitive side of pure evil.

Rings of Power’s orc baby: Amazon’s Lord of the Rings prequel doesn’t get it right. | Vox

Ad hoc systems for managing work

I love seeing people's systems for managing their work. Even those of fictional people. This short from The Bear on managing the restaurant's guests and their orders is great.