
Serving a 7B Model on ECS GPU Instances You Own
An ML developer came to me with a model to serve. He had already made the decisions that matter most: an open model tuned for his use case, …
Read MorePlatform and infrastructure engineer with 20+ years of experience designing, scaling, and evolving the systems that underpin engineering velocity.
Founder of InfraHouse, where he designs production infrastructure, database systems, and open-source IaC modules for consulting clients.
Previously Staff SRE at Pinterest (operating a MySQL fleet serving ~5M queries/sec across thousands of servers), Staff Database Engineer at Box, and MySQL SRE at Dropbox.
Creator of undrop-for-innodb and etcdb. Regular conference speaker.

An ML developer came to me with a model to serve. He had already made the decisions that matter most: an open model tuned for his use case, …
Read More
On January 30 this year, I filed an issue against our self-hosted GitHub Actions runner module
Read More
The startup I work with builds AI agents for enterprises. I handle the infrastructure. The founders and their engineers build the product.
Read More
I run infrastructure for a startup. Over the past two years I’ve built a system that manages the company’s entire GitHub …
Read MoreI run infrastructure for a startup. When I joined, the GitHub organization had a handful of repos created by hand through the web UI. …
Read MoreOpenClaw is everywhere right now. 247,000 GitHub stars, an AWS Lightsail blueprint, people running autonomous AI agents from their phones …
Read MoreWe recently came in as consultants to a startup with PostgreSQL performance problems. They didn’t have a DBA on staff - not unusual …
Read MoreOn Friday, December 26th, 2025, I released a bugfix for our private PyPI server. The server had been running for months in a degraded …
Read MoreIf you write or review infrastructure code-Terraform, AWS IaC, CI/CD pipelines, automation scripts - you’ve likely felt the pain …
Read MoreDiscover how InfraHouse transformed a routine Lambda module into production excellence through disciplined AI collaboration. Same timeline, …
Read MoreA practical engineering story about replacing Keycloak with Cognito to create a self-hosted Terraform registry using Tapir, AWS ECS, and ALB …
Read MorePart 1 of the Vulnerability Management Series — how to manage dependency vulnerabilities with OSV-Scanner and ih-github while meeting SLAs …
Read MoreWhen HashiCorp releases a new major version of the AWS Terraform provider, engineering teams often brace themselves. Major upgrades bring …
Read MoreI had a conversation with a colleague other day, and he asked who has access to a specific password. We use
Read More