#TreehouseOpsLog on our tech debt todo list, non-exhaustive:
- set up logs collection and aggregation so it doesn't take forever to figure out where those 502s come from
- rework our nginx / docker compose file setup for easier horizontal scaling
- finish rolling out alpine 3.23
- set up internal service sso (this one's mostly an excuse in figuring out how to set up authentik or whatever)
- acquire remaining parts / schedule downtime for hardware upgrades
- migrate backups to cheaper hosting
- figure out why woodpecker keeps dying during jobs (it's probably untuned postgres on proxmox zfs *again*. somehow. sqlite was worse, but it's still annoying to see 15 minute build jobs fail)
- update to nginx >=1.29.4 for some very nice bug fixes and performance improvements
all of this is taken from our ops book, powered by mdbook+curl-PUT-to-git-pages (and a hacky woodpecker+apko+justfile build system, which is itself an image built using a similar setup in another repo)
figuring out how to release some of our ops stuff for public curiosity is on the very long term todo list; part of the reason it's not is because writing for public consumption is harder than writing for a restricted audience.