@dustinrue, that's cool. That's probably what this setup here in my home, well, lab is as well. But there are a few users on the Mastodon instance, so I try to aim for production quality.
Anyway, I've now put a lot of effort into moving all the NFS pods that provide the ReadWriteMany endpoints for the ReadWriteOnce volumes onto one single node, the one I trust the most, and not the one that now hangs every day because of what I believe are memory chip issues... So I don't have three single points of failure, but one, which is an improvement, I hope.
Let's see what happens in a day or so when #Curie goes down again, giving me a lot of practice and exercise in hard reboots... Until I get a new node, which should also come by mail tomorrow, the other two nodes don't seem to have enough capacity to run everything anyhow, so when Curie goes down, all the rest fall like the North American electrical netw... sorry, like dominoes.