Weird networking issue, seeking help
https://piefed.ca/c/linux/p/629526/weird-networking-issue-seeking-help
Weird networking issue, seeking help
https://piefed.ca/c/linux/p/629526/weird-networking-issue-seeking-help
Ugh, this is the bane of my existence. For decades, this worked reliably. Then they introduced networkmanager and systemd-resolved, and everything went out the window and now it’s impossible to trace behavior.
I would verify your network and dns settings. One or both of those things I mentioned are probably interfering in resolution.
Hmm, interesting…
But we learned now, that your desktop somehow isn’t reaching your DNS (pi-hole) until restart of Network Manager
So next thing i would do is traceroute and dig on your desktop while it is not working, to see what DnS is used in this state
nmcli device show eth0
Or replace eth0 with whatever interface it is. Will give you a list of all network parameters including the DNS servers. There ideally should only be one, your pihole
GENERAL.CON-PATH ends in 2, whereas I have another interface lo which ends in 1. Would that be a clue of anything?
resolvectl status
So I would start with checking if the request is reaching PiHole.
Next time it breaks, before restarting networkmanager, go check the pihole requests log and see if your DNS queries are even showing up there.
If they are, what does pihole show it’s returning for the query, is it the correct IP?
If that’s working properly then I would check if you can ping the server by IP directly, make sure that connection is working.
I will check this next time it happens (I’m very confident that will be later today lol)
Out of curiosity, can you think of a reason why a problem with Pihole would be fixed by restarting my NetworkManager? I was certain the problem had to be on my pc.
Nice, narrowing it down!
Next thing I would check is your configured DNS servers on the desktop, it needs to have only the pihole IP, if it has any other servers configured that is likely the issue.
Also worth pinging the pihole IP next time it’s not working, to check if it’s actually a networking issue instead of a DNS issue.
Do you have multiple NIC? I sporadically run into issues where traffic will try to route itself via a secondary network which has limited external access and it is resolved in a similar manner, by bringing down all networks, then re-starting them in a specific order.
Realistically I should probably define some static routes, but it works automatically 99% of the time so I never bother.
If this sounds like a possible cause, check your routes while in failed, and functional states, and set static routes as needed to resolve.