I'm running into a problem where my system (running 11.0-RELEASE-p9) becomes unresponsive and I'm looking for some advice on how to debug it.
When it happens, the system does not respond to any inputs. I am always in X when it happens, but I can't say whether X is actually the problem. No keyboard or mouse input seems to be recognized, and I can't ssh into the system (it always times out).
All that I can tell is that the fans on my system kick in, which suggests that CPU usage is very high. Again, I have no idea what is actually using it.
I do not know how to trigger the problem. It is an intermittent issue. I've checked the logs and nothing of note is listed. dmesg doesn't report anything out of the ordinary. (I find it very odd that seemingly nothing gets logged.) I've let the system run in this state for up to 20 minutes and nothing changes.
I'm not asking here for help on debugging exactly why my system is doing this (which is why I have deliberately left out what I'm running on the system). I'm looking for advice on what I can use or do to debug it. Is there an option I can pass to the kernel to get more logging or debug info? If so, how much logging output will it produce and can I force it to be written to a file? And so forth.
In short, what's an accepted/effective way to collect information on a system that hangs intermittently, and does not allow introspection when the hanging occurs?
For the record, here's the output of
When it happens, the system does not respond to any inputs. I am always in X when it happens, but I can't say whether X is actually the problem. No keyboard or mouse input seems to be recognized, and I can't ssh into the system (it always times out).
All that I can tell is that the fans on my system kick in, which suggests that CPU usage is very high. Again, I have no idea what is actually using it.
I do not know how to trigger the problem. It is an intermittent issue. I've checked the logs and nothing of note is listed. dmesg doesn't report anything out of the ordinary. (I find it very odd that seemingly nothing gets logged.) I've let the system run in this state for up to 20 minutes and nothing changes.
I'm not asking here for help on debugging exactly why my system is doing this (which is why I have deliberately left out what I'm running on the system). I'm looking for advice on what I can use or do to debug it. Is there an option I can pass to the kernel to get more logging or debug info? If so, how much logging output will it produce and can I force it to be written to a file? And so forth.
In short, what's an accepted/effective way to collect information on a system that hangs intermittently, and does not allow introspection when the hanging occurs?
For the record, here's the output of
uname -a
. I'm happy to provide more info if anyone thinks it will help.
Code:
FreeBSD freebsd.local 11.0-RELEASE-p9 FreeBSD 11.0-RELEASE-p9 #0: Tue Apr 11 08:48:40 UTC 2017 root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC amd64