I still use physical spinning disks and can hear when they're busy. I had an issue awhile ago where a periodic job was running and writing to disk every few seconds, but AFAIK, I am no longer running that job yet there is something periodically writing to disk every few seconds.
Whenever it does, the system appears to hang, the keyboard and mouse are non-responsive and sometimes, the keyboard input is not even captured during that time. So, if I were typing while the system was out to lunch, they keystrokes were silently dropped.
If I open a new terminal window, it takes about a second for it to load (I have a bunch of things it does). After a few minutes when everything is cached, opening a new tab is instantaneous.
I am running iotop with a refresh every second, but I don't see anything standing out. I recently switched to a different hard drive as I rebuilt my system - I should say that all my drives are in some state of failure and I think this one might be worse off. My workstation and router are running on the same physical box in 2 separate jails, I have recently setup rctl to limit resources the jails can use.
My router uses fairly minimal resources, I have it set to 50% CPU and 2G of ram, listing the resources it consumes, it is well below those limits. For the workstation, I have it set to 300% CPU and 16G of ram. I only approach 16G of ram when using go fix on some larger projects or when digikam is running.
I have no other resource limits set.
htop shows a fairly minimal load on the system both in terms of CPU and memory and perhaps I don't know how ot read iotop, but nothing stood out there either.
What other tool(s) shall I use to investigate this? My other system for reference did not have rctl setup, but I noticed the pausing even before using rctl, so I don't believe that is the culprit or factor. Perhaps it is indeed the drive, I can always swap over to that for comparison. If the drive were going, would dmesg show that or perhaps SMART tools?
EDIT #1:
drive A:
drive B:
Drive B has a lower raw read error rate, but higher seek error rate. It was powered on for about 2000 more hours.
EDIT #2:
If I look at iostat -w1x, I periodically see the tout, KB/t, and tps numbers increase every 5 seconds which seems to correspond to the hard drive noise I can hear. My system CPU is an i5-3470 and I'm using the onboard GPU, not an external unit. I know the onboard GPU's performance isn't great, but I can generally watch full HD videos without the system pausing. Beyond that, I notice pauses and the frames dropped increases.
iostat -wx1
I'm mainly wondering if there is a way to improve this pausing that seemed to crop up recently. Perhaps I try disabling resource limits to see if that has an effect.
I disabled rctl and reenabled it, and that is where I can see the difference. For example, with rctl enabled, whenever I open a new terminal, the terminal sets up an ssh-agent if it needs to. With rctl enabled, it waits for a lock, with it disabled, I can open many tabs concurrently, and they all complete quickly. It seems rctl is affecting lock files? I need to investigate more.
My idea for using rctl was to prevent a jail from bringing down the host by consuming too much resources. However, the only settings I'm touching are CPU and memory.
EDIT #3:
I am not certain the perceived hangup has anything to do with the disk. It doesn't seem like there is that much activity presently. I came across another post that suggests that the system hanging could actually be the monitor:
In my case, I notice it now with watching videos. A video with a bandwidth of 1958 kb/s plays fine, but 4416 kb/s is choppy. Both are the same framerate, codec, and resolution.
Whenever it does, the system appears to hang, the keyboard and mouse are non-responsive and sometimes, the keyboard input is not even captured during that time. So, if I were typing while the system was out to lunch, they keystrokes were silently dropped.
If I open a new terminal window, it takes about a second for it to load (I have a bunch of things it does). After a few minutes when everything is cached, opening a new tab is instantaneous.
I am running iotop with a refresh every second, but I don't see anything standing out. I recently switched to a different hard drive as I rebuilt my system - I should say that all my drives are in some state of failure and I think this one might be worse off. My workstation and router are running on the same physical box in 2 separate jails, I have recently setup rctl to limit resources the jails can use.
My router uses fairly minimal resources, I have it set to 50% CPU and 2G of ram, listing the resources it consumes, it is well below those limits. For the workstation, I have it set to 300% CPU and 16G of ram. I only approach 16G of ram when using go fix on some larger projects or when digikam is running.
I have no other resource limits set.
htop shows a fairly minimal load on the system both in terms of CPU and memory and perhaps I don't know how ot read iotop, but nothing stood out there either.
What other tool(s) shall I use to investigate this? My other system for reference did not have rctl setup, but I noticed the pausing even before using rctl, so I don't believe that is the culprit or factor. Perhaps it is indeed the drive, I can always swap over to that for comparison. If the drive were going, would dmesg show that or perhaps SMART tools?
EDIT #1:
drive A:
raw read error rate: 51334312
seek error rate: 444635657
drive B:
raw read error rate: 4270608
seek error rate: 498599102
Drive B has a lower raw read error rate, but higher seek error rate. It was powered on for about 2000 more hours.
EDIT #2:
If I look at iostat -w1x, I periodically see the tout, KB/t, and tps numbers increase every 5 seconds which seems to correspond to the hard drive noise I can hear. My system CPU is an i5-3470 and I'm using the onboard GPU, not an external unit. I know the onboard GPU's performance isn't great, but I can generally watch full HD videos without the system pausing. Beyond that, I notice pauses and the frames dropped increases.
iostat -wx1
2 237 16.4 74 1.19 0.0 0 0.00 0.0 0 0.00 2 0 0 0 98
I'm mainly wondering if there is a way to improve this pausing that seemed to crop up recently. Perhaps I try disabling resource limits to see if that has an effect.
I disabled rctl and reenabled it, and that is where I can see the difference. For example, with rctl enabled, whenever I open a new terminal, the terminal sets up an ssh-agent if it needs to. With rctl enabled, it waits for a lock, with it disabled, I can open many tabs concurrently, and they all complete quickly. It seems rctl is affecting lock files? I need to investigate more.
My idea for using rctl was to prevent a jail from bringing down the host by consuming too much resources. However, the only settings I'm touching are CPU and memory.
EDIT #3:
I am not certain the perceived hangup has anything to do with the disk. It doesn't seem like there is that much activity presently. I came across another post that suggests that the system hanging could actually be the monitor:
I'm running into a problem where my system (running 11.0-RELEASE-p9) becomes unresponsive and I'm looking for some advice on how to debug it.
When it happens, the system does not respond to any inputs. I am always in X when it happens, but I can't say whether X is actually the problem. No keyboard or mouse input seems to be recognized, and I can't ssh into the system (it always times out).
All that I can tell is that the fans on my system kick in, which suggests that CPU usage is very high. Again, I have no idea what is actually using it.
I do not know how to trigger the problem. It...
When it happens, the system does not respond to any inputs. I am always in X when it happens, but I can't say whether X is actually the problem. No keyboard or mouse input seems to be recognized, and I can't ssh into the system (it always times out).
All that I can tell is that the fans on my system kick in, which suggests that CPU usage is very high. Again, I have no idea what is actually using it.
I do not know how to trigger the problem. It...
- GeoffWozniak
- Replies: 11
- Forum: General
In my case, I notice it now with watching videos. A video with a bandwidth of 1958 kb/s plays fine, but 4416 kb/s is choppy. Both are the same framerate, codec, and resolution.
Last edited: