Hello FreeBSD forum!
I have a backup solution for some Windows Servers backed by some iSCSI zfs zvols I use ctld/ctladm to provision the disks to Windows, and I'm using:
on ctld.conf.
The FreeBSD box was originally a 11.0, after upgrading to 11.1-RELEASE-p1, it started to panic regularly. The server hardly get 1 day uptime after this upgrade. I thought it could be something with my deploy, so I rebuilt the system from scratch and the issue persisted. All the crashes point to the same zfs error:
I was looking for some way to analyse the vmcore dump with kgdb but got no luck and having a really bad time.
The process executed by Windows is to copy some files, proccess them and then delete old entries. I'm suspecting this could have something with Windows getting high loads and sending the unmap command to iSCSI before it actually completes/commits the operation. And this could be related to the compressed zfs arc too, since it's one of the big changes I noticed between 11.0 and 11.1.
Do you have any advice or tips on how to start debugging this problem?
Thanks a lot!
I have a backup solution for some Windows Servers backed by some iSCSI zfs zvols I use ctld/ctladm to provision the disks to Windows, and I'm using:
Code:
option pblocksize 0 option unmap on
The FreeBSD box was originally a 11.0, after upgrading to 11.1-RELEASE-p1, it started to panic regularly. The server hardly get 1 day uptime after this upgrade. I thought it could be something with my deploy, so I rebuilt the system from scratch and the issue persisted. All the crashes point to the same zfs error:
Code:
Dump header from device: /dev/da0p2
Architecture: amd64
Architecture Version: 2
Dump Length: 5561098240
Blocksize: 512
Dumptime: Fri Sep 15 23:03:17 2017
Hostname: F01833PAPP0
Magic: FreeBSD Kernel Dump
Version String: FreeBSD 11.1-RELEASE-p1 #0: Wed Aug 9 11:55:48 UTC 2017
root@amd64-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC
Panic String: Solaris(panic): zfs: allocating allocated segment(offset=42595471360 size=36864)
Dump Parity: 1614276210
Bounds: 1
Dump Status: good
I was looking for some way to analyse the vmcore dump with kgdb but got no luck and having a really bad time.
The process executed by Windows is to copy some files, proccess them and then delete old entries. I'm suspecting this could have something with Windows getting high loads and sending the unmap command to iSCSI before it actually completes/commits the operation. And this could be related to the compressed zfs arc too, since it's one of the big changes I noticed between 11.0 and 11.1.
Do you have any advice or tips on how to start debugging this problem?
Thanks a lot!