Severl hours ago my zpool suddenly exploded. The server rebooted and could not boot.
I'm currently saving zpool images, but are there any chances? What should I try? Or the kernel managed to destroy all data, leaving nothing?
UPD.
UPD2. Looks like I've hit https://github.com/openzfs/zfs/issues/12559 and this issue is not in kernel yet. So beware of zstd!
Code:
root@:~ # zdb -AAA -F -d -e iile-boot
zdb: can't open 'iile-boot': Integrity check failed
Code:
root@:~ # zpool import
pool: iile-boot
id: 4380822407036168996
state: FAULTED
status: One or more devices are missing from the system.
action: The pool cannot be imported. Attach the missing
devices and try again.
The pool may be active on another system, but can be imported using
the '-f' flag.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-3C
config:
iile-boot FAULTED corrupted data
mirror-0 FAULTED corrupted data
gpt/iile-boot-1 UNAVAIL cannot open
gpt/iile-boot-0 ONLINE
pool: iile
id: 4721818964728306628
state: FAULTED
status: One or more devices are missing from the system.
action: The pool cannot be imported. Attach the missing
devices and try again.
The pool may be active on another system, but can be imported using
the '-f' flag.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-3C
config:
iile FAULTED corrupted data
mirror-0 DEGRADED
8445171921478463808 UNAVAIL cannot open
gpt/iile-0 ONLINE
Code:
root@:~ # zdb -AAA -e iile-boot
Configuration for import:
vdev_children: 1
version: 5000
pool_guid: 4380822407036168996
name: 'iile-boot'
state: 0
vdev_tree:
type: 'root'
id: 0
guid: 4380822407036168996
children[0]:
type: 'mirror'
id: 0
guid: 15675021958327973475
whole_disk: 0
metaslab_array: 256
metaslab_shift: 32
ashift: 9
asize: 536866193408
is_log: 0
create_txg: 4
children[0]:
type: 'disk'
id: 0
guid: 1991294491525726088
path: '/dev/gpt/iile-boot-1'
whole_disk: 1
DTL: 2866
create_txg: 4
children[1]:
type: 'disk'
id: 1
guid: 6740896146295478304
whole_disk: 1
DTL: 2172
create_txg: 4
path: '/dev/gpt/iile-boot-0'
load-policy:
load-request-txg: 18446744073709551615
load-rewind-policy: 2
(very long time)
zdb: can't open 'iile-boot': Integrity check failed
ZFS_DBGMSG(zdb) START:
spa.c:5998:spa_import(): spa_import: importing iile-boot
spa_misc.c:411:spa_load_note(): spa_load(iile-boot, config trusted): LOADING
vdev.c:131:vdev_dbgmsg(): disk vdev '/dev/gpt/iile-boot-0': best uberblock found for spa iile-boot. txg 3110443
spa_misc.c:411:spa_load_note(): spa_load(iile-boot, config untrusted): using uberblock with txg=3110443
vdev.c:136:vdev_dbgmsg(): mirror-0 vdev (guid 15675021958327973475): metaslab_init failed [error=97]
vdev.c:136:vdev_dbgmsg(): mirror-0 vdev (guid 15675021958327973475): vdev_load: metaslab_init failed [error=97]
spa_misc.c:396:spa_load_failed(): spa_load(iile-boot, config trusted): FAILED: vdev_load failed [error=97]
spa_misc.c:411:spa_load_note(): spa_load(iile-boot, config trusted): UNLOADING
ZFS_DBGMSG(zdb) END
I'm currently saving zpool images, but are there any chances? What should I try? Or the kernel managed to destroy all data, leaving nothing?
UPD.
Code:
root@:~ # sysctl vfs.zfs.spa.load_verify_metadata=0
vfs.zfs.spa.load_verify_metadata: 1 -> 0
root@:~ # sysctl vfs.zfs.spa.load_verify_data=0
vfs.zfs.spa.load_verify_data: 1 -> 0
root@:~ # zpool import -f -R /mnt -o readonly -N iile-boot
internal error: cannot import 'iile-boot': Integrity check failed
Abort (core dumped)
root@:~ # tail /var/log/messages
Dec 27 13:41:52 ZFS[2167]: pool I/O failure, zpool=iile-boot error=97
Dec 27 13:41:52 ZFS[2171]: vdev problem, zpool=iile-boot path= type=ereport.fs.zfs.vdev.corrupt_data
Dec 27 13:41:52 ZFS[2175]: failed to load zpool iile-boot
Dec 27 13:41:54 ZFS[2183]: pool I/O failure, zpool=iile-boot error=97
Dec 27 13:41:54 ZFS[2187]: vdev problem, zpool=iile-boot path= type=ereport.fs.zfs.vdev.corrupt_data
Dec 27 13:41:54 ZFS[2191]: failed to load zpool iile-boot
Dec 27 13:41:54 ZFS[2199]: pool I/O failure, zpool=iile-boot error=97
Dec 27 13:41:54 ZFS[2203]: vdev problem, zpool=iile-boot path= type=ereport.fs.zfs.vdev.corrupt_data
Dec 27 13:41:54 ZFS[2207]: failed to load zpool iile-boot
Dec 27 13:41:54 kernel: pid 2131 (zpool), jid 0, uid 0: exited on signal 6 (core dumped)
UPD2. Looks like I've hit https://github.com/openzfs/zfs/issues/12559 and this issue is not in kernel yet. So beware of zstd!