uname -a
Code:
FreeBSD bsd 13.0-RC3 FreeBSD 13.0-RC3 #0 releng/13.0-n244696-8f731a397ad: Fri Mar 19 04:00:20 UTC 2021 root@releng1.nyi.freebsd.org:/usr/obj/usr/src/amd64.amd64/sys/GENERIC amd64
My problem began with these console messages.
Code:
Apr 6 05:41:26 bsd kernel: mfisyspd1: hard error cmd=read 1104-1655
Apr 6 05:41:26 bsd kernel: mfi0: I/O error, cmd=0xfffffe001e0c7530, status=0x3c, scsi_status=0
Apr 6 05:41:26 bsd kernel: mfi0: sense error 0, sense_key 0, asc 0, ascq 0
Apr 6 05:41:26 bsd kernel: mfisyspd0: hard error cmd=read 0-551
Apr 6 05:41:26 bsd kernel: mfi0: I/O error, cmd=0xfffffe001e0c57f8, status=0x3c, scsi_status=0
Apr 6 05:41:26 bsd kernel: mfi0: sense error 0, sense_key 0, asc 0, ascq 0
Apr 6 05:41:26 bsd kernel: mfisyspd0: hard error cmd=read 552-1103
Apr 6 05:41:26 bsd kernel: mfi0: I/O error, cmd=0xfffffe001e0c5088, status=0x3c, scsi_status=0
Apr 6 05:41:26 bsd kernel: mfi0: sense error 0, sense_key 0, asc 0, ascq 0
Apr 6 05:41:26 bsd kernel: mfisyspd0: hard error cmd=read 1104-1655
The zpool does not show degraded, so I'm confused. I want to run smartcl on the disks in the pool.
zpool status
Code:
pool: tank
state: ONLINE
status: One or more devices has experienced an unrecoverable error. An
attempt was made to correct the error. Applications are unaffected.
action: Determine if the device needs to be replaced, and clear the errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: https://openzfs.github.io/openzfs-docs/msg/ZFS-8000-9P
scan: resilvered 506M in 00:01:04 with 0 errors on Thu Apr 1 08:37:28 2021
config:
NAME STATE READ WRITE CKSUM
tank ONLINE 0 0 0
raidz2-0 ONLINE 0 10.8K 0
mfisyspd0 ONLINE 43 36.7K 7
mfisyspd1 ONLINE 0 40K 3
mfisyspd2 ONLINE 0 38.5K 5
mfisyspd4 ONLINE 22 36.9K 7
mfisyspd5 ONLINE 17 38.1K 7
mfisyspd6 ONLINE 17 37.7K 4
According to Thomas Krenn's article, the command is
smartctl -a -d megaraid,N /dev/??
, where the N corresponds to the DID of the disk on the HBA. I am having difficulty determining the relationship between DID and the device.First I loaded the mfip module,
kldstat -v | grep mfip
resulting in
Code:
13 1 0xffffffff82b02000 21b8 mfip.ko (/boot/kernel/mfip.ko)
510 mfi/mfip
mfiutil show adapter
Code:
mfi0 Adapter:
Product Name: LSI MegaRAID SAS 9271-8i
Serial Number: SK84375861
Firmware: 23.34.0-0019
RAID Levels: JBOD, RAID0, RAID1, RAID5, RAID6, RAID10, RAID50
Battery Backup: not present
NVRAM: 32K
Onboard Memory: 1024M
Minimum Stripe: 8K
Maximum Stripe: 1M
camcontrol devlist -v
Code:
scbus7 on mfi0 bus 0:
<ATA WDC WD20EFRX-68E 0A80> at scbus7 target 8 lun 0 (pass3)
<ATA WDC WD20EFRX-68E 0A82> at scbus7 target 9 lun 0 (pass4)
<ATA WDC WD20EFRX-68E 0A82> at scbus7 target 10 lun 0 (pass5)
<ATA ST3000DM001-9YN1 CC9E> at scbus7 target 12 lun 0 (pass6)
<ATA WDC WD20EFRX-68E 0A82> at scbus7 target 13 lun 0 (pass7)
<ATA WDC WD20EFRX-68E 0A82> at scbus7 target 14 lun 0 (pass8)
<ATA WDC WD20EFRX-68A 0A80> at scbus7 target 15 lun 0 (pass9)
storcli /c0 /eall /sall show
Code:
CLI Version = 007.1211.0000.0000 Nov 07, 2019
Operating system = FreeBSD 13.0-RC3
Controller = 0
Status = Success
Description = Show Drive Information Succeeded.
Drive Information :
=================
--------------------------------------------------------------------------------
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
--------------------------------------------------------------------------------
252:0 13 JBOD - 1.819 TB SATA HDD N N 512B WDC WD20EFRX-68EUZN0 U -
252:1 10 JBOD - 1.819 TB SATA HDD N N 512B WDC WD20EFRX-68EUZN0 U -
252:2 14 JBOD - 1.819 TB SATA HDD N N 512B WDC WD20EFRX-68EUZN0 U -
252:3 9 JBOD - 1.819 TB SATA HDD N N 512B WDC WD20EFRX-68EUZN0 U -
252:5 12 JBOD - 2.728 TB SATA HDD N N 512B ST3000DM001-9YN166 U -
252:6 8 JBOD - 1.819 TB SATA HDD N N 512B WDC WD20EFRX-68EUZN0 U -
252:7 15 JBOD - 1.819 TB SATA HDD N N 512B WDC WD20EFRX-68AX9N0 U -
mfiutil show drives
Code:
mfi0 Physical Drives:
8 ( 1863G) JBOD <WDC WD20EFRX-68E 0A80 serial=WD-WMC4M3291997> SATA E1:S6
9 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M1SRNXN0> SATA E1:S3
10 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M3XAHPP6> SATA E1:S1
12 ( 2795G) JBOD <ST3000DM001-9YN1 CC9E serial=S1F0C6EK> SATA E1:S5
13 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M4PTHTTT> SATA E1:S0
14 ( 1863G) JBOD <WDC WD20EFRX-68E 0A82 serial=WD-WCC4M1SRN3HS> SATA E1:S2
15 ( 1863G) JBOD <WDC WD20EFRX-68A 0A80 serial=WD-WMC301253288> SATA E1:S7
Finally,
smartctl -a -d megaraid,13 /dev/mfisyspd4
Code:
smartctl 7.2 2020-12-30 r5155 [FreeBSD 13.0-RC3 amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
/dev/mfisyspd4: Unknown device type 'megaraid,13'
=======> VALID ARGUMENTS ARE: ata, scsi[+TYPE], nvme[,NSID], sat[,auto][,N][+TYPE], usbcypress[,X], usbjmicron[,p][,x][,N], usbprolific, usbsunplus, sntjmicron[,NSID], sntrealtek, intelliprop,N[+TYPE], jmb39x[-q],N[,sLBA][,force][+TYPE], jms56x,N[,sLBA][,force][+TYPE], 3ware,N, hpt,L/M/N, cciss,N, areca,N/E, atacam, auto, test <=======
It doesn't matter what number I use for the DID (N), the error is the same.
Sorry for the long post. Any suggestions?