Greetings,
I recently encountered a problem with my jails (I use iocage to manage them), which leads to a crash of the host-system.
I'm using this setup for quite a while without similar issues, but after upgrading to 13 directing network-traffic IN or OUT of my jails leads to a Kernel panic.
So I have a jail running zabbix which crashed my server when I try to access it's webui.
Same goes when I try to access anything from inside the jail (connected via console)
boom, host is down.
The only special thing in my setup I can think of is that I use vxlan interfaces (and fibs) to separate my networks.
I also have bhyve-based virtual machines on the same vxlan, which still work perfectly after the Upgrade.
I've tried upgrading 12.2 to 13 in April with the same result, rolled back my boot env. and tried again yesterday, the outcome is identical.
As soon as I'm back to 12.2 everything works just fine.
Has anyone encountered a similar issue or has an idea what to do?
I might just try setting up a testjail via ezjail or without any of these helping scripts to check wether it's an iocage-only problem, but i feel like a jail should not be able to crash the host at all.
Unfortunately there aren't many normal logs of this events, and I'm not skilled enough to do something usefull with the core-dumps.
I see a great amount of these login_getclass errors
I recently encountered a problem with my jails (I use iocage to manage them), which leads to a crash of the host-system.
I'm using this setup for quite a while without similar issues, but after upgrading to 13 directing network-traffic IN or OUT of my jails leads to a Kernel panic.
So I have a jail running zabbix which crashed my server when I try to access it's webui.
Same goes when I try to access anything from inside the jail (connected via console)
Code:
root@phcn-zabbix:~ # curl www.google.de
client_loop: send disconnect: Connection reset
The only special thing in my setup I can think of is that I use vxlan interfaces (and fibs) to separate my networks.
I also have bhyve-based virtual machines on the same vxlan, which still work perfectly after the Upgrade.
I've tried upgrading 12.2 to 13 in April with the same result, rolled back my boot env. and tried again yesterday, the outcome is identical.
As soon as I'm back to 12.2 everything works just fine.
Has anyone encountered a similar issue or has an idea what to do?
I might just try setting up a testjail via ezjail or without any of these helping scripts to check wether it's an iocage-only problem, but i feel like a jail should not be able to crash the host at all.
Unfortunately there aren't many normal logs of this events, and I'm not skilled enough to do something usefull with the core-dumps.
I see a great amount of these login_getclass errors
Code:
Jun 25 23:26:21 Server1 jail[2984]: login_getclass: unknown class 'root'
Jun 25 23:26:21 Server1 jail[2985]: login_getclass: unknown class 'root'
Jun 25 23:26:21 Server1 jail[2986]: login_getclass: unknown class 'root'
Jun 25 23:26:21 Server1 jail[2987]: login_getclass: unknown class 'root'
Jun 25 23:26:21 Server1 jail[2988]: login_getclass: unknown class 'root'
Jun 25 23:26:56 Server1 jail[3617]: login_getclass: unknown class 'root'
Jun 25 23:26:56 Server1 jail[3618]: login_getclass: unknown class 'root'
Jun 25 23:26:56 Server1 jail[3619]: login_getclass: unknown class 'root'
Jun 25 23:26:56 Server1 jail[3620]: login_getclass: unknown class 'root'
Jun 25 23:26:56 Server1 jail[3621]: login_getclass: unknown class 'root'
Jun 25 23:26:56 Server1 jail[3622]: login_getclass: unknown class 'root'
Jun 25 23:28:21 Server1 jexec[3916]: login_getclass: unknown class 'root'
Jun 25 23:29:25 Server1 syslogd: kernel boot file is /boot/kernel/kernel
Jun 25 23:29:25 Server1 kernel: ---<<BOOT>>---
Jun 25 23:29:25 Server1 kernel: Copyright (c) 1992-2021 The FreeBSD Project.
Jun 25 23:29:25 Server1 kernel: Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
Jun 25 23:29:25 Server1 kernel: The Regents of the University of California. All rights reserved.
Jun 25 23:29:25 Server1 kernel: FreeBSD is a registered trademark of The FreeBSD Foundation.
Jun 25 23:29:25 Server1 kernel: FreeBSD 13.0-RELEASE-p1 #0: Wed May 26 22:15:09 UTC 2021
Jun 25 23:29:25 Server1 kernel: [email]root@amd64-builder.daemonology.net[/email]:/usr/obj/usr/src/amd64.amd64/sys/GENERIC amd64
Jun 25 23:29:25 Server1 kernel: FreeBSD clang version 11.0.1 ([email]git@github.com[/email]:llvm/llvm-project.git llvmorg-11.0.1-0-g43ff75f2c3fe)
Jun 25 23:29:25 Server1 kernel: VT(vga): resolution 640x480
Jun 25 23:29:25 Server1 kernel: CPU: AMD Ryzen 7 3700X 8-Core Processor (3600.07-MHz K8-class CPU)
root@Server1 ~ grep -r -i "panic" /var/log
/var/log/messages:Apr 18 14:27:03 Server1 savecore[1700]: reboot after panic: double fault
/var/log/messages:Apr 18 14:28:35 Server1 savecore[1685]: reboot after panic: double fault
/var/log/messages:Apr 18 14:30:23 Server1 savecore[1685]: reboot after panic: double fault
/var/log/messages:Apr 18 14:57:27 Server1 savecore[1681]: reboot after panic: double fault
/var/log/messages:Apr 18 14:59:31 Server1 savecore[1679]: reboot after panic: double fault
/var/log/messages:Apr 20 19:56:25 Server1 savecore[1693]: reboot after panic: double fault
/var/log/messages:Jun 25 23:03:04 Server1 savecore[1691]: reboot after panic: double fault
/var/log/messages:Jun 25 23:13:31 Server1 savecore[1678]: reboot after panic: double fault
/var/log/daemon.log:Apr 18 14:27:03 Server1 savecore[1700]: reboot after panic: double fault
/var/log/daemon.log:Apr 18 14:28:35 Server1 savecore[1685]: reboot after panic: double fault
/var/log/daemon.log:Apr 18 14:30:23 Server1 savecore[1685]: reboot after panic: double fault
/var/log/daemon.log:Apr 18 14:57:27 Server1 savecore[1681]: reboot after panic: double fault
/var/log/daemon.log:Apr 18 14:59:31 Server1 savecore[1679]: reboot after panic: double fault
/var/log/daemon.log:Apr 20 19:56:25 Server1 savecore[1693]: reboot after panic: double fault
/var/log/daemon.log:Jun 25 23:03:04 Server1 savecore[1691]: reboot after panic: double fault
/var/log/daemon.log:Jun 25 23:13:31 Server1 savecore[1678]: reboot after panic: double fault