Project

General

Profile

Actions

Bug #15618

closed

HAproxy causes system panic

Added by Amin Sadeghi 6 months ago. Updated 5 months ago.

Status:
Duplicate
Priority:
Normal
Assignee:
-
Category:
haproxy
Target version:
-
Start date:
Due date:
% Done:

0%

Estimated time:
Plus Target Version:
Affected Version:
Affected Plus Version:
24.03
Affected Architecture:
amd64

Description

Crash report begins. Anonymous machine information:

amd64
15.0-CURRENT
FreeBSD 15.0-CURRENT #0 plus-RELENG_24_03-n256311-e71f834dd81: Fri Apr 19 00:28:14 UTC 2024 root@freebsd:/var/jenkins/workspace/pfSense-Plus-snapshots-24_03-main/obj/amd64/Y4MAEJ2R/var/jenkins/workspace/pfSense-Plus-snapshots-24_03-main/sources/FreeBS

Crash report details:

No PHP errors found.

Filename: /var/crash/info.0
Dump header from device: /dev/da0p2
Architecture: amd64
Architecture Version: 4
Dump Length: 632832
Blocksize: 512
Compression: none
Dumptime: 2024-07-09 19:36:31 +0330
Hostname: ipsfw.securenss.com
Magic: FreeBSD Text Dump
Version String: FreeBSD 15.0-CURRENT #0 plus-RELENG_24_03-n256311-e71f834dd81: Fri Apr 19 00:28:14 UTC 2024
root@freebsd:/var/jenkins/workspace/pfSense-Plus-snapshots-24_03-main/obj/amd64/Y4MAEJ2R/var/j
Panic String: page fault
Dump Parity: 963414578
Bounds: 0
Dump Status: good


Files

textdump.tar.0 (618 KB) textdump.tar.0 Amin Sadeghi, 07/14/2024 07:32 AM
textdump.tar (1).0 (618 KB) textdump.tar (1).0 Amin Sadeghi, 07/24/2024 02:50 PM

Related issues

Related to Bug #15457: HAproxy disable zero copy forwardingResolved

Actions
Actions #1

Updated by Marcos M 6 months ago

  • Category changed from PHP Interpreter to Operating System
  • Status changed from New to Incomplete
  • Priority changed from High to Normal

There's not enough information to investigate further. There should be a compressed file containing the backtrace along with the panic.

Actions #2

Updated by Amin Sadeghi 6 months ago

Actions #3

Updated by Amin Sadeghi 6 months ago

Marcos M wrote in #note-1:

There's not enough information to investigate further. There should be a compressed file containing the backtrace along with the panic.

Hi Marcos,

I've attached the dump file containing the backtrace and the panic details for further investigation.

Thanks for your assistance.

Best regards,

Actions #4

Updated by Amin Sadeghi 6 months ago

everyday crash and very unstable

Actions #5

Updated by Marcos M 6 months ago

  • Project changed from pfSense Plus to pfSense Packages
  • Subject changed from Crash report begins. fSense-Plus-snapshots-24_03-main to HAproxy causes system panic
  • Category changed from Operating System to haproxy
  • Status changed from Incomplete to Confirmed
  • Release Notes deleted (Default)

A similar/related issue is supposedly fixed with an haproxy update from upstream:
https://redmine.pfsense.org/issues/15457

See discussion:
https://forum.netgate.com/topic/188861/
There's a request there for a full core dump to aid in further troubleshooting.

Backtrace and panic from attached dumps:

db:1:pfs> bt
Tracing pid 2 tid 100149 td 0xfffff80005d3d740
kdb_enter() at kdb_enter+0x33/frame 0xfffffe00fda178f0
panic() at panic+0x43/frame 0xfffffe00fda17950
trap_fatal() at trap_fatal+0x40f/frame 0xfffffe00fda179b0
trap_pfault() at trap_pfault+0x4f/frame 0xfffffe00fda17a10
calltrap() at calltrap+0x8/frame 0xfffffe00fda17a10
--- trap 0xc, rip = 0xffffffff80f246e2, rsp = 0xfffffe00fda17ae0, rbp = 0xfffffe00fda17b70 ---
tcp_m_copym() at tcp_m_copym+0x62/frame 0xfffffe00fda17b70
tcp_default_output() at tcp_default_output+0x1294/frame 0xfffffe00fda17d60
tcp_timer_rexmt() at tcp_timer_rexmt+0x53c/frame 0xfffffe00fda17dc0
tcp_timer_enter() at tcp_timer_enter+0x101/frame 0xfffffe00fda17e00
softclock_call_cc() at softclock_call_cc+0x12e/frame 0xfffffe00fda17ec0
softclock_thread() at softclock_thread+0xe9/frame 0xfffffe00fda17ef0
fork_exit() at fork_exit+0x7f/frame 0xfffffe00fda17f30
fork_trampoline() at fork_trampoline+0xe/frame 0xfffffe00fda17f30
--- trap 0, rip = 0, rsp = 0, rbp = 0 ---

Fatal trap 12: page fault while in kernel mode
cpuid = 12; apic id = 10
fault virtual address    = 0x1c
fault code        = supervisor read data, page not present
instruction pointer    = 0x20:0xffffffff80f246e2
stack pointer            = 0x28:0xfffffe00fda17ae0
frame pointer            = 0x28:0xfffffe00fda17b70
code segment        = base 0x0, limit 0xfffff, type 0x1b
            = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags    = interrupt enabled, resume, IOPL = 0
current process        = 2 (clock (12))
rdi: 0000000000000000 rsi: 0000000000000000 rdx: fffffe00fda17cf8
rcx: 0000000000000000  r8: 0000000000000598  r9: 0000000000000000
rax: 0000000000000000 rbx: 0000000000000000 rbp: fffffe00fda17b70
r10: 0000000000001fe7 r11: 000000000000df51 r12: 0000000000000000
r13: 0000000000000598 r14: fffff805dfc21a80 r15: 0000000000000034
trap number        = 12
panic: page fault
cpuid = 12
time = 1721814976
KDB: enter: panic
Actions #6

Updated by Marcos M 6 months ago

  • Related to Bug #15457: HAproxy disable zero copy forwarding added
Actions #7

Updated by Steve Wheeler 5 months ago

  • Status changed from Confirmed to Duplicate

This appears to be unrelated to HAProxy directly. Simply that it's more likely to occur when HAProxy is running. See: https://redmine.pfsense.org/issues/15684

Actions

Also available in: Atom PDF