Bug #14891
open
High CPU usage when interface get down and up due to proces check_reload_status
Added by Thijs K about 1 year ago.
Updated 25 days ago.
Affected Architecture:
amd64
Description
Today I noticed that the cpu usage was high on my pfSense appliance (N5105, I226).
After looking in top I see that check_reload_status is fully taxing one core.
This process seems to be triggered when the wan interface comes down and up.
The process keeps running and taxing the CPU until it is manually stopped.
Files
I've seen this issue on two different 4100 units in the past 2 days (Nov 21 and 22).
It seems to prevent the WAN connections / routing from updating properly which causes the affected WAN gateway to remain offline even though pfSense can ping the gateway, and I can remotely login to the firewall using interface with the "offline" gateway. This also prevents the DNS resolver (unbound) from sending any DNS lookups (as confirmed via packet capture).
Disabling and re-enabling the affected interface seems to clear the issue, as does killing the check_reload_status process using command line "kill -s KILL <pid>".
This may be a recurring issue on one firewall that I didn't know what to look for until now.
It seems to occur when a WAN connection goes up and down several times in a short time period.
Both units are running 23.05.1
- Priority changed from High to Normal
If this can be reproduced on 24.11, do the following to get more info:
- SSH into the device
- Get the process ID, e.g. with
pgrep check_reload_staus
or top
- Run
procstat -[f|t|kk] <process ID> > <output file path>
, e.g. procstat -f 674 > /tmp/output_ps1.txt
, procstat -t 674 > /tmp/output_ps2.txt
, and procstat -kk 674 > /tmp/output_ps3.txt
- Run
truss -o <output file path> -fDp <process ID>
, e.g. truss -o /tmp/output_truss.txt -fDp 674
- Wait a moment; if it's still going, quit with
Ctrl
+C
Upload the output files here for review.
Also available in: Atom
PDF