Environment:
Management: R80.20 T47
Gateway clusters: R77.30 T216
Hardware: 13800
There are two 10g interfaces. The CPU tied to ingress interface spikes to 100% during the busy times. Typical bandwidth (from cpview) is around 2 Gbps, concurrent connections during peak times is about 500K. Over 80% of the processing is done by SXL. Fw ctl multik stat show equal distribution of load among all workers. MQ is currently not enabled.
The other egress interface showing the same amount of traffic, hardly see spikes there, typically stays 60% or under (during peak).
Total of 20 CPUs with 14/6 split. Workers never show high load, normally stays around 20% (when the ingress CPU is spiking to 100%), otherwise stays at 5%. This looks like a typical example where dispatcher is loaded but workers are low. One recommendation is to move 2 workers to SND and enable MQ. This make sense to me but I have another similar cluster that has much less load (BW around 1 gig, and 200K concurrent connections) but showing same symptoms.
Questions are: Why the CPU tied to the egress interface does not show high load?
What else I can check to make sure what is actually causing the load on CPU?