Hi All,
We have been having some performance issues on our Production firewall, so decided to run some Load tests.
Important Info
- Running R80.10
- VSX
- 23800 appliances - clustered in a pair
- SecureXL enabled
- CoreXL enabled
- IPS, IA are blades enabled
- Running VSLS, VS's are split between the two appliances, 5 active on each, with other 5 in standby
- Hyper-Threading - disabled
- Of the 24 cores, 6 - SND, 18 - FWK
- Mix of 10GB and 1 GB interfaces, grouped in bonds.
Some more specific info on the Load environment we are running the performance tests on.
This has 10 instances on this VS, when running the load test we move all the other VS's to be active on the other Gateway in the pair, so this is the only VS active on this gateway. This is done for "just in case" reason to protect production traffic. This VS has a bond of a pair of 1GB interfaces and no virtual switch.
The issue we get is when we try to push more than 800MB through it, we start to see latency between the load generators and the servers we are sending the load to.
The CPU is around 550% over the 10 instances, with the higher ones being 60 - 70%. We have done quite a lot of work to improve this balance, by sending the information from several IPs via a Load Balancer and now it is fairly evenly spread.
If I turn off SecureXL at this point, the latency disappears and we are able to push the load up to 880MB.
If I then enable drop optimisation and turn SecureXL back on the highest we get is 750MB.
If I leave secureXL on and turn off IPS - not much change, back to 800MB, CPU drops to 450%
I then turn Threat Prevention off, CPU drops to 20% overall, and throughput is 950MB
TP off and IPS on, CPU goes upto 130% and throughput is around 900MB
(Sorry this is so long, we do have a call open with TAC)...
So my question is - do we have an issue with SecureXL, as things improve when it is off and how could I troubleshoot that?
We are also planning an upgrade to R80.20, to get the much needed dynamic dispatcher for VSX...
Thanks....