I have a R80.10 cluster operating which I want to upgrade, however when we fail traffic onto the secondary node three business critical VPNs stop receiving traffic. The issue I am seeing does not appear on my R80.40 cluster, but I can't find any notes suggesting changes to the behaviour. I would like to understand what happens to site-to-site VPNs at a cluster failover.
Specifically, on the R80.40 cluster I noted the VPNs dropped and re-established when the cluster failed over. On the R80.10 cluster there were no such logs. Looking at the firewall it was like it just handed the traffic over from one to the other, but ultimately traffic was not making it to the destination host, and I was unable to determine if it was in face making it to the firewall, as I had a very limited outage window. All other traffic on all interfaces correctly failed over, including other (non-VPN) traffic on the same interface the VPN exits from worked perfectly. Only the VPN traffic was impacted.
When a cluster fails over should a VPN drop and re-establishment be expected?
If this does not happen should manually forcing them to drop through VPN TU option 9 (Delete all IPsec SAs for ALL peers and users) work?
Is there anything specific I can look at in the config to determine how the VPNs may behave at failover?
Are there any recommended commands for monitoring this during the failover?
My plan is to schedule another test / outage window but once again there will be strict limits on the time I have available for testing / roll-back so I need to be sure of everything I may need to in advance.
Thanks Matt