I have a strange VPN Issue with Cloudguard R82.
Enviroment:
Azure: Cloudguard R82 T60+time fix (also tried T91) single gateway, 2 cores.
HQ: 6000 Appliance with R81.20 + SmartCenter R82 T91 (includes time-fix)
Setup/Changes
VPN Community between Azure Cloudguard and HQ Gatway been running fine for years.
Yesterday we upgraded SmartCenter to R82 and deployed new Cloudguard FW on R82.
Both gateways managed by the same SmartCenter.
Issue:
After installing a R82 Cloudguard, establishing SIC+license and policy push
IPSec VPN would just not work. Cloudguard R82 FW was sending "port unreachable" messages back to HQ FW.
I did a cpstop;cpstart on Cloudguard FW, then VPN was established but only for some networks.
At HQ we have a list of /24 networks only. On Cloudguard, we have one /16 network in encdomain.
However tunnel was established for various /30,/32,/28,/29 networks.. (supernetting in reverse).
Changed to "One VPN tunnel per gateway" on Community which seemed to work fine.
Then after a few hours, vpn stopped working again.. SA`s were up but "vpn tu tlist" showed tunnel as down.
A bunch of "port unreachable again" from Azure FW. Tried a "vpn tu" to reset tunnel with no change..
Did another cpstop;cpstart and it came up.. worked for 7-8 hours and it was down again.. for 45 mins until it suddently worked.
(We do have some reports that in these 7-8 hours there were several periods of 1,5,10-30 minutes of packetloss aswell)
This is the output of "vpn tu tlist" when issue is present, looks the same on both sides..
Then after 1,5,10,30 minutes its connected again..
+-----------------------------------------+----------------------------------+---------------------+
| Peer: IP-IN-AZURE - FWAzure | MSA: 7fe6e4631258 | i: 0 ref: 15 |
| Methods: ESP Tunnel AES-GCM-256 | | i: 1 ref: 15 |
| My TS: 0.0.0.0/0 | | i: 2 ref: 19 |
| Peer TS: 0.0.0.0/0 | | |
| MSPI: 1000298 (i: 2, p: 0, d: 0) | No outbound SPI | |
| Tunnel created: | NAT-T | |
| Tunnel expiration: | Disconnected | |
+-----------------------------------------+----------------------------------+---------------------+
Already have a TAC-case and several remote sessions already. Currently waiting for it to occur again to gather even more debugs.Only change was R82 Management + R82 Cloudguard. HQ FW have several other tunnels working just fine.
Anyone else experienced something like this?
The suspect here is definately the R82 Cloudguard..
CCSM / CCSE / CCVS / CCTE