Thank you for the reply.
Version is R80.40, Take 102 for management and gateways.
IPs are currently 'fake' as I'm doing tests in a lab which simulates the end result as much as possible.
ISP1 = 213.165.x.x
ISP2 = 195.158.x.x (this one is behind an interlink, so sometimes you will see 192.168.x.x)
Azure ISP: 13.13.x.x
Routing Config:
GW1 (on prem)
- ISP Redundancy in HA
- Primary: ISP 2 (195.158)
- Secondary: ISP 1 (213.165)
- Apply settings to VPN - DISABLED
- Static routes to GW4 - ISP 1:
- Via ISP 1 – preference 3
- Via ISP 2 – preference 6
- Static routes to GW4 - ISP 2:
- Via ISP 2 – preference 3
- Via ISP 1 – preference 6
GW4 (on prem)
- ISP Redundancy in HA
- Primary: ISP 2 (195.158)
- Secondary: ISP 1 (213.165)
- Apply settings to VPN - DISABLED
- Static routes to GW 1 - ISP 1:
- Via ISP 1 – preference 3
- Via ISP 2 – preference 6
- Static routes to GW1 - ISP 2:
- Via ISP 2 – preference 3
- Via ISP 1 – preference 6
CP-Cluster (Azure)
- Not actually a Cluster.
- 1 ISP (13.13.x.x)
- No special static routes
VPN config:
GW1:
- IP selection by remote peer: Use probing – HA – ISP 1 primary
- Outgoing route selection: Route based probing
- Source IP address setting: Automatic
GW4
- IP selection by remote peer: Use probing – HA – ISP 1 primary
- Outgoing route selection: Route based probing
- Source IP address setting: Automatic
CP-Cluster
- selection by remote peer: Selected from topology table: 13.13.x.x
- Outgoing route selection: OS routing table
- Setup: use outgoing traffic configuration
- Source IP address setting: Automatic
Results:
For GW1-to-GW4 and GW4-to-GW1, the tunnel is up and both ends are using ISP1.
Then for CPGW01-to-CPCluster, the next-hop IP is 192.168.x.x, ie ISP2 – cool this is what we want.
But for CPCluster-to-CPGW1, the Peer IP is ISP1. This is confirmed from TCPdumps (taken from GW1. GW1 is talking to 13.13.x.x via 192.168.x.x - ISP2, but CPCluster is talking back to 213.165.x.x - ISP1).
(I will try to post screenshots, but for some reason it's not working yet...)
So the VPN is coming up asymmetrically. This asymmetry is causing issues elsewhere.
We need to somehow inform CPCluster that the peer address should be 195.158.x.x, then fall back to 213.165.x.x if need be.
Should I still go through the different steps I took during testing?
Steven.