I am just surprised there isn't a process to recover in case of a hardware failure so that you can get the central license back into management as detached so that you can push it to the replacement firewall quickly. I guess that would take a TAC case or something if the rare chance it happens. I know of 1 appliance with another company that failed about a year ago so I know this is rare but can happen. That was one with regular license generated for the IP of the firewall though and not a central license.
My thought process on this is to make sure that I have a valid license on all gateways at all times so that I can flip back to the old hardware very quickly in case of problems. I also of course need to make sure the check out / check in of the central license is done correctly. I also don't want to pull the license from the old gateway hardware and it go to a single core and need to deal with the core count changes, etc. Making use of a trial license sounds like that is the process to help with this. I had an issue a few years ago where the core count (fw workers) on R80.40 got locked to 1 even with a valid 8 core license on it. After a TAC case it was discovered that a file wasn't being changed correctly that restricted it. I am sure that is fixed by now, but I would rather not have the old hardware go to 1 core when I detach the central license from it if I can.
OldP = Old Primary
NewP = New Primary
OldS = Old Secondary
NewS = New Secondary
Goals with this procedure...
Minimize downtime and steps in case of rollback to old hardware.
Make sure license changes don't cause core count change (having a condition where removing the central license resulting in no license on old hardware)... which seems to require a reboot when that changes.
A valid license on both old and new gateways at all times in case of rollback needed.
Deal with the management and it's view of the central license (gateway licenses generated for management IP) needs to be in sync with the check in and check out to the gateways.
1. Verify that OldP is active in HA, MVC is enabled on NewS, and that same Gaia/OS settings are on Old* vs New*.
2.. Add the 2 trial licenses to management so that management is aware of them for the future.
3. Manually attach the 2 trial licenses to NewP and NewS on the console of the new gateways (which are off the network at this point) and reboot to make sure corexl fw worker core counts are correct. These are not central licenses so can manually be attached on the gateway itself.
4. Use SmartUpdate to attach the trial licenses to existing OldP and OldS. At this point OldP and OldS now have 2 licenses (existing central and new trial) so that one can be removed later on and there will still be a license at all times on the gateways in case of rollback needed.
5. Detach central license from OldS through Smart Update. (Trial license is still left on the OldS).
6. Physically disconnect network cables from OldS and Connecting to NewS.
7. Establish SIC to NewS
8. Attach central license to NewS using Smart Update from management that was previously removed from OldS. There are now 2 licenses on NewS (trial and central). Verify that gateway is seeing a valid license.
9. Change cluster version in management for gateway cluster to new version (R81.20).
10. Push policy (with checkbox unchecked during policy that allows only 1 firewall to succeed because of major Checkpoint version change)
11. Verifying that HA state is READY on NewP.
12. cphaprob stop on OldP to force secondary to go active
13. Test thoroughly NewS and verify logging, etc.
14. Detach central license from OldP. (The trial license is still left on OldP).
15. Physically disconnect network cables from OldP and connect to NewP.
16. Establish SIC to NewP
17. Attach central license to NewP using Smart Update from management that was previously removed from OldP. There are now 2 licenses on NewP (trial and central).
18. Push policy (checkbox back to being checked to require both firewalls to get policy)
19. Verify that state is active on NewS and backup on NewP
20. clusterxl_admin down on NewS to force NewP to go active, verify state and then clusterxl_admin up to allow secondary to be able to take over in case of problem.
21. test thoroughly NewP and verify logging.
22. Make sure to disable MVC on gateways.
Any critiques of that? I have always attached new licenses before removing old ones and haven't experienced any issues so I assume 2 licenses (if both are valid) will be ok for the short time during this to make sure there is at least 1 valid license at all times on the firewalls (even the old hardware).