- CheckMates
- :
- Products
- :
- CloudMates Products
- :
- Cloud Network Security
- :
- Discussion
- :
- AWS standby gateway increasing memora usage till c...
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Mute
- Printer Friendly Page
Are you a member of CheckMates?
×- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
AWS standby gateway increasing memora usage till can't be accessed via SSH or AWS console.
Hello CheckMates.
I have a cluster deployed in AWS cloud and noticed that within some period of time the standby (Member B) member presents error in Smartconsole and it's not possibel to access via SSH nor via AWS console.
In AWS we can check the status check and it shows 2/3 checks passed, if we just try to reboot via AWS it does not work and we must stop the instance and start it again.
It takes around 24h tops to present the issue again.
I used the following command to check the memory usage.
In the file I have noticed a lot of “python3 /opt/CPsuite-R81.20/fw1/scripts/cloudwatch.py” which one using 5.3MB. The last time i run the command to get the output it jumped from 403 occurences to 465 in approximatly 45 minutes.
Have you ever see this kind of behavior?
Note:
R81.20 on JHF take 98
Funny fact, before apply the JHF the member which has the issue was Member A and after the JHF the issue seems to be "migrated" to Member B.
I can't forget to quote that there is an ongoing SR.
Regards
P.S.: Attached has a print screen of the AWS check and the memory use when alread have lost the connectivity.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
I assume the TAC case is related to this issue?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Yes, the case is realted to this issue.
Already uploaded the CPINFO after reboot, and a second one after few hours with memory already in high use. Also sent the HCP output and the files created after followed sk35496.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Take 99 addresses some memory issues but best to consult TAC to properly diagnose / validate this is your issue.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Sorry for the delay to post an update.
After a long TAC case the solution arrived, the issue was the cloudwatch.py script which has infinite runs causing the memory use increase till the FW get "stuck".
Checkpoint sent a new cloudwatch.py script to replace the original one.
The steps:
1. Stop the Cloudwatch cron calls using:
/sbin/cloudwatch stop
2. Reboot the member.
3.Backup the old cloudwatch.py using:
mv $FWDIR/scripts/cloudwatch.py{,.bck}
4.Copy the new cloudwatch.py file to $FWDIR/scripts
5.Start Cloudwatch again using:
/sbin/cloudwatch start
Voilà, Problem solved!!! 😀
