<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic MDS root partition nearly full stopping mgmt HA sync in R80.10 in Firewall and Security Management</title>
    <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/MDS-root-partition-nearly-full-stopping-mgmt-HA-sync-in-R80-10/m-p/54510#M85761</link>
    <description>MDS root partition nearly full stopping mgmt HA sync in R80.10</description>
    <pubDate>Tue, 28 May 2019 08:30:14 GMT</pubDate>
    <dc:creator>Kaspars_Zibarts</dc:creator>
    <dc:date>2019-05-28T08:30:14Z</dc:date>
    <item>
      <title>MDS root partition nearly full stopping mgmt HA sync in R80.10</title>
      <link>https://community.checkpoint.com/t5/Firewall-and-Security-Management/MDS-root-partition-nearly-full-stopping-mgmt-HA-sync-in-R80-10/m-p/54510#M85761</link>
      <description>&lt;P&gt;Hi, been a long time since I have posted here, too busy &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Just stumbled across an interesting thing with an R80.10 take 142 MDS. We have an HA solution, and a couple of days ago sync suddenly stopped working, with a yellow warning in SmartConsole:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="mgmt_ha_sync_error.png" style="width: 454px;"&gt;&lt;img src="https://community.checkpoint.com/t5/image/serverpage/image-id/1376iA78C25FFEB9FCEF3/image-size/large?v=v2&amp;amp;px=999" role="button" title="mgmt_ha_sync_error.png" alt="mgmt_ha_sync_error.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;When I tried to sync it manually, the FWM process died on the primary MDS. Analysis showed that the root partition reached 100% during the sync.&lt;/P&gt;
&lt;P&gt;I did a manual check and saw a lot of disk space used in &lt;STRONG&gt;$MDSDIR/tmp/mgha&lt;/STRONG&gt;, so I cleaned it up manually, and after a reboot the MDS was functioning again.&lt;/P&gt;
&lt;P&gt;At this point we had 10GB free in the 100GB root partition. Another attempt to sync the MDS had the same result: the partition filled up with huge files in &lt;STRONG&gt;$MDSDIR/tmp/mgha&lt;/STRONG&gt;. So sync clearly required more than 10GB, but there was nothing obvious to clean up.&lt;/P&gt;
&lt;P&gt;I went into our lab and noticed that the same MDS in the lab environment had 40GB free of 100GB, which felt strange as the lab is a 100% replica of production. So I had two options: build a new VM with a bigger root partition, or try to salvage the existing VM that the MDS runs on, with the same 100GB root partition.&lt;/P&gt;
&lt;P&gt;Since I had similar disk usage on the secondary MDS, I decided to take a full backup and restore it on the same VM to see if it made any difference. And voila! After the backup and restore, root partition usage went down from 90% to 60%! That suggests the MDS stores a lot of temp data in the CMA directories, which a backup and restore seems to clean up.&lt;/P&gt;
&lt;P&gt;I then did the same on the primary MDS (took a backup and restored it on the same VM) and we were back in business: root partition usage dropped to 61%. Here's the disk usage before and after the restore:&lt;/P&gt;
&lt;P style="padding-left: 30px;"&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;[Expert@mds01:0]# df -h&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;Filesystem Size Used Avail Use% Mounted on&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;/dev/mapper/vg_splat-lv_current&amp;nbsp; &lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;97G&amp;nbsp; 83G&amp;nbsp; 9.2G&amp;nbsp; 90% /&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;/dev/sda1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 289M 24M&amp;nbsp; 251M&amp;nbsp; 9%&amp;nbsp; &lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;/boot&lt;BR /&gt;&lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;tmpfs&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 63G&amp;nbsp; 4.0K 63G&amp;nbsp; &amp;nbsp;1%&amp;nbsp; /dev/shm&lt;BR /&gt;&lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;/dev/mapper/vg_splat-lv_log&amp;nbsp; &amp;nbsp; &amp;nbsp; &lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;238G 91G&amp;nbsp; 135G&amp;nbsp; 41% /var/log&lt;/FONT&gt;&lt;/P&gt;
&lt;P style="padding-left: 30px;"&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;[Expert@mds01:0]# df -h&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;Filesystem Size Used Avail Use% Mounted on&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;/dev/mapper/vg_splat-lv_current&amp;nbsp;&amp;nbsp;&lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;&lt;STRONG&gt;&lt;FONT color="#FF0000"&gt;97G&amp;nbsp; 57G&amp;nbsp; 36G&amp;nbsp; &amp;nbsp;61% /&lt;/FONT&gt;&lt;/STRONG&gt;&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;/dev/sda1&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 289M 24M&amp;nbsp; 251M&amp;nbsp; 9% /boot&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;tmpfs&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 63G&amp;nbsp; 4.0K 63G&amp;nbsp; &amp;nbsp;1% /dev/shm&lt;/FONT&gt;&lt;BR /&gt;&lt;FONT face="courier new,courier" size="2"&gt;/dev/mapper/vg_splat-lv_log&amp;nbsp; &amp;nbsp; &amp;nbsp;&amp;nbsp;&lt;/FONT&gt;&lt;FONT face="courier new,courier" size="2"&gt;238G 109G 117G&amp;nbsp; 49% /var/log&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;After this, HA sync worked like clockwork, and I measured that it consumed 18GB of temp disk space in the root partition during the process! That roughly matches our backup size.&lt;/P&gt;
&lt;P&gt;Just wondering if anyone else has noticed anything like that? And a word of warning if you run MDS HA: have a look at the root partition usage and make sure you have enough free disk space to do a full sync.&lt;/P&gt;
&lt;P&gt;And for those running R80.20: I wonder if it is a bit more efficient with temp disk space during a full HA sync?&lt;/P&gt;
</description>
      <pubDate>Tue, 28 May 2019 08:30:14 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/Firewall-and-Security-Management/MDS-root-partition-nearly-full-stopping-mgmt-HA-sync-in-R80-10/m-p/54510#M85761</guid>
      <dc:creator>Kaspars_Zibarts</dc:creator>
      <dc:date>2019-05-28T08:30:14Z</dc:date>
    </item>
  </channel>
</rss>

