<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Performance issue on R80.30 Take 219 - SND cores heavily used in General Topics</title>
    <link>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/104079#M19992</link>
    <description>&lt;P&gt;R80.40 or R81 with Dynamic Workloads as well as the ability to use multiqueue on more than 5 interfaces might be helpful here.&lt;/P&gt;</description>
    <pubDate>Thu, 03 Dec 2020 02:28:08 GMT</pubDate>
    <dc:creator>PhoneBoy</dc:creator>
    <dc:date>2020-12-03T02:28:08Z</dc:date>
    <item>
      <title>Performance issue on R80.30 Take 219 - SND cores heavily used</title>
      <link>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/103829#M19948</link>
      <description>&lt;P&gt;Hello to all,&lt;/P&gt;&lt;P&gt;This is my first post to this community, it's a privilege though and I'm confident that the high level of technical skillset will be of great assistance.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;I've recently updated our R80.30 cluster to Take 219 JHF, because we had a lot of private fixes that got included to this and to gain overall performance. The cluster consists of 2x 23900 in Active-Standby with multiple 10G interfaces and multiqueue enabled (plus some static affinity configured) and hyperthreading disabled.&lt;/P&gt;&lt;P&gt;So the above basically translated to 6 CPUs for SND and 30 for CoreXL. From 6 SND, 4 are multiqueue allocated and 2 are responsible for physical remaining interfaces:&lt;/P&gt;&lt;P&gt;[Expert@WALL1.2:0]# cpmq get&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Active ixgbe interfaces:&lt;/P&gt;&lt;P&gt;eth1-01 [On]&lt;/P&gt;&lt;P&gt;eth1-02 [Off]&lt;/P&gt;&lt;P&gt;eth4-01 [On]&lt;/P&gt;&lt;P&gt;eth4-02 [Off]&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Active igb interfaces:&lt;/P&gt;&lt;P&gt;Mgmt [Off]&lt;/P&gt;&lt;P&gt;Sync [Off]&lt;/P&gt;&lt;P&gt;eth2-01 [Off]&lt;/P&gt;&lt;P&gt;eth2-02 [Off]&lt;/P&gt;&lt;P&gt;eth2-03 [On]&lt;/P&gt;&lt;P&gt;eth2-04 [On]&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="uiOutputText"&gt;[&lt;/SPAN&gt;&lt;A href="mailto:Expert@WALL1.2" target="_blank" rel="noopener"&gt;Expert@WALL1.2&lt;/A&gt;&lt;SPAN class="uiOutputText"&gt;:0]# fw ctl affinity -l -r | grep -e CPU\ 4 -e CPU\ 5&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="uiOutputText"&gt;CPU 4: eth4-02 Sync eth2-01&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;&lt;SPAN class="uiOutputText"&gt;CPU 5: Mgmt eth1-02 eth2-02&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;We had also configured 4 RX queues for multi queue. Since yesterday morning though, we experienced heavy utilization on the CPU cores 4 &amp;amp; 5, which at the end impacted all traffic traversing these interfaces. After a lot of troubleshooting, including CP support, what we ended configuring was changing the interfaces participating in multi queue (since there is a limitation of only 5 that this can be enabled), reconfigured CoreXL from 30 to 28 and statically configuring affinity for eth1-01 and eth4-01 to the newly introduced SND cores:&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;[Expert@WALL1.1:0]# sim affinity -l&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth4-01 : 6&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth2-01 : 4&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth1-01 : 7&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth2-02 : 5&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Mgmt : 5&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Sync : 4&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Multi queue interfaces: eth1-02 eth4-02 eth2-03 eth2-04&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;[Expert@WALL1.1:0]# fw ctl affinity -l -r&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 0:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 1:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 2:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 3:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 4: Sync eth2-01&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 5: Mgmt eth2-02&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 6: eth4-01&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;CPU 7: eth1-01&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;[Expert@WALL1.1:0]# cpmq get &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Active ixgbe interfaces:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth1-01 [Off] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth1-02 [On] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth4-01 [Off] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth4-02 [On] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p2"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Active igb interfaces:&lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Mgmt [Off] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Sync [Off] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth2-01 [Off] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth2-02 [Off] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth2-03 [On] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;eth2-04 [On] &lt;/SPAN&gt;&lt;/P&gt;&lt;P class="p1"&gt;&amp;nbsp;&lt;/P&gt;&lt;P class="p1"&gt;&lt;SPAN class="s1"&gt;Now, even though performance is way way better, the SND cores are still used quite enough and we have not reached yet the day's peak, e.g.:&lt;/SPAN&gt;&lt;/P&gt;&lt;P&gt;[Expert@WALL1.1:0]# cpstat -f multi_cpu os&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Processors load&lt;BR /&gt;---------------------------------------------------------------------------------&lt;BR /&gt;|CPU#|User Time(%)|System Time(%)|Idle Time(%)|Usage(%)|Run queue|Interrupts/sec|&lt;BR /&gt;---------------------------------------------------------------------------------&lt;BR /&gt;| 1| 0| 38| 62| 38| ?| 87408|&lt;BR /&gt;| 2| 0| 52| 48| 52| ?| 87409|&lt;BR /&gt;| 3| 0| 35| 65| 35| ?| 87409|&lt;BR /&gt;| 4| 0| 31| 69| 31| ?| 87410|&lt;BR /&gt;| 5| 0| 26| 74| 26| ?| 87411|&lt;BR /&gt;| 6| 0| 42| 58| 42| ?| 87411|&lt;BR /&gt;| 7| 0| 17| 83| 17| ?| 87412|&lt;BR /&gt;| 8| 0| 47| 53| 47| ?| 87413|&lt;/P&gt;&lt;P&gt;Also, SecureXL seems to do a decent job:&lt;/P&gt;&lt;P&gt;[Expert@WALL1.1:0]# fwaccel stats -s&lt;BR /&gt;Accelerated conns/Total conns : 5212/40836 (12%)&lt;BR /&gt;Accelerated pkts/Total pkts : 466598460/846769916 (55%)&lt;BR /&gt;F2Fed pkts/Total pkts : 87904870/846769916 (10%)&lt;BR /&gt;F2V pkts/Total pkts : 4047817/846769916 (0%)&lt;BR /&gt;CPASXL pkts/Total pkts : 0/846769916 (0%)&lt;BR /&gt;PSLXL pkts/Total pkts : 292266586/846769916 (34%)&lt;BR /&gt;QOS inbound pkts/Total pkts : 0/846769916 (0%)&lt;BR /&gt;QOS outbound pkts/Total pkts : 0/846769916 (0%)&lt;BR /&gt;Corrected pkts/Total pkts : 0/846769916 (0%)&lt;/P&gt;&lt;P&gt;[Expert@WALL1.1:0]# fwaccel stat&lt;BR /&gt;+-----------------------------------------------------------------------------+&lt;BR /&gt;|Id|Name |Status |Interfaces |Features |&lt;BR /&gt;+-----------------------------------------------------------------------------+&lt;BR /&gt;|0 |SND |enabled |Mgmt,Sync,eth2-01, |&lt;BR /&gt;| | | |eth2-02,eth2-03,eth2-04, |&lt;BR /&gt;| | | |eth4-01,eth4-02,eth1-01, |&lt;BR /&gt;| | | |eth1-02,pimreg0 |Acceleration,Cryptography |&lt;BR /&gt;| | | | |Crypto: Tunnel,UDPEncap,MD5, |&lt;BR /&gt;| | | | |SHA1,NULL,3DES,DES,CAST, |&lt;BR /&gt;| | | | |CAST-40,AES-128,AES-256,ESP, |&lt;BR /&gt;| | | | |LinkSelection,DynamicVPN, |&lt;BR /&gt;| | | | |NatTraversal,AES-XCBC,SHA256 |&lt;BR /&gt;+-----------------------------------------------------------------------------+&lt;/P&gt;&lt;P&gt;Accept Templates : enabled&lt;BR /&gt;Drop Templates : enabled&lt;BR /&gt;NAT Templates : enabled&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;So, after all the above, my question is: since this seems to be a connection handling case, how do I get what is causing all this CPU utilization? How can I understand what traffic could be causing this? During the high load, I've done a tcpdump &amp;amp; cpmonitor debug to try and identify traffic that is new (?) or could break acceleration, but to no avail. I've also added some fast_accel rules for backup traffic and CIFS (CIFS was already in place), but was not able to identify any performance gains. And yes, this gateway cluster is indeed responsible for VPN and i've already configured&amp;nbsp;cphwd_medium_path_qid_by_mspi=0, which really skyrocketed VPN performance.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thank anybody for just reading the above.&lt;/P&gt;</description>
      <pubDate>Tue, 01 Dec 2020 09:53:56 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/103829#M19948</guid>
      <dc:creator>Nick_Vidiadakis</dc:creator>
      <dc:date>2020-12-01T09:53:56Z</dc:date>
    </item>
    <item>
      <title>Re: Performance issue on R80.10 Take 219 - SND cores heavily used</title>
      <link>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/103840#M19949</link>
      <description>&lt;P&gt;Just a general suggestion: What about an upgrade to R80.40 or at least R80.30 ?&lt;/P&gt;</description>
      <pubDate>Tue, 01 Dec 2020 09:46:33 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/103840#M19949</guid>
      <dc:creator>G_W_Albrecht</dc:creator>
      <dc:date>2020-12-01T09:46:33Z</dc:date>
    </item>
    <item>
      <title>Re: Performance issue on R80.10 Take 219 - SND cores heavily used</title>
      <link>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/103842#M19950</link>
      <description>&lt;P&gt;&amp;nbsp;Very sorry, did a typo: the version is R80.30, not R80.10. I've corrected the title and the post.&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 01 Dec 2020 09:54:47 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/103842#M19950</guid>
      <dc:creator>Nick_Vidiadakis</dc:creator>
      <dc:date>2020-12-01T09:54:47Z</dc:date>
    </item>
    <item>
      <title>Re: Performance issue on R80.30 Take 219 - SND cores heavily used</title>
      <link>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/104079#M19992</link>
      <description>&lt;P&gt;R80.40 or R81 with Dynamic Workloads as well as the ability to use multiqueue on more than 5 interfaces might be helpful here.&lt;/P&gt;</description>
      <pubDate>Thu, 03 Dec 2020 02:28:08 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/104079#M19992</guid>
      <dc:creator>PhoneBoy</dc:creator>
      <dc:date>2020-12-03T02:28:08Z</dc:date>
    </item>
    <item>
      <title>Re: Performance issue on R80.30 Take 219 - SND cores heavily used</title>
      <link>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/104197#M20025</link>
      <description>&lt;P&gt;Gotta agree with Phoneboy and Gunter here; instead of trying to juggle manual Multi-Queue settings to stay under the 5 interface limit in Gaia 2.6.18, upgrade to R80.40 which has Gaia 3.10 and Multi-Queue enabled by default on all interfaces except the defined management interface, with no 5 interface limit.&amp;nbsp; You could spend many hours trying to figure out what is happening here, but going down the road of manual affinities is generally not a good idea if you can avoid it.&amp;nbsp; Your time is better spent upgrading.&lt;/P&gt;</description>
      <pubDate>Thu, 03 Dec 2020 12:53:40 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/General-Topics/Performance-issue-on-R80-30-Take-219-SND-cores-heavily-used/m-p/104197#M20025</guid>
      <dc:creator>Timothy_Hall</dc:creator>
      <dc:date>2020-12-03T12:53:40Z</dc:date>
    </item>
  </channel>
</rss>

