<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Skyline - Interface Utilization Alerts in OpenTelemetry/Skyline</title>
    <link>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204503#M312</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;originally I used this dashboard from&amp;nbsp;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/11456"&gt;@Kaspars_Zibarts&lt;/a&gt;&amp;nbsp; and slightly modified it and fixed some queries (e.g. top CPU)&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.checkpoint.com/t5/Telemetry/Cluster-Dashboard-Skyline-alternative/td-p/164601" target="_blank"&gt;Cluster Dashboard - Skyline alternative - Check Point CheckMates&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Tue, 30 Jan 2024 08:46:13 GMT</pubDate>
    <dc:creator>Alexander_Wilke</dc:creator>
    <dc:date>2024-01-30T08:46:13Z</dc:date>
    <item>
      <title>Skyline - Interface Utilization Alerts</title>
      <link>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204216#M305</link>
      <description>&lt;P&gt;Hi team,&amp;nbsp;I am trying to setup an alert for our Interface utilization in Skyline, one example would be for utilization &amp;gt;50%.. Below you can see how we did this on our previous Grafana instance (running with InfluxDB &amp;amp; Telegraf). First, we had to distinguish between 1G and 10G interfaces, then create two conditions for alerting to facilitate the different types of NICs – not pretty but if works for our almost 1800 interfaces.&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Dorian_0-1706236829812.png" style="width: 400px;"&gt;&lt;img src="https://community.checkpoint.com/t5/image/serverpage/image-id/24236i52035836A57CF256/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Dorian_0-1706236829812.png" alt="Dorian_0-1706236829812.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;Alert&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Dorian_1-1706236829816.png" style="width: 400px;"&gt;&lt;img src="https://community.checkpoint.com/t5/image/serverpage/image-id/24234iD3CF1EB9A533495D/image-size/medium?v=v2&amp;amp;px=400" role="button" title="Dorian_1-1706236829816.png" alt="Dorian_1-1706236829816.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;In Skyline I am yet to find how to implement something similar. I found that we can distinguish between "speed" in the metrics browser so I feel like something similar should be possible but I'm missing how to put it all together. How do you alert on Interface utilization?&amp;nbsp;&lt;/P&gt;&lt;P&gt;Cheers Dorian&lt;/P&gt;</description>
      <pubDate>Fri, 26 Jan 2024 03:13:47 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204216#M305</guid>
      <dc:creator>Dorian</dc:creator>
      <dc:date>2024-01-26T03:13:47Z</dc:date>
    </item>
    <item>
      <title>Re: Skyline - Interface Utilization Alerts</title>
      <link>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204266#M307</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;for this you have to find the proper PromQL query.&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;system_traffic_connections{speed} shows you the interface Speed.&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;you probably have to combine it or compare it with :&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;system_traffic_io_transmit&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;system_traffic_io_receive&lt;/SPAN&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;I personally do not monitor the Interface speed because in my case this is irrelevant. In general I am more interested in throughput:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV&gt;
&lt;DIV&gt;&lt;SPAN&gt;ceil&lt;/SPAN&gt;&lt;SPAN&gt;((&lt;/SPAN&gt;&lt;SPAN&gt;avg_over_time&lt;/SPAN&gt;&lt;SPAN&gt;(&lt;/SPAN&gt;&lt;SPAN&gt;system_network_interface_io_transmit_rate&lt;/SPAN&gt;&lt;SPAN&gt;{&lt;/SPAN&gt;&lt;SPAN&gt;host_name&lt;/SPAN&gt;&lt;SPAN&gt;=~&lt;/SPAN&gt;&lt;SPAN&gt;".*"&lt;/SPAN&gt;&lt;SPAN&gt;, &lt;/SPAN&gt;&lt;SPAN&gt;interface&lt;/SPAN&gt;&lt;SPAN&gt;=&lt;/SPAN&gt;&lt;SPAN&gt;"TOTAL"&lt;/SPAN&gt;&lt;SPAN&gt;}[&lt;/SPAN&gt;&lt;SPAN&gt;15m&lt;/SPAN&gt;&lt;SPAN&gt;])&lt;/SPAN&gt; &lt;SPAN&gt;/&lt;/SPAN&gt; &lt;SPAN&gt;1024&lt;/SPAN&gt; &lt;SPAN&gt;/&lt;/SPAN&gt; &lt;SPAN&gt;1024&lt;/SPAN&gt; &lt;SPAN&gt;)&lt;/SPAN&gt; &lt;SPAN&gt;&amp;gt;=&lt;/SPAN&gt; &lt;SPAN&gt;8000&lt;/SPAN&gt;&lt;SPAN&gt;)&lt;/SPAN&gt; &lt;SPAN&gt;## &amp;gt;= XXX Firewall throughput bigger than X, convert from bits to Mbit and without any decimals over the last 15 minutes&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&lt;SPAN&gt;I only use "transmit" because everything which is sent came in at some time.&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&lt;SPAN&gt;count by (host_name)( ### counts and groups by host_name&lt;BR /&gt;sum by (cpu,host_name) (&lt;BR /&gt;100 - avg_over_time(system_cpu_utilization{host_name=~".*", state="idle"}[5m]) ## 100% - CPU Idle avg over 5 minutes = cpu usage over 5 minutes&lt;BR /&gt;)&amp;gt;= 70 ### shows all CPU cores and hostnames which load is higher tha 70 percent&lt;BR /&gt;) &amp;gt;= 3 ### only shows systems where cpu core usage is higher than 70% for at least 3 cpu cores at the same time&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&lt;SPAN&gt;For promql questions I would suggest:&lt;BR /&gt;&lt;A href="https://groups.google.com/g/prometheus-users" target="_blank"&gt;Prometheus Users - Google Groups&lt;/A&gt;&lt;BR /&gt;&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;DIV&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;DIV&gt;&lt;SPAN&gt;&lt;A href="https://promlabs.com/promql-cheat-sheet/" target="_blank"&gt;PromLabs | PromQL Cheat Sheet&lt;/A&gt;&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;/DIV&gt;</description>
      <pubDate>Fri, 26 Jan 2024 14:37:54 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204266#M307</guid>
      <dc:creator>Alexander_Wilke</dc:creator>
      <dc:date>2024-01-26T14:37:54Z</dc:date>
    </item>
    <item>
      <title>Re: Skyline - Interface Utilization Alerts</title>
      <link>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204465#M311</link>
      <description>&lt;P&gt;Thanks&amp;nbsp;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/6901"&gt;@Alexander_Wilke&lt;/a&gt;&amp;nbsp;- much appreciated.. haven't played around with &lt;SPAN&gt;PromQL&amp;nbsp;yet but suspected that might be the answer to my question..&lt;BR /&gt;Interface speed is irrelevant&amp;nbsp;for us too.. we used that in our old environment to distinguish&amp;nbsp;between alerts/thresholds for 1GB &amp;amp; 10G interfaces. I will try what you've described above for the interface utilization&amp;nbsp;- when you get a chance would you please mind sharing a screenshot of query &amp;amp; alert setup? Thanks again for your help&amp;nbsp;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 29 Jan 2024 20:54:08 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204465#M311</guid>
      <dc:creator>Dorian</dc:creator>
      <dc:date>2024-01-29T20:54:08Z</dc:date>
    </item>
    <item>
      <title>Re: Skyline - Interface Utilization Alerts</title>
      <link>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204503#M312</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;originally I used this dashboard from&amp;nbsp;&lt;a href="https://community.checkpoint.com/t5/user/viewprofilepage/user-id/11456"&gt;@Kaspars_Zibarts&lt;/a&gt;&amp;nbsp; and slightly modified it and fixed some queries (e.g. top CPU)&lt;/P&gt;
&lt;P&gt;&lt;A href="https://community.checkpoint.com/t5/Telemetry/Cluster-Dashboard-Skyline-alternative/td-p/164601" target="_blank"&gt;Cluster Dashboard - Skyline alternative - Check Point CheckMates&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 30 Jan 2024 08:46:13 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/204503#M312</guid>
      <dc:creator>Alexander_Wilke</dc:creator>
      <dc:date>2024-01-30T08:46:13Z</dc:date>
    </item>
    <item>
      <title>Re: Skyline - Interface Utilization Alerts</title>
      <link>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/206242#M323</link>
      <description>&lt;P&gt;Good morning Alex &amp;amp; sorry for the delayed reply. Thank you very much for sending that through - vielen Dank fellow German Checkmate &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;BR /&gt;I finally managed to spend some time on this - I replicated your alerting query which works perfect however I might be missing something - using the interface "Total" and setting a threshold of 5000 will cater for a &amp;gt;5GB alert for total throughput of your firewall correct? I've set the wildcard for the interface instead to query all of our 1800 interfaces however setting a threshold of 500 or 5000 would only cater for 50% alerting for 1G or 10G interfaces - one or the other.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;system_network_interface_state&lt;/STRONG&gt; allows me to select the label speed which I could use to query 1G or 10G interfaces - there must be a way to combine this with &lt;STRONG&gt;transmit&lt;/STRONG&gt;.&amp;nbsp;&lt;BR /&gt;&lt;BR /&gt;Pre skyline we had a Grafana panel with two queries - first one to identify throughput for all 1G interfaces &amp;amp; second one for the 10G interfaces - then two alerting conditions to cater for each query which caters for each Interface type/speed (ie. 500 for 1G &amp;amp; 5000 for 10G resulting in a 50% interface utilization alert) - hoping this explanation makes sense. Apologies if I'm missing something obvious here.&lt;BR /&gt;&lt;BR /&gt;Thanks again for your time mate.&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 15 Feb 2024 22:22:13 GMT</pubDate>
      <guid>https://community.checkpoint.com/t5/OpenTelemetry-Skyline/Skyline-Interface-Utilization-Alerts/m-p/206242#M323</guid>
      <dc:creator>Dorian</dc:creator>
      <dc:date>2024-02-15T22:22:13Z</dc:date>
    </item>
  </channel>
</rss>

