<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Carlos&#039; Corner &#187; NAgios</title>
	<atom:link href="http://cars.lostroncos.org/category/nagios/feed/" rel="self" type="application/rss+xml" />
	<link>http://cars.lostroncos.org</link>
	<description>The tired geek-dad in the corner</description>
	<lastBuildDate>Wed, 12 May 2010 19:46:13 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.5</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Nagios ESX3i Scripts and ESX4i</title>
		<link>http://cars.lostroncos.org/2009/06/29/nagios-esx3i-scripts-and-esx4i/</link>
		<comments>http://cars.lostroncos.org/2009/06/29/nagios-esx3i-scripts-and-esx4i/#comments</comments>
		<pubDate>Mon, 29 Jun 2009 22:57:09 +0000</pubDate>
		<dc:creator>cars</dc:creator>
				<category><![CDATA[NAgios]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[monitoring]]></category>
		<category><![CDATA[3i]]></category>
		<category><![CDATA[4i]]></category>
		<category><![CDATA[ESX]]></category>
		<category><![CDATA[esx3i]]></category>
		<category><![CDATA[esx4i]]></category>
		<category><![CDATA[esxi]]></category>

		<guid isPermaLink="false">http://cars.lostroncos.org/?p=193</guid>
		<description><![CDATA[<p>This is just an update, but the scripts I wrote for use with Nagios and ESX3i seem to work just fine with ESX4i (at least in my home lab environment using Dell PowerEdge 2850s).</p>
]]></description>
			<content:encoded><![CDATA[<p>This is just an update, but the <a href="http://cars.lostroncos.org/2008/03/07/updated-esx-3i-scripts-for-nagios/">scripts I wrote for use with Nagios and ESX3i</a> seem to work just fine with ESX4i (at least in my home lab environment using Dell PowerEdge 2850s).</p>
]]></content:encoded>
			<wfw:commentRss>http://cars.lostroncos.org/2009/06/29/nagios-esx3i-scripts-and-esx4i/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Updated ESX 3i scripts for Nagios</title>
		<link>http://cars.lostroncos.org/2008/03/07/updated-esx-3i-scripts-for-nagios/</link>
		<comments>http://cars.lostroncos.org/2008/03/07/updated-esx-3i-scripts-for-nagios/#comments</comments>
		<pubDate>Fri, 07 Mar 2008 08:23:55 +0000</pubDate>
		<dc:creator>cars</dc:creator>
				<category><![CDATA[NAgios]]></category>
		<category><![CDATA[hardware]]></category>
		<category><![CDATA[monitoring]]></category>
		<category><![CDATA[virtualization]]></category>

		<guid isPermaLink="false">http://cars.lostroncos.org/2008/03/07/updated-esx-3i-scripts-for-nagios/</guid>
		<description><![CDATA[<p>I had a little time today and needed to mentally switch gears so I took a look again at the 3i storage script for Nagios and made some changes to it to clean it up a little bit. I also made clones&#8230;er &#8230;versions rather&#8230;. for monitoring CPU status, memory status, and sensor data (i.e. fan [...]]]></description>
			<content:encoded><![CDATA[<p>I had a little time today and needed to mentally switch gears, so I took another look at the 3i storage script for Nagios and cleaned it up a little. I also made clones&#8230; er, versions rather&#8230; for monitoring CPU status, memory status, and sensor data (fan speeds, power supply status, etc.). The new version takes one fewer command line option and connects to the host managed object a little more directly.</p>
<p>As discussed earlier, you could use IPMI just as easily as the VMware APIs to get the sensor data, but I added it for consistency&#8217;s sake rather than having to manage two interfaces and two sets of IDs and passwords. Since the only servers I have access to at the moment are new Dell PowerEdges, I can only assume that these scripts will also work with other manufacturers&#8217; servers as they become supported.</p>
<p>As a reminder, you need to install the appropriate VMware pieces for these scripts to run, namely the VMware VI-Perl toolkit.</p>
<p>Script arguments are the same for all four of these scripts:</p>
<p><a href="http://cars.lostroncos.org/wp-content/uploads/2008/03/check_3i_storage.txt" target="_blank">check_3i_storage</a> --password &lt;passwd&gt; --username &lt;user&gt; --server &lt;ip addr/hostname&gt;</p>
<p><a href="http://cars.lostroncos.org/wp-content/uploads/2008/03/check_3i_cpu.txt" title="check_3i_cpu" target="_blank">check_3i_cpu</a> --password &lt;passwd&gt; --username &lt;user&gt; --server &lt;ip addr/hostname&gt;</p>
<p><a href="http://cars.lostroncos.org/wp-content/uploads/2008/03/check_3i_memory.txt" target="_blank">check_3i_memory</a> --password &lt;passwd&gt; --username &lt;user&gt; --server &lt;ip addr/hostname&gt;</p>
<p><a href="http://cars.lostroncos.org/wp-content/uploads/2008/03/check_3i_sensors.txt" target="_blank">check_3i_sensors</a> --password &lt;passwd&gt; --username &lt;user&gt; --server &lt;ip addr/hostname&gt;</p>
<p>I’ve also included the updated service and command definitions I’ve used in my test Nagios environment for reference.</p>
<pre>define service{
use                             generic-service
host_name                       svr-esx-test-01.company.com
service_description             ESX 3i Storage Status
check_command                   check_3i_storage!root!mypassword!
}</pre>
<hr />
<pre>define command{
command_name    check_3i_storage
command_line    $USER1$/check_3i_storage --server $HOSTADDRESS$ --username $ARG1$ --password $ARG2$
}</pre>
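<p>Since all four scripts take the same arguments, definitions for the other three checks should follow the same pattern. What follows is only a sketch under that assumption: the command names simply reuse the script names, and the service description is an illustrative placeholder rather than something taken from a tested configuration.</p>
<pre># Sketch only: these definitions are assumed by analogy with check_3i_storage
define command{
command_name    check_3i_cpu
command_line    $USER1$/check_3i_cpu --server $HOSTADDRESS$ --username $ARG1$ --password $ARG2$
}

define command{
command_name    check_3i_memory
command_line    $USER1$/check_3i_memory --server $HOSTADDRESS$ --username $ARG1$ --password $ARG2$
}

define command{
command_name    check_3i_sensors
command_line    $USER1$/check_3i_sensors --server $HOSTADDRESS$ --username $ARG1$ --password $ARG2$
}

# Example service for one of the commands above (description is a placeholder)
define service{
use                             generic-service
host_name                       svr-esx-test-01.company.com
service_description             ESX 3i CPU Status
check_command                   check_3i_cpu!root!mypassword!
}</pre>
<p>Services for the memory and sensor checks would differ only in the service_description and the command name used in check_command.</p>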
]]></content:encoded>
			<wfw:commentRss>http://cars.lostroncos.org/2008/03/07/updated-esx-3i-scripts-for-nagios/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>Nagios Monitoring of ESX 3i on our PowerEdge</title>
		<link>http://cars.lostroncos.org/2008/02/26/nagios-monitoring-of-esx-3i-on-our-poweredge/</link>
		<comments>http://cars.lostroncos.org/2008/02/26/nagios-monitoring-of-esx-3i-on-our-poweredge/#comments</comments>
		<pubDate>Tue, 26 Feb 2008 20:06:33 +0000</pubDate>
		<dc:creator>cars</dc:creator>
				<category><![CDATA[NAgios]]></category>
		<category><![CDATA[hardware]]></category>
		<category><![CDATA[monitoring]]></category>

		<guid isPermaLink="false">http://cars.lostroncos.org/2008/02/26/nagios-monitoring-of-esx-3i-on-our-poweredge/</guid>
		<description><![CDATA[<p>In earlier posts when I mentioned the use of IPMI I don&#8217;t think I ever circled back around on where we stood with that. While we can get some information about the status of the hardware via IPMI I haven&#8217;t been able to figure out how to get at the array controller to get the [...]]]></description>
			<content:encoded><![CDATA[<p>In earlier posts I mentioned the use of IPMI, but I don&#8217;t think I ever circled back on where we stood with that. While we can get some information about the status of the hardware via IPMI, I haven&#8217;t been able to figure out how to get at the array controller to get its status. I&#8217;ve looked at sites like Nagios Exchange and some others, but none of them had what I was looking for. I found one script that came close, but it would only tell me when the RAID disk was actually rebuilding, not when it was degraded. If we were to use RAID 5 it might have been useful, but we&#8217;re talking about going forward with RAID 6 to minimize our window of vulnerability, and we&#8217;re using large disks. So that&#8217;s why I went back and looked at the VMware SDK and VI-Perl toolkit.</p>
<p>Now that I had a script that worked from the command line, I needed to make it work for Nagios, since that&#8217;s what we&#8217;ll probably end up using to monitor these servers once they&#8217;re deployed. (We&#8217;ll look at Dell&#8217;s IT Assistant again since we&#8217;re a dot version or two behind, but I&#8217;m not holding out high hopes.)</p>
<p><span id="more-51"></span></p>
<p>While the simple version of this script will tell us when a disk is missing (i.e. we pulled one out to test it), it won&#8217;t tell us which one is missing, though looking at the VI client can provide a clue. If we look at the <a href="http://cars.lostroncos.org/wp-content/uploads/2008/02/022208-0803-checkingsto1.png" target="_blank">screenshot from the earlier post</a> we can see Disks 0-9, except that Disk 7 is missing from the list. That&#8217;s the one we pulled. If the disk were bad, my assumption (yet to be tested/seen) is that the drive would show up with a status of RED. While it&#8217;d be possible to figure this out in the script, I would need to keep configuration data for each class of server (or even each individual server). I&#8217;m more interested in a general purpose script to alert us, and then leave it up to the responsible admin to figure out what&#8217;s going on.</p>
<p style="margin-left: 54pt"><img src="http://cars.lostroncos.org/wp-content/uploads/2008/02/022608-2006-nagiosmonit1.png" /></p>
<p>One nice thing that does happen, though: when the disk is put back in and the RAID set is rebuilding, we do see the newly inserted disk, because it stays in a Yellow state until the volume is rebuilt.</p>
<p><img src="http://cars.lostroncos.org/wp-content/uploads/2008/02/022608-2006-nagiosmonit2.png" /></p>
<p><span style="color: #4f81bd; font-size: 9pt"><strong>Figure 1 &#8211; The Nagios View<br />
</strong></span></p>
<p><img src="http://cars.lostroncos.org/wp-content/uploads/2008/02/022608-2006-nagiosmonit3.png" /></p>
<p><span style="color: #4f81bd"><strong><span style="font-size: 9pt">Figure 2 &#8211; The view from the VI client</span><br />
</strong></span></p>
<p>Anyhow, without further ado, <a href="http://cars.lostroncos.org/wp-content/uploads/2008/02/check_3i_storage.txt">here&#8217;s the script</a>. Again, it comes with the usual caveats: it&#8217;s rough, it mostly works, use at your own risk, etc.</p>
<p>Script arguments are the same as before:</p>
<p>check_3i_storage --password &lt;passwd&gt; --username &lt;user&gt; --server &lt;ip addr/hostname&gt; --datacenter ha-datacenter</p>
<p>I&#8217;ve also included the preliminary service and command definitions I&#8217;ve used in my test environment for reference.</p>
<pre>define service{
use                             generic-service
host_name                       svr-esx-test-01.company.com
service_description             ESX 3i Storage Status
check_command                   check_3i_storage!root!mypassword!
}</pre>
<hr />
<pre>define command{
command_name    check_3i_storage
command_line    $USER1$/check_3i_storage --server $HOSTADDRESS$ --username $ARG1$ --password $ARG2$ --datacenter ha-datacenter
}</pre>
<p>My next steps would be to clean up the code and see if I can speed it up a little by starting with the host object directly rather than the &#8220;datacenter&#8221; managed object. It&#8217;d also be nice if it checked to make sure this was an ESX 3i box and not 3.5.</p>
]]></content:encoded>
			<wfw:commentRss>http://cars.lostroncos.org/2008/02/26/nagios-monitoring-of-esx-3i-on-our-poweredge/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
