<?xml version="1.0" encoding="utf-8"?>
<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Tumbleweed Rants</title>
  <subtitle>Stefano's World</subtitle>
  <link rel="alternate" type="text/html" href="http://tumbleweed.org.za/2007/02/16/horrific-performance-with-3ware-raid"/>
  <link rel="self" type="application/atom+xml" href="http://tumbleweed.org.za/node/58/atom/feed"/>
  <id>http://tumbleweed.org.za/node/58/atom/feed</id>
  <updated>2008-01-02T19:55:10+00:00</updated>
  <entry>
    <title>Horrific performance with 3ware RAID</title>
    <link rel="alternate" type="text/html" href="http://tumbleweed.org.za/2007/02/16/horrific-performance-with-3ware-raid" />
    <id>http://tumbleweed.org.za/2007/02/16/horrific-performance-with-3ware-raid</id>
    <published>2007-02-16T08:01:54+00:00</published>
    <updated>2008-01-02T19:55:10+00:00</updated>
    <author>
      <name>tumbleweed</name>
    </author>
    <category term="3ware" />
    <category term="hardware" />
    <category term="linux" />
    <category term="raid" />
    <category term="software" />
    <category term="uk2" />
    <summary type="html"><![CDATA[<p>I&#8217;ve been enjoying our server at <a href="http://uk2.net/"><span class="caps"><span class="caps">UK2</span></span>.net</a>. It&#8217;s a pretty speedy machine (although a little light on <span class="caps"><span class="caps">RAM</span></span> - I suspect that they don&#8217;t want people running Xen), and it&#8217;s connected to a fat pipe.
But I&#8217;ve been experiencing a lot of&nbsp;bad&nbsp;lockups.</p>

<p>I traced the problem to <a href="http://www.postfix.org/postmap.1.html">postmap</a>ing the <a href="http://www.uceprotect.net/en/index.php">uceprotect.net</a> <span class="caps"><span class="caps">RBL</span></span> file. They recommend that you rsync this file from them, and then postmap it into a fast lookup database for postfix, rather than using their <span class="caps"><span class="caps">DNSRBL</span></span> service. But running the postmap was taking my box 40 mins. The same operation, on a loaded, lower-spec, 2 year old server took 2 mins (yes this server also has <span class="caps"><span class="caps">RAID1</span></span> on the volume concerned). On my <span class="caps"><span class="caps">UK2</span></span> box, while the postmap was running, the machine became totally unresponsive, and it could take a minute or two to log in, serve a web page, or even execute a basic command like <span class="geshifilter"><code class="geshifilter-text">ps</code></span>.</p>

<p>Clearly something wasn&#8217;t right. And it was something in the <span class="caps"><span class="caps">IO</span></span> system. The only answer is the 3ware <span class="caps"><span class="caps">RAID</span></span> controller. (It&#8217;s a 8006-2, doing <span class="caps"><span class="caps">RAID</span></span>-1) I know these controllers have a big buffer, so I looked up the 3ware website, for <a href="http://www.3ware.com/KB/article.aspx?id=11050">tuning guidance</a>. I followed it to the letter, and things didn&#8217;t really improve. I tried the deadline scheduler, and tweaking the buffers, but it only got&nbsp;marginally&nbsp;better.</p>

<p>Personally, I&#8217;ve always used software <span class="caps"><span class="caps">RAID</span></span>, even for <span class="caps"><span class="caps">RAID</span></span>-5, and I&#8217;ve never had bad performance like that. And having the <span class="caps"><span class="caps">RAID</span></span> in a portable format has really helped with recovery in the past. I understand that Windows monkeys have to use hardware <span class="caps"><span class="caps">RAID</span></span> (because their software <span class="caps"><span class="caps">RAID</span></span> sucks so much), but is this kind of&nbsp;performance&nbsp;normal?</p>

<p>I&#8217;ve asked <span class="caps"><span class="caps">UK2</span></span> to chuck my controller and give me software <span class="caps"><span class="caps">RAID</span></span>&nbsp;:-)</p>

<h2>Update</h2>

<p>I&#8217;ve now got software <span class="caps"><span class="caps">RAID</span></span> 1, and postmap runs in 25 seconds. That&#8217;s what I call a 60x speed&nbsp;improvement&nbsp;:-)</p>

<p>Oh, and the system is totally responsive while the&nbsp;postmap&nbsp;runs.</p>
    ]]></summary>
    <content type="html"><![CDATA[<p>I&#8217;ve been enjoying our server at <a href="http://uk2.net/"><span class="caps">UK2</span>.net</a>. It&#8217;s a pretty speedy machine (although a little light on <span class="caps">RAM</span> - I suspect that they don&#8217;t want people running Xen), and it&#8217;s connected to a fat pipe.
But I&#8217;ve been experiencing a lot of bad&nbsp;lockups.</p>

<p>I traced the problem to <a href="http://www.postfix.org/postmap.1.html">postmap</a>ing the <a href="http://www.uceprotect.net/en/index.php">uceprotect.net</a> <span class="caps">RBL</span> file. They recommend that you rsync this file from them, and then postmap it into a fast lookup database for postfix, rather than using their <span class="caps">DNSRBL</span> service. But running the postmap was taking my box 40 mins. The same operation, on a loaded, lower-spec, 2 year old server took 2 mins (yes this server also has <span class="caps">RAID1</span> on the volume concerned). On my <span class="caps">UK2</span> box, while the postmap was running, the machine became totally unresponsive, and it could take a minute or two to log in, serve a web page, or even execute a basic command like <code>ps</code>.</p>

<p>Clearly something wasn&#8217;t right. And it was something in the <span class="caps">IO</span> system. The only answer is the 3ware <span class="caps">RAID</span> controller. (It&#8217;s a 8006-2, doing <span class="caps">RAID</span>-1) I know these controllers have a big buffer, so I looked up the 3ware website, for <a href="http://www.3ware.com/KB/article.aspx?id=11050">tuning guidance</a>. I followed it to the letter, and things didn&#8217;t really improve. I tried the deadline scheduler, and tweaking the buffers, but it only got marginally&nbsp;better.</p>

<p>Personally, I&#8217;ve always used software <span class="caps">RAID</span>, even for <span class="caps">RAID</span>-5, and I&#8217;ve never had bad performance like that. And having the <span class="caps">RAID</span> in a portable format has really helped with recovery in the past. I understand that Windows monkeys have to use hardware <span class="caps">RAID</span> (because their software <span class="caps">RAID</span> sucks so much), but is this kind of performance&nbsp;normal?</p>

<p>I&#8217;ve asked <span class="caps">UK2</span> to chuck my controller and give me software <span class="caps">RAID</span>&nbsp;:-)</p>

<h2>Update</h2>

<p>I&#8217;ve now got software <span class="caps">RAID</span> 1, and postmap runs in 25 seconds. That&#8217;s what I call a 60x speed improvement&nbsp;:-)</p>

<p>Oh, and the system is totally responsive while the postmap&nbsp;runs.</p>
    ]]></content>
  </entry>
</feed>
