Portal Home > Knowledgebase > Articles Database > Faulty IOstat?


Faulty IOstat?




Posted by whdev, 02-12-2011, 05:11 AM
We have a raid 10, 6 drives and here's the iostat: See sda3 and sda4 svctm and util. It always stays at that high range of level, never drops even when there's our web servers aren't running. Anyone guru knows what could possibly be wrong?

Posted by DeanoC, 02-12-2011, 08:38 AM
svctm and %utils are indicators to how long requests are taking, this indicates those two disk are taking a long time. Without more details i'd be looking at drive or controller or cabling errors. What does a dd test look like in iostat?

Posted by aeris, 02-12-2011, 01:00 PM
As long as sda shows a low load, you can probably ignore any high loads indicated for its partitions. I never use iostat on the partition level, as the numbers tend to be a bit off.

Posted by whdev, 02-13-2011, 09:38 AM
Raid 10, 6 drives (the one with faulty iostat): # sync ; dd if=/dev/zero of=test.file bs=8k count=100k 102400+0 records in 102400+0 records out 838860800 bytes (839 MB) copied, 3.11384 seconds, 269 MB/s # sync ; dd if=test.file of=/dev/null 1638400+0 records in 1638400+0 records out 838860800 bytes (839 MB) copied, 1.91765 seconds, 437 MB/s Raid 5, 4 drives (our other server): # sync ; dd if=/dev/zero of=test.file bs=8k count=100k 102400+0 records in 102400+0 records out 838860800 bytes (839 MB) copied, 2.06441 seconds, 406 MB/s # sync ; dd if=test.file of=/dev/null 1638400+0 records in 1638400+0 records out 838860800 bytes (839 MB) copied, 1.78293 seconds, 470 MB/s

Posted by DeanoC, 02-13-2011, 10:50 AM
Even more suggestive of some hardware issue, the write is slowed as for RAID10 it has to write to all drives, whereas the read is mostly okay as it only sometimes will be accessing the bad drives. I'd say you have a fair case for swapping the drives or cables.

Posted by DeanoC, 02-13-2011, 10:53 AM
If your OS has some fault management tools, see if you can get the read/write/drop out errors per drive. Should help pin down the exact problem

Posted by greecejoe, 02-13-2011, 01:52 PM
drives, cables or ports/controller problem. This is not a software problem.

Posted by UNIXy, 02-13-2011, 02:16 PM
Checkout this link w.r.t iostat's %util issues: http://yoshinorimatsunobu.blogspot.c...-on-linux.html In brief, it's not as reliable as it's thought to be. The %util numbers had me scratch my head in the past too. Regards Joe / UNIXY

Posted by DeanoC, 02-13-2011, 03:02 PM
Its not reliable as a speed metric, but it is fairly reliable as a problem sign. If one or two drives in a RAID set are showing different numbers than the rest of the set (like here) its a fair sign something is wrong, and here backed up by write speed being lower than expected.

Posted by ZenMonk, 02-14-2011, 02:18 AM
What about the results of hdparm -tT /dev/devicename ?



Was this answer helpful?

Add to Favourites Add to Favourites    Print this Article Print this Article

Also Read


Language: