[nSLUG] hard-drives

Dop Ganger nslug at fop.ns.ca
Tue Dec 13 21:31:24 AST 2005


On Tue, 13 Dec 2005, Gordon Jones wrote:

> I ran the maxtor tool on it and got a rma code saying it was defective so 
> have arranged to get new one ,pretty sure it was no good,i do not know what 
> you mean by checking the smart stats!

SMART is a system on pretty much all modern drives to do self monitoring 
for problems (see http://www.pcmech.com/show/harddrive/158/ for a 
reasonably good overview). The Maxtor tool analyses the SMART stats 
(as well as doing a drive scan and what not) to decide whether a drive is 
problematic or not.

A sample output for SMART checking looks like this:

SMART Attributes Data Structure revision number: 11
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
   1 Raw_Read_Error_Rate     0x0029   100   253   020    Pre-fail  Offline      -       0
   3 Spin_Up_Time            0x0027   081   080   020    Pre-fail  Always       -       2451
   4 Start_Stop_Count        0x0032   100   100   008    Old_age   Always       -       144
   5 Reallocated_Sector_Ct   0x0033   100   100   020    Pre-fail  Always       -       0
   7 Seek_Error_Rate         0x000b   100   093   023    Pre-fail  Always       -       0
   9 Power_On_Hours          0x0012   067   067   001    Old_age   Always       -       22221
  10 Spin_Retry_Count        0x0026   100   100   000    Old_age   Always       -       0
  11 Calibration_Retry_Count 0x0013   100   100   020    Pre-fail  Always       -       0
  12 Power_Cycle_Count       0x0032   100   100   008    Old_age   Always       -       115
  13 Read_Soft_Error_Rate    0x000b   100   100   023    Pre-fail  Always       -       0
194 Temperature_Celsius     0x0022   089   084   042    Old_age   Always       -       30
195 Hardware_ECC_Recovered  0x001a   100   099   000    Old_age   Always       -       727469
196 Reallocated_Event_Count 0x0010   100   253   020    Old_age   Offline      -       0
197 Current_Pending_Sector  0x0032   100   100   020    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x001a   001   001   000    Old_age   Always       -       743

>From this I can see that the drive is reasonably healthy for a drive 
that's around 2.5 years old, and is running at 30C (and thanks to graphing 
from rrdtool/MRTG, I know that the temperature is stable at 30C, so the 
drive should last quite a while). If anything were failing, the type would 
be set to "FAILING_NOW". If any of the values hit the threshold, the drive 
should be replaced.

Most Linux distributions have a smart tool. Under Debian it's provided by 
the smartmontools package.

Cheers... Dop.

!DSPAM:439f75f0182484147416842!




More information about the nSLUG mailing list