RAID 1 & Off-Line Uncorrectable Sector Count

How, what, where and why - when using the software.
jeffhare
Posts: 2
Joined: 2018.03.08. 21:29

RAID 1 & Off-Line Uncorrectable Sector Count

Post by jeffhare »

Hello,

I'm seeing the following error in a RAID 1 setup (my D drive has 2 physical drives) on my Server 2012R2 system and am looking for some guidance on the proper course of action. If it's just a bad sector and can be remapped easily, great, but I'm not sure of the right way to accomplish that. Any ideas? (hopefully I provided enough info to start with)
Disk: #2: WDC WD60EFRX-68L0BN1 - Off-Line Uncorrectable Sector Count

Health Information

Disk: #0: WD5003ABYX-88 LEN: 100 %
Disk: #1: WDC WD5003ABYZ-011FA0: 100 %
Disk: #2: WDC WD60EFRX-68L0BN1: 100 %
Disk: #3: WDC WD60EFRX-68MYMN1: 100 %

Partition Information

Disk: #0-1, Logical Drive C: Total Space: 427.3 GB, Free Space: 348.5 GB 82 %
Disk: #2-3, Logical Drive D: Total Space: 5309.5 GB, Free Space: 855.9 GB 16 %
* Short Self-Test passed
* Extended Self-Test has many hours left to run.
* Surface Test doesn't seem compatible with drives in a RAID 1 array, but that is my impression since it refers to surface scan on my "D:" array not just the individual disk.
* It's a 6TB array with only 16% free space at the moment and really don't want to damage the array, but I could pull the bad drive.
Failure Predicted - Attribute: 198 Off-Line Uncorrectable Sector Count, Count of uncorrectable errors when reading/writing. Indicate problem with the disk surface or the read/write heads.
Replace hard disk immediately.
SMART Info tab:
198, Off-Line Uncorrectable Sector Count, 116, 100, 253, Advisory: Usage limit exceeded, 000000000000, 0, Enabled
User avatar
hdsentinel
Site Admin
Posts: 3128
Joined: 2008.07.27. 17:00
Location: Hungary
Contact:

Re: RAID 1 & Off-Line Uncorrectable Sector Count

Post by hdsentinel »

Thanks for your message and the information.

According the details, I'm sure that the hard disk is perfect, there is no problem with it.
Ther is absolutely no bad sector present - otherwise the health % and the text description in Hard Disk Sentinel would surely mention that.

> Disk: #2: WDC WD60EFRX-68L0BN1: 100 %

This confirms that generally the hard disk perfect, there is no problem (bad sector or any other issue) found.

> Short Self-Test passed

Yes, as the drive is perfect, it is completely expected.


> Extended Self-Test has many hours left to run.

Yes, this is completely normal and expected. As described in the Help, the extended self test may run for much longer than estimated when the drive used during the test as all reads/writes delays the execution of the test.
Usually if the status is not perfect, not stable it stops relatively quickly so if the test runs for long time, it is good.


> Surface Test doesn't seem compatible with drives in a RAID 1 array, but that is
> my impression since it refers to surface scan on my "D:" array not just the individual disk.

Surface test is fully compatible with the RAID array of course, just as you wrote, yes, it tests the complete disk array.
This is not a bug/limitation of course, works completely as expected, as the RAID controller would not allow to perform surface testing on individual drives, just the combined array.

So by the hardware tests (short / extended test) yes, you can access individual drives in your case (this is not always true, some RAID controllers even do not allow this) and by the surface tests, you can diagnose/test the complete RAID array.
This way you can check how the drives work in general - and also how the completed, created array works (in terms of reliability, performance too).


> It's a 6TB array with only 16% free space at the moment and really don't want to damage the array, but I could pull the bad drive.

No need for that of course.


> Failure Predicted - Attribute: 198 Off-Line Uncorrectable Sector Count,
> SMART Info tab:
> 198, Off-Line Uncorrectable Sector Count, 116, 100, 253, Advisory: Usage limit exceeded, 000000000000, 0, Enabled

Yes I see.
Generally there is no such error (as the number is 000000000000) so the drive surely has no problems.

You may got a false alarm: which may happen when the disk controller (more precisely its driver) has a bug and may sometimes incorrectly provide status information for Hard Disk Sentinel.
In this case, it seems the "Threshold" field is incorrectly reported by the controller - which causes a false assumption that the error level threshold exceeded (which happens when the Value is lower than the Threshold) - even if the actual error counter is zero.

This is not rare when a problematic older Intel chipset RAID controller driver used. On the Information page you can verify the Disk Controller (which manages the RAID array) and its driver. If I'm correct, an Intel driver before version 11.6 used and while some of them are good, there are many driver version which have problems (like this one).
If this is true, installing the proper driver from the Support -> Driver Zone page
( https://www.hdsentinel.com/driverzone.php )
or diretly from https://www.hdsentinel.com/driver/stor_ ... 030_pv.zip )
will likely help.

Generally Hard Disk Sentinel has filters against such/similar invalid driver responses, but sometimes the drivers may still provide false information.
Please use Report menu -> Send test report to developer option, as then it is possible to check the actual situation, verify the driver version and response and improve the situation. Plus (especially if the controller/driver is different) from the report I can check and advise about the best to be used to improve the situation and prevent false information to be displayed.
jeffhare
Posts: 2
Joined: 2018.03.08. 21:29

Re: RAID 1 & Off-Line Uncorrectable Sector Count

Post by jeffhare »

Thank you very much for the clear reply.

As of today the drive no longer reports any anomaly, so as you suggested, it was likely a disk controller glitch. Sorry for jumping the gun. When I read the S.M.A.R.T. info and it didn't really jive with the error report, I just assumed I wasn't interpreting the S.M.A.R.T. data correctly.

I have to say I'm so much happier over the past year now having the HDSentinel software running on my server then in the past!

Thanks Again!
-Jeff
Post Reply