Trouble with Backplane

Post here if you encounter any problems or bugs with the software.
notserpmh
Posts: 3
Joined: 2013.03.29. 23:20

Trouble with Backplane

Post by notserpmh »

We love HDSentinel and have used it one several servers before.

We are in the process of building a new server and are running into issues. We have a IBM ServeRAID M1015, flashed with the LSI SAS2008 IT (passthrough, no raid) firmware, rev 15 (the most current). This is being plugged into a Chenbro SAS backplane in this chassis (backplane manual is here: http://usa.chenbro.com/corporatesite/se ... &mk=41&sk=)

We have hooked up to it 12 x 3 TB Seagate Barracuda 7200.14 SATA drives. When we run hdsentinel we get no temperatures or scores, however, if I run smartctl, I get normal SMART output. Below is a sample of

smartctl /dev/sdh -a

run on a specific drive (/dev/sdh)

Is there anything I can do to tweak hdsentinel to get it to see the SMART info, or any info I can supply to help get it updated to support this and similar configurations?

Thanks!

Here is the smartctl output:

Code: Select all

jan hdsentinel # smartctl /dev/sdh -a
smartctl 5.41 2011-06-09 r3365 [x86_64-linux-3.5.0-23-generic] (local build)
Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.14 (AF)
Device Model:     ST3000DM001-1CH166
Serial Number:    W1F2H5LP
LU WWN Device Id: 5 000c50 05ffbb02b
Firmware Version: CC26
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Fri Mar 29 16:44:16 2013 CDT

==> WARNING: A firmware update for this drive may be available,
see the following Seagate web pages:
http://knowledge.seagate.com/articles/en_US/FAQ/207931en
http://knowledge.seagate.com/articles/en_US/FAQ/223651en

SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82)	Offline data collection activity
					was completed without error.
					Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0)	The previous self-test routine completed
					without error or no self-test has ever 
					been run.
Total time to complete Offline 
data collection: 		(   97) seconds.
Offline data collection
capabilities: 			 (0x7b) SMART execute Offline immediate.
					Auto Offline data collection on/off support.
					Suspend Offline collection upon new
					command.
					Offline surface scan supported.
					Self-test supported.
					Conveyance Self-test supported.
					Selective Self-test supported.
SMART capabilities:            (0x0003)	Saves SMART data before entering
					power-saving mode.
					Supports SMART auto save timer.
Error logging capability:        (0x01)	Error logging supported.
					General Purpose Logging supported.
Short self-test routine 
recommended polling time: 	 (   1) minutes.
Extended self-test routine
recommended polling time: 	 ( 255) minutes.
Conveyance self-test routine
recommended polling time: 	 (   2) minutes.
SCT capabilities: 	       (0x3085)	SCT Status supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   100   100   006    Pre-fail  Always       -       135256
  3 Spin_Up_Time            0x0003   094   094   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       15
  5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   100   253   030    Pre-fail  Always       -       128849021673
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       4
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       15
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   100   000    Old_age   Always       -       0 0 0
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   070   070   045    Old_age   Always       -       30 (Min/Max 28/30)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       15
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       17
194 Temperature_Celsius     0x0022   030   040   000    Old_age   Always       -       30 (0 19 0 0)
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       4h+38m+13.706s
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       19137
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       38159

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%         4         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
Here is the hdsentinel -dump output for that same drive if it helps:

Code: Select all

 -- Physical Disk Information - Disk: #7: ATA     ST3000DM001-1CH1 --

   Hard Disk Summary
   -----------------
    Hard Disk Number. . . . . . . . . . . . . . . . .  7
    Hard Disk Device. . . . . . . . . . . . . . . . .  /dev/sdh
    Interface . . . . . . . . . . . . . . . . . . . .  SCSI
    Hard Disk Model ID. . . . . . . . . . . . . . . .  ATA     ST3000DM001-1CH1
    Hard Disk Revision. . . . . . . . . . . . . . . .  CC26
    Hard Disk Serial Number . . . . . . . . . . . . .  W1F2H5LP
    Hard Disk Total Size. . . . . . . . . . . . . . .  2861588 MB
    Current Temperature . . . . . . . . . . . . . . .  ?
    Maximum Temperature (during Entire Lifespan). . .  ?
    Health. . . . . . . . . . . . . . . . . . . . . .  ? (Unknown)
    Performance . . . . . . . . . . . . . . . . . . .  ? (Unknown)

   Properties
   ----------
    Vendor Information. . . . . . . . . . . . . . . .  ?
    Status. . . . . . . . . . . . . . . . . . . . . .  OK
    Version . . . . . . . . . . . . . . . . . . . . .  6
    Device Type . . . . . . . . . . . . . . . . . . .  Disk
    ASC . . . . . . . . . . . . . . . . . . . . . . .  0
    ASCQ. . . . . . . . . . . . . . . . . . . . . . .  0

   SCSI Information
   ----------------
    Removable . . . . . . . . . . . . . . . . . . . .  Not supported
    Failure Prediction. . . . . . . . . . . . . . . .  Not supported
    Failure Prediction. . . . . . . . . . . . . . . .  Disabled
User avatar
hdsentinel
Site Admin
Posts: 3010
Joined: 2008.07.27. 17:00
Location: Hungary
Contact:

Re: Trouble with Backplane

Post by hdsentinel »

Thanks for your message and the information.

Yes, I can confirm that this controller and configuration is currently fully supported only by the Windows version of Hard Disk Sentinel, so the Windows version of Hard Disk Sentinel (even the trial / unregistered) would be able to show all hard disk status immediately.

The Linux version of Hard Disk Sentinel can be updated slowly because of our limited resources. I can make sure it is planned to add support for such controller and configuration into the Linux version as well, just at the moment I can't promise when a such updated version will be available.
notserpmh
Posts: 3
Joined: 2013.03.29. 23:20

Re: Trouble with Backplane

Post by notserpmh »

Great, thank you.

We only use Linux in our office. I know the Linux version is normally free, but for the value we get out of it, we would be glad to pay something to help get this feature in a new version. Would it be possible to "sponsor" getting this feature in a new version and get it released more quickly? If so, what would a ballpark amount be?

Thanks!
User avatar
hdsentinel
Site Admin
Posts: 3010
Joined: 2008.07.27. 17:00
Location: Hungary
Contact:

Re: Trouble with Backplane

Post by hdsentinel »

Thanks for your kind words and offer !

If you prefer to help development, you may transfer some amount to info@hdsentinel.com by Paypal - or maybe order some Windows licenses which you may "park" for unlimited time, they never expire (as they can be used on any systems later, even in your company or on personal systems).

I'd leave the amount on you - as I do not want to ask too much. I can promise that this situation will be investigated with high priority and we'll contact with possible updated version(s) available for testing as soon as possible.

Alternatively, if you can write some thoughts, experiences you have with Hard Disk Sentinel (for example how it helps your company) and we can share on Testimonials page ( http://www.hdsentinel.com/testimonials.php ), that would also help.

Thanks so much for your kindness and patience!
notserpmh
Posts: 3
Joined: 2013.03.29. 23:20

Re: Trouble with Backplane

Post by notserpmh »

Okay, great.

I bought 12 licenses for $210 USD. Hopefully this will help out some. Here is a testimonial for your page if you want to use it:

Provided By Century 21 Mike Bowman, Inc (http://www.c21bowman.com)

Hard Disk Sentinel is an invaluable tool for our company. We use the Linux version extensively on our servers to monitor our hard drives. We have the output connected to our Nagios monitor and with this setup, we have, on several occasions been able to replace a disk in a RAID array before a failure. The allows us to avoid any downtime or outages with our services.

I highly recommend Hard Disk Sentinel for everyone. Disk failures are common and being aware of the health of your disk allows you to plan and avoid costly downtime. It has proved its value in our organization time and time again. It goes way beyond standard S.M.A.R.T monitoring and give you a much better idea of overall drive health. I wouldn't run a system without it.
User avatar
hdsentinel
Site Admin
Posts: 3010
Joined: 2008.07.27. 17:00
Location: Hungary
Contact:

Re: Trouble with Backplane

Post by hdsentinel »

Thank you very much for your kindness and support by both the licenses and the testimonial :)

I can make sure that I'll let you know any progress and send updated versions for testing as soon as possible !
Post Reply