Defective disk???

Jan-Benedict Glaw jbglaw at lug-owl.de
Tue Jan 13 11:47:27 CET 2009


On Sat, 2009-01-10 12:28:24 +0100, Konstantin Nebel <konnebel at gmx.de> wrote:
> Hi,
> 
> I have a 3ware 8506-4LP RAID controller running a RAID-5. Every now and
> then I hear failure noises from one of the hard disks. As far as I can
> tell, though, SMART shows nothing that points to a defect. Here are the
> excerpts for the 3 disks. Maybe you can spot the error.
> 
> smartctl version 5.38 [x86_64-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
> 
> === START OF INFORMATION SECTION ===
> Model Family:     SAMSUNG SpinPoint P120 series
> Device Model:     SAMSUNG SP2504C
> Serial Number:    301012FP432889
> Firmware Version: VT100-59
> User Capacity:    250.059.350.016 bytes
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   7
> ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 4a
> Local Time is:    Sat Jan 10 12:15:35 2009 CET
> 
> ==> WARNING: May need -F samsung3 enabled; see manual for details.
> 
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x02)	Offline data collection activity
> 					was completed without error.
> 					Auto Offline Data Collection: Disabled.
> Self-test execution status:      (   0)	The previous self-test routine completed
> 					without error or no self-test has ever 
> 					been run.
> Total time to complete Offline 
> data collection: 		 (5066) seconds.
> Offline data collection
> capabilities: 			 (0x5b) SMART execute Offline immediate.
> 					Auto Offline data collection on/off support.
> 					Suspend Offline collection upon new
> 					command.
> 					Offline surface scan supported.
> 					Self-test supported.
> 					No Conveyance Self-test supported.
> 					Selective Self-test supported.
> SMART capabilities:            (0x0003)	Saves SMART data before entering
> 					power-saving mode.
> 					Supports SMART auto save timer.
> Error logging capability:        (0x01)	Error logging supported.
> 					General Purpose Logging supported.
> Short self-test routine 
> recommended polling time: 	 (   1) minutes.
> Extended self-test routine
> recommended polling time: 	 (  84) minutes.
> SCT capabilities: 	       (0x003f)	SCT Status supported.
> 					SCT Feature Control supported.
> 					SCT Data Table supported.
> 
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000f   100   100   051    Pre-fail  Always       -       0
>   3 Spin_Up_Time            0x0007   253   253   025    Pre-fail  Always       -       5824
>   4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       92
>   5 Reallocated_Sector_Ct   0x0033   253   253   010    Pre-fail  Always       -       0
>   7 Seek_Error_Rate         0x000f   253   253   051    Pre-fail  Always       -       0
>   8 Seek_Time_Performance   0x0025   253   253   015    Pre-fail  Offline      -       0
>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       3032
>  10 Spin_Retry_Count        0x0033   253   253   051    Pre-fail  Always       -       0
>  11 Calibration_Retry_Count 0x0012   253   253   000    Old_age   Always       -       0
>  12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       83
> 187 Reported_Uncorrect      0x0032   253   253   000    Old_age   Always       -       196608
                                                                                         ^^^^^^
> 190 Airflow_Temperature_Cel 0x0022   109   094   000    Old_age   Always       -       43
> 194 Temperature_Celsius     0x0022   109   094   000    Old_age   Always       -       43
> 195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       25285952
> 196 Reallocated_Event_Count 0x0032   253   253   000    Old_age   Always       -       0
> 197 Current_Pending_Sector  0x0012   253   253   000    Old_age   Always       -       0
> 198 Offline_Uncorrectable   0x0030   253   253   000    Old_age   Offline      -       0
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
> 200 Multi_Zone_Error_Rate   0x000a   100   100   000    Old_age   Always       -       0
> 201 Soft_Read_Error_Rate    0x000a   100   100   000    Old_age   Always       -       0
> 202 TA_Increase_Count       0x0032   253   253   000    Old_age   Always       -       0
> 
> SMART Error Log Version: 1
> No Errors Logged
> 
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Extended offline    Completed without error       00%      2793         -
> # 2  Extended offline    Interrupted (host reset)      70%      2346         -
> # 3  Extended offline    Completed without error       00%      1417         -
> # 4  Extended offline    Completed without error       00%       643         -
> # 5  Extended offline    Completed without error       00%       360         -
> # 6  Extended offline    Completed without error       00%       245         -
> 
> SMART Selective Self-Test Log Data Structure Revision Number (0) should be 1
> SMART Selective self-test log data structure revision number 0
> Warning: ATA Specification requires selective self-test log data structure revision number = 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>     1        0        0  Not_testing
>     2        0        0  Not_testing
>     3        0        0  Not_testing
>     4        0        0  Not_testing
>     5        0        0  Not_testing
> Selective self-test flags (0x0):
>   After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.
> 
> ______________________________________
> 
> 
> smartctl version 5.38 [x86_64-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
> 
> === START OF INFORMATION SECTION ===
> Model Family:     SAMSUNG SpinPoint P120 series
> Device Model:     SAMSUNG SP2504C
> Serial Number:    S09QJ1GL302662
> Firmware Version: VT100-33
> User Capacity:    250.059.350.016 bytes
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   7
> ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 4a
> Local Time is:    Sat Jan 10 12:15:51 2009 CET
> 
> ==> WARNING: May need -F samsung3 enabled; see manual for details.
> 
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x82)	Offline data collection activity
> 					was completed without error.
> 					Auto Offline Data Collection: Enabled.
> Self-test execution status:      (   0)	The previous self-test routine completed
> 					without error or no self-test has ever 
> 					been run.
> Total time to complete Offline 
> data collection: 		 (4806) seconds.
> Offline data collection
> capabilities: 			 (0x5b) SMART execute Offline immediate.
> 					Auto Offline data collection on/off support.
> 					Suspend Offline collection upon new
> 					command.
> 					Offline surface scan supported.
> 					Self-test supported.
> 					No Conveyance Self-test supported.
> 					Selective Self-test supported.
> SMART capabilities:            (0x0003)	Saves SMART data before entering
> 					power-saving mode.
> 					Supports SMART auto save timer.
> Error logging capability:        (0x01)	Error logging supported.
> 					General Purpose Logging supported.
> Short self-test routine 
> recommended polling time: 	 (   1) minutes.
> Extended self-test routine
> recommended polling time: 	 (  80) minutes.
> 
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000f   100   100   051    Pre-fail  Always       -       10
>   3 Spin_Up_Time            0x0007   100   100   025    Pre-fail  Always       -       6016
>   4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       689
>   5 Reallocated_Sector_Ct   0x0033   077   077   010    Pre-fail  Always       -       219
>   7 Seek_Error_Rate         0x000f   253   253   051    Pre-fail  Always       -       0
>   8 Seek_Time_Performance   0x0025   253   253   015    Pre-fail  Offline      -       0
>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       17874
>  10 Spin_Retry_Count        0x0033   253   253   051    Pre-fail  Always       -       0
>  11 Calibration_Retry_Count 0x0012   253   002   000    Old_age   Always       -       0
>  12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       663
> 190 Airflow_Temperature_Cel 0x0022   115   082   000    Old_age   Always       -       41
> 194 Temperature_Celsius     0x0022   115   082   000    Old_age   Always       -       41
> 195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       17806828
> 196 Reallocated_Event_Count 0x0032   077   077   000    Old_age   Always       -       219
> 197 Current_Pending_Sector  0x0012   253   253   000    Old_age   Always       -       0
> 198 Offline_Uncorrectable   0x0030   253   253   000    Old_age   Offline      -       0
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
> 200 Multi_Zone_Error_Rate   0x000a   100   100   000    Old_age   Always       -       0
> 201 Soft_Read_Error_Rate    0x000a   100   100   000    Old_age   Always       -       0
> 202 TA_Increase_Count       0x0032   253   253   000    Old_age   Always       -       0
> 
> SMART Error Log Version: 1
> No Errors Logged
> 
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Extended offline    Completed without error       00%     17638         -
> # 2  Extended offline    Interrupted (host reset)      20%     17352         -
> # 3  Extended offline    Completed without error       00%     16277         -
> # 4  Extended offline    Completed without error       00%     15985         -
> # 5  Extended offline    Completed without error       00%     15502         -
> # 6  Extended offline    Completed without error       00%     15223         -
> # 7  Extended offline    Completed without error       00%     15108         -
> # 8  Extended offline    Completed without error       00%     14615         -
> # 9  Extended offline    Completed without error       00%     14469         -
> #10  Extended offline    Completed without error       00%     14202         -
> #11  Extended offline    Completed without error       00%     14171         -
> 
> SMART Selective Self-Test Log Data Structure Revision Number (0) should be 1
> SMART Selective self-test log data structure revision number 0
> Warning: ATA Specification requires selective self-test log data structure revision number = 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>     1        0        0  Not_testing
>     2        0        0  Not_testing
>     3        0        0  Not_testing
>     4        0        0  Not_testing
>     5        0        0  Not_testing
> Selective self-test flags (0x0):
>   After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.
> 
> ______________________________________
> 
> smartctl version 5.38 [x86_64-pc-linux-gnu] Copyright (C) 2002-8 Bruce Allen
> Home page is http://smartmontools.sourceforge.net/
> 
> === START OF INFORMATION SECTION ===
> Model Family:     SAMSUNG SpinPoint P120 series
> Device Model:     SAMSUNG SP2504C
> Serial Number:    S09QJ1GL302666
> Firmware Version: VT100-33
> User Capacity:    250.059.350.016 bytes
> Device is:        In smartctl database [for details use: -P show]
> ATA Version is:   7
> ATA Standard is:  ATA/ATAPI-7 T13 1532D revision 4a
> Local Time is:    Sat Jan 10 12:15:57 2009 CET
> 
> ==> WARNING: May need -F samsung3 enabled; see manual for details.
> 
> SMART support is: Available - device has SMART capability.
> SMART support is: Enabled
> 
> === START OF READ SMART DATA SECTION ===
> SMART overall-health self-assessment test result: PASSED
> 
> General SMART Values:
> Offline data collection status:  (0x82)	Offline data collection activity
> 					was completed without error.
> 					Auto Offline Data Collection: Enabled.
> Self-test execution status:      (   0)	The previous self-test routine completed
> 					without error or no self-test has ever 
> 					been run.
> Total time to complete Offline 
> data collection: 		 (4836) seconds.
> Offline data collection
> capabilities: 			 (0x5b) SMART execute Offline immediate.
> 					Auto Offline data collection on/off support.
> 					Suspend Offline collection upon new
> 					command.
> 					Offline surface scan supported.
> 					Self-test supported.
> 					No Conveyance Self-test supported.
> 					Selective Self-test supported.
> SMART capabilities:            (0x0003)	Saves SMART data before entering
> 					power-saving mode.
> 					Supports SMART auto save timer.
> Error logging capability:        (0x01)	Error logging supported.
> 					General Purpose Logging supported.
> Short self-test routine 
> recommended polling time: 	 (   1) minutes.
> Extended self-test routine
> recommended polling time: 	 (  80) minutes.
> 
> SMART Attributes Data Structure revision number: 16
> Vendor Specific SMART Attributes with Thresholds:
> ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
>   1 Raw_Read_Error_Rate     0x000f   100   099   051    Pre-fail  Always       -       163
>   3 Spin_Up_Time            0x0007   100   100   025    Pre-fail  Always       -       5952
>   4 Start_Stop_Count        0x0032   100   100   000    Old_age   Always       -       692
>   5 Reallocated_Sector_Ct   0x0033   100   100   010    Pre-fail  Always       -       1
>   7 Seek_Error_Rate         0x000f   253   253   051    Pre-fail  Always       -       0
>   8 Seek_Time_Performance   0x0025   253   253   015    Pre-fail  Offline      -       0
>   9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       17893
>  10 Spin_Retry_Count        0x0033   253   253   051    Pre-fail  Always       -       0
>  11 Calibration_Retry_Count 0x0012   100   002   000    Old_age   Always       -       33
                                                                                         ^^
>  12 Power_Cycle_Count       0x0032   100   100   000    Old_age   Always       -       666
> 190 Airflow_Temperature_Cel 0x0022   121   082   000    Old_age   Always       -       39
> 194 Temperature_Celsius     0x0022   121   082   000    Old_age   Always       -       39
> 195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       44236889
> 196 Reallocated_Event_Count 0x0032   100   100   000    Old_age   Always       -       1
> 197 Current_Pending_Sector  0x0012   253   100   000    Old_age   Always       -       0
> 198 Offline_Uncorrectable   0x0030   253   253   000    Old_age   Offline      -       0
> 199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
> 200 Multi_Zone_Error_Rate   0x000a   100   100   000    Old_age   Always       -       0
> 201 Soft_Read_Error_Rate    0x000a   100   100   000    Old_age   Always       -       0
> 202 TA_Increase_Count       0x0032   253   253   000    Old_age   Always       -       0
> 
> SMART Error Log Version: 1
> No Errors Logged
> 
> SMART Self-test log structure revision number 1
> Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
> # 1  Extended offline    Completed without error       00%     17657         -
> # 2  Extended offline    Interrupted (host reset)      40%     17361         -
> # 3  Extended offline    Completed without error       00%     16457         -
> # 4  Extended offline    Completed without error       00%     16288         -
> # 5  Extended offline    Completed without error       00%     15997         -
> # 6  Extended offline    Completed without error       00%     15514         -
> # 7  Extended offline    Completed without error       00%     15235         -
> # 8  Extended offline    Completed without error       00%     15120         -
> # 9  Extended offline    Completed without error       00%     14627         -
> #10  Extended offline    Completed without error       00%     14481         -
> #11  Extended offline    Completed without error       00%     14210         -
> #12  Extended offline    Completed without error       00%     14182         -
> 
> SMART Selective Self-Test Log Data Structure Revision Number (0) should be 1
> SMART Selective self-test log data structure revision number 0
> Warning: ATA Specification requires selective self-test log data structure revision number = 1
>  SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
>     1        0        0  Not_testing
>     2        0        0  Not_testing
>     3        0        0  Not_testing
>     4        0        0  Not_testing
>     5        0        0  Not_testing
> Selective self-test flags (0x0):
>   After scanning selected spans, do NOT read-scan remainder of disk.
> If Selective self-test is pending on power-up, resume after 0 minute delay.
> 
> Here is the excerpt from the RAID controller as well:
> 
> Unit  UnitType  Status         %RCmpl  %V/I/M  Stripe  Size(GB)  Cache  AVrfy
> ------------------------------------------------------------------------------
> u0    RAID-5    OK             -       -       64K     465.77    ON     -
> 
> Port   Status           Unit   Size        Blocks        Serial
> ---------------------------------------------------------------
> p0     OK               u0     232.88 GB   488397168     301012FP432889
> p1     NOT-PRESENT      -      -           -             -
> p2     OK               u0     232.88 GB   488397168     S09QJ1GL302662
> p3     OK               u0     232.88 GB   488397168     S09QJ1GL302666


Ergo: the first disk failed to deliver data on roughly 200,000 accesses
(Reported_Uncorrect, raw value 196608). The third disk seems to have
(occasional) trouble finding the track (Calibration_Retry_Count, raw
value 33).
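
Picking such rows out by eye is tedious across three reports. A minimal
Python sketch of the same check follows, assuming the `smartctl`
attribute table layout shown above (ten whitespace-separated columns per
row). The function name and the watch list are hypothetical, chosen only
for illustration; the watch list is not an official set of "bad"
attributes, and the sample mixes one row from the third disk with rows
from the first.

```python
# Watch list chosen for illustration (an assumption, not an official set):
# counters whose raw value is normally 0 on a healthy drive.
WATCHED = {
    "Reallocated_Sector_Ct",
    "Calibration_Retry_Count",
    "Reported_Uncorrect",
    "Reallocated_Event_Count",
    "Current_Pending_Sector",
    "Offline_Uncorrectable",
}


def suspicious_attributes(report):
    """Return (name, raw_value) pairs for watched attributes with raw > 0."""
    findings = []
    for line in report.splitlines():
        # Drop mail quoting ("> ") and leading blanks, then split columns.
        fields = line.lstrip("> ").split()
        # An attribute row has 10 columns: ID, name, flag, value, worst,
        # thresh, type, updated, when_failed, raw value.
        if len(fields) < 10 or not fields[0].isdigit():
            continue
        if not fields[2].startswith("0x"):
            continue
        name, raw = fields[1], fields[9]
        if name in WATCHED and raw.isdigit() and int(raw) > 0:
            findings.append((name, int(raw)))
    return findings


# Rows copied from the reports above; the last one keeps its mail quoting.
SAMPLE = """\
  5 Reallocated_Sector_Ct   0x0033   253   253   010    Pre-fail  Always       -       0
187 Reported_Uncorrect      0x0032   253   253   000    Old_age   Always       -       196608
195 Hardware_ECC_Recovered  0x001a   100   100   000    Old_age   Always       -       25285952
>  11 Calibration_Retry_Count 0x0012   100   002   000    Old_age   Always       -       33
"""

if __name__ == "__main__":
    for name, raw in suspicious_attributes(SAMPLE):
        print(f"{name}: raw value {raw}")
```

The sketch only looks at raw values and ignores the normalized
VALUE/WORST/THRESH columns, so it is a quick filter rather than a health
verdict.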

Regards, JBG

-- 
      Jan-Benedict Glaw      jbglaw at lug-owl.de              +49-172-7608481
Signature of:             God put me on earth to accomplish a certain number of
the second  :            things. Right now I am so far behind I will never die.