cluber
15-12-2012, 13:21
Добрый день.
Столкнулся с пренеприятной проблемой.
В общем Фря крутится на софтверном зеркале gmirror
на днях по причине наличия беда вывалился диск из массива (ad0 - винт WD 160 GB)
Винт проверил MHDD - по адресу 47208092 обнаружен один бед
стер диск командой MHDD erase - бед пропал, т.е. это был софтверный а не физический бед
еще раз проверил винт MHDD - поверхность диска чистая, - все ОК
вернул диск на место - массив не синхронизируется
smartd сыпет на мыло сообщения:
SMART error (OfflineUncorrectableSector) detected on host: Host
This email was generated by the smartd daemon running on:
host name: ***
DNS domain: ***
NIS domain:
The following warning/error was logged by the smartd daemon:
Device: /dev/ad0, 1 Offline uncorrectable sectors
For details see host's SYSLOG (default: /var/log/messages).
You can also use the smartctl utility for further investigation.
No additional email messages about this problem will be sent.
dd_rescue ошибок не находит
dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): ipos: 156290880.0k, opos: 156290880.0k, xferd: 156290880.0k
errs: 0, errxfer: 0.0k, succxfer: 156290880.0k
+curr.rate: 55252kB/s, avg.rate: 76174kB/s, avg.load: 3.3%
dd_rescue: (info): /dev/ad0 (156290904.0k): EOF
Summary for /dev/ad0 -> /dev/null:
dd_rescue: (info): ipos: 156290904.0k, opos: 156290904.0k, xferd: 156290904.0k
errs: 0, errxfer: 0.0k, succxfer: 156290904.0k
+curr.rate: 75472kB/s, avg.rate: 76174kB/s, avg.load: 3.3%
# smartd -q onecheck
smartd 5.39.1 2010-01-28 r3054 [FreeBSD 8.1-RELEASE-p1 i386] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
Opened configuration file /usr/local/etc/smartd.conf
Configuration file /usr/local/etc/smartd.conf parsed.
Device: /dev/ad0, opened
Device: /dev/ad0, found in smartd database.
Device: /dev/ad0, is SMART capable. Adding to "monitor" list.
Monitoring 2 ATA and 0 SCSI devices
Device: /dev/ad0, opened ATA device
Device: /dev/ad0, 1 Offline uncorrectable sectors
Sending warning via mail to ...mymail@myaddress
Warning via mail to mymail@myaddress: successful
Device: /dev/ad0, initial Temperature is 30 Celsius (Min/Max ??/30)
Started with '-q onecheck' option. All devices sucessfully checked once.
smartd is exiting (exit status 0)
smartctl -a /dev/ad0
...
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 9c 56 d0 e2 Error: UNC at LBA = 0x02d0569c = 47208092
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 9c 56 d0 02 00 09:40:45.784 READ DMA
c8 00 01 9b 56 d0 02 00 09:40:45.784 READ DMA
c8 00 01 9a 56 d0 02 00 09:40:45.783 READ DMA
c8 00 01 99 56 d0 02 00 09:40:45.783 READ DMA
c8 00 01 98 56 d0 02 00 09:40:45.783 READ DMA
...
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 9c 56 d0 e2 Error: UNC at LBA = 0x02d0569c = 47208092
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 80 3f 56 d0 02 00 09:40:43.449 READ DMA
c8 00 80 bf 55 d0 02 00 09:40:43.448 READ DMA
c8 00 80 3f 55 d0 02 00 09:40:43.447 READ DMA
c8 00 80 bf 54 d0 02 00 09:40:43.447 READ DMA
c8 00 80 3f 54 d0 02 00 09:40:43.446 READ DMA
ситуация тупиковая - виртуальный бед, которого фактически нет, чтобы исправить(реалокейтом например), но он (какбы) есть и винт поэтому в зеркало не пускает
Помогите, пожалуйста, подскажите, как решить эту проблему
Столкнулся с пренеприятной проблемой.
В общем Фря крутится на софтверном зеркале gmirror
на днях по причине наличия беда вывалился диск из массива (ad0 - винт WD 160 GB)
Винт проверил MHDD - по адресу 47208092 обнаружен один бед
стер диск командой MHDD erase - бед пропал, т.е. это был софтверный а не физический бед
еще раз проверил винт MHDD - поверхность диска чистая, - все ОК
вернул диск на место - массив не синхронизируется
smartd сыпет на мыло сообщения:
SMART error (OfflineUncorrectableSector) detected on host: Host
This email was generated by the smartd daemon running on:
host name: ***
DNS domain: ***
NIS domain:
The following warning/error was logged by the smartd daemon:
Device: /dev/ad0, 1 Offline uncorrectable sectors
For details see host's SYSLOG (default: /var/log/messages).
You can also use the smartctl utility for further investigation.
No additional email messages about this problem will be sent.
dd_rescue ошибок не находит
dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): ipos: 156290880.0k, opos: 156290880.0k, xferd: 156290880.0k
errs: 0, errxfer: 0.0k, succxfer: 156290880.0k
+curr.rate: 55252kB/s, avg.rate: 76174kB/s, avg.load: 3.3%
dd_rescue: (info): /dev/ad0 (156290904.0k): EOF
Summary for /dev/ad0 -> /dev/null:
dd_rescue: (info): ipos: 156290904.0k, opos: 156290904.0k, xferd: 156290904.0k
errs: 0, errxfer: 0.0k, succxfer: 156290904.0k
+curr.rate: 75472kB/s, avg.rate: 76174kB/s, avg.load: 3.3%
# smartd -q onecheck
smartd 5.39.1 2010-01-28 r3054 [FreeBSD 8.1-RELEASE-p1 i386] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
Opened configuration file /usr/local/etc/smartd.conf
Configuration file /usr/local/etc/smartd.conf parsed.
Device: /dev/ad0, opened
Device: /dev/ad0, found in smartd database.
Device: /dev/ad0, is SMART capable. Adding to "monitor" list.
Monitoring 2 ATA and 0 SCSI devices
Device: /dev/ad0, opened ATA device
Device: /dev/ad0, 1 Offline uncorrectable sectors
Sending warning via mail to ...mymail@myaddress
Warning via mail to mymail@myaddress: successful
Device: /dev/ad0, initial Temperature is 30 Celsius (Min/Max ??/30)
Started with '-q onecheck' option. All devices sucessfully checked once.
smartd is exiting (exit status 0)
smartctl -a /dev/ad0
...
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 9c 56 d0 e2 Error: UNC at LBA = 0x02d0569c = 47208092
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 01 9c 56 d0 02 00 09:40:45.784 READ DMA
c8 00 01 9b 56 d0 02 00 09:40:45.784 READ DMA
c8 00 01 9a 56 d0 02 00 09:40:45.783 READ DMA
c8 00 01 99 56 d0 02 00 09:40:45.783 READ DMA
c8 00 01 98 56 d0 02 00 09:40:45.783 READ DMA
...
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 9c 56 d0 e2 Error: UNC at LBA = 0x02d0569c = 47208092
Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC Powered_Up_Time Command/Feature_Name
-- -- -- -- -- -- -- -- ---------------- --------------------
c8 00 80 3f 56 d0 02 00 09:40:43.449 READ DMA
c8 00 80 bf 55 d0 02 00 09:40:43.448 READ DMA
c8 00 80 3f 55 d0 02 00 09:40:43.447 READ DMA
c8 00 80 bf 54 d0 02 00 09:40:43.447 READ DMA
c8 00 80 3f 54 d0 02 00 09:40:43.446 READ DMA
ситуация тупиковая - виртуальный бед, которого фактически нет, чтобы исправить(реалокейтом например), но он (какбы) есть и винт поэтому в зеркало не пускает
Помогите, пожалуйста, подскажите, как решить эту проблему