gmirror and a soft (former) bad sector
Good afternoon.
I've run into a very unpleasant problem. My FreeBSD box runs on a gmirror software mirror. A few days ago one disk (ad0, a WD 160 GB drive) dropped out of the array because of a bad sector.

I checked the drive with MHDD: one bad sector was found at address 47208092.
I wiped the disk with the MHDD erase command and the bad sector disappeared, i.e. it was a soft (logical) bad sector, not a physical one.
I checked the drive with MHDD once more: the disk surface is clean, everything is OK.
I put the disk back in place, but the array does not synchronize.

smartd keeps sending messages to my mailbox:

SMART error (OfflineUncorrectableSector) detected on host: Host

This email was generated by the smartd daemon running on:
host name: ***
DNS domain: ***
NIS domain:

The following warning/error was logged by the smartd daemon:
Device: /dev/ad0, 1 Offline uncorrectable sectors

For details see host's SYSLOG (default: /var/log/messages).
You can also use the smartctl utility for further investigation.
No additional email messages about this problem will be sent.

dd_rescue finds no errors:

dd_rescue: (info): about to transfer 0.0 kBytes from /dev/ad0 to /dev/null
dd_rescue: (info): blocksizes: soft 65536, hard 512
dd_rescue: (info): starting positions: in 0.0k, out 0.0k
dd_rescue: (info): Logfile: /var/log/error.log, Maxerr: 0
dd_rescue: (info): Reverse: no , Trunc: no , interactive: no
dd_rescue: (info): abort on Write errs: no , spArse write: if err
dd_rescue: (info): ipos: 156290880.0k, opos: 156290880.0k, xferd: 156290880.0k
                   errs: 0, errxfer: 0.0k, succxfer: 156290880.0k
             +curr.rate: 55252kB/s, avg.rate: 76174kB/s, avg.load: 3.3%
dd_rescue: (info): /dev/ad0 (156290904.0k): EOF
Summary for /dev/ad0 -> /dev/null:
dd_rescue: (info): ipos: 156290904.0k, opos: 156290904.0k, xferd: 156290904.0k
                   errs: 0, errxfer: 0.0k, succxfer: 156290904.0k
             +curr.rate: 75472kB/s, avg.rate: 76174kB/s, avg.load: 3.3%

# smartd -q onecheck
smartd 5.39.1 2010-01-28 r3054 [FreeBSD 8.1-RELEASE-p1 i386] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net
Opened configuration file /usr/local/etc/smartd.conf
Configuration file /usr/local/etc/smartd.conf parsed.
Device: /dev/ad0, opened
Device: /dev/ad0, found in smartd database.
Device: /dev/ad0, is SMART capable. Adding to "monitor" list.
Monitoring 2 ATA and 0 SCSI devices
Device: /dev/ad0, opened ATA device
Device: /dev/ad0, 1 Offline uncorrectable sectors
Sending warning via mail to ...mymail@myaddress
Warning via mail to mymail@myaddress: successful
Device: /dev/ad0, initial Temperature is 30 Celsius (Min/Max ??/30)
Started with '-q onecheck' option. All devices sucessfully checked once.
smartd is exiting (exit status 0)

smartctl -a /dev/ad0
...
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 9c 56 d0 e2  Error: UNC at LBA = 0x02d0569c = 47208092

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC  Powered_Up_Time  Command/Feature_Name
-- -- -- -- -- -- -- --  ---------------  --------------------
c8 00 01 9c 56 d0 02 00  09:40:45.784     READ DMA
c8 00 01 9b 56 d0 02 00  09:40:45.784     READ DMA
c8 00 01 9a 56 d0 02 00  09:40:45.783     READ DMA
c8 00 01 99 56 d0 02 00  09:40:45.783     READ DMA
c8 00 01 98 56 d0 02 00  09:40:45.783     READ DMA
...
After command completion occurred, registers were:
ER ST SC SN CL CH DH
-- -- -- -- -- -- --
40 51 00 9c 56 d0 e2  Error: UNC at LBA = 0x02d0569c = 47208092

Commands leading to the command that caused the error were:
CR FR SC SN CL CH DH DC  Powered_Up_Time  Command/Feature_Name
-- -- -- -- -- -- -- --  ---------------  --------------------
c8 00 80 3f 56 d0 02 00  09:40:43.449     READ DMA
c8 00 80 bf 55 d0 02 00  09:40:43.448     READ DMA
c8 00 80 3f 55 d0 02 00  09:40:43.447     READ DMA
c8 00 80 bf 54 d0 02 00  09:40:43.447     READ DMA
c8 00 80 3f 54 d0 02 00  09:40:43.446     READ DMA

It's a dead end: a virtual bad sector that does not physically exist, so there is nothing to fix (by reallocation, for example), yet it (sort of) is still there, and because of it the drive is not accepted back into the mirror.

Please help: how can I solve this problem?
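
For reference, the LBA that smartctl prints can be rebuilt by hand from the register dump in the error log, which makes it easier to compare with the address MHDD reported and with the range dd_rescue read. The sketch below is only an illustration under some assumptions: the helper name `decode_lba28` is made up for this example, the registers are read as a 28-bit LBA (SN, CL, CH plus the low nibble of DH), and the "k" in the dd_rescue output is taken to mean 1024 bytes.

```python
# Sketch: rebuild the 28-bit LBA from the ATA registers in the smartctl error log.
# All values are copied from the post; decode_lba28 is a hypothetical helper name.

SECTOR_SIZE = 512  # dd_rescue reports "hard 512" for /dev/ad0


def decode_lba28(sn: int, cl: int, ch: int, dh: int) -> int:
    """Combine SN, CL, CH and the low nibble of DH into a 28-bit LBA."""
    return ((dh & 0x0F) << 24) | (ch << 16) | (cl << 8) | sn


# Registers after the failed command: SN=9c CL=56 CH=d0 DH=e2
lba = decode_lba28(0x9C, 0x56, 0xD0, 0xE2)

# Byte offset of that sector from the start of the disk
byte_offset = lba * SECTOR_SIZE

# Amount dd_rescue read without errors: 156290904.0k
# (assumption: "k" here means 1024 bytes)
sectors_read = 156290904 * 1024 // SECTOR_SIZE

print(f"LBA: {lba} (0x{lba:08x})")
print(f"byte offset on /dev/ad0: {byte_offset}")
print(f"inside the range dd_rescue read: {lba < sectors_read}")
```

The decoded value should match the 47208092 shown by both smartctl and MHDD, and it lies inside the range that dd_rescue read with errs: 0.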
Time: 00:02.
© OSzone.net 2001-