Beware of hddtemp and hardware RAID ... or something else.

This forum is dedicated to testing early releases and cauldron : Howtos, tips, tricks and user global feedback and thoughts...

Helpful tip :
For bugs tracking we use : https://bugs.mageia.org = The Mageia Bug Tracker
In this bug tracker you'll find already reported bugs and you'll be able to report those you have found....

Beware of hddtemp and hardware RAID ... or something else.

Postby ghmitch » Apr 1st, '13, 06:24

In the past I have always run hddtemp. So I just tonight installed it on my Cauldren Mageia 3 beta 4? system. All of a sudden I noticed I was taking errors on one of my 3ware RAID controllers.

Code: Select all

Mar 31 20:35:58 localhost.localdomain kernel: 3w-xxxx: scsi8: Unknown scsi opcode: 0x41
Mar 31 20:35:58 localhost.localdomain kernel: sd 8:0:0:0: [sdf] Unhandled error code
Mar 31 20:35:58 localhost.localdomain kernel: sd 8:0:0:0: [sdf] 
Mar 31 20:35:58 localhost.localdomain kernel: Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
Mar 31 20:35:58 localhost.localdomain kernel: sd 8:0:0:0: [sdf] CDB:
Mar 31 20:35:58 localhost.localdomain kernel: Write Same(10): 41 00 11 a1 95 48 00 00 08 00
Mar 31 20:35:58 localhost.localdomain kernel: end_request: I/O error, dev sdf, sector 295802184
Mar 31 20:35:58 localhost.localdomain kernel: sdf1: WRITE SAME failed. Manually zeroing.




Eventually I discover that /etc/sysconfig/hddtemp includes the following by default:

Code: Select all

#
# hddtemp(8) daemon options.  Add at least the disk(s) you want to monitor here.
#
HDDTEMP_OPTIONS="-l 127.0.0.1 /dev/hd? /dev/sd?"



That would seem to force hddtemp to probe all drives including the 3ware pseudo drives. That is not a good idea. I understand that if you don't include this, the user will complain that hddtemp is not working. I don't know what the solution is. But in the meantime, be forewarned. If you are running hardware RAID, don't enable hddtemp without fixing this config file. AND installing hddtemp causes it to start running upon installation.

- George
Last edited by ghmitch on Apr 1st, '13, 06:59, edited 1 time in total.
ghmitch
 
Posts: 325
Joined: Mar 30th, '11, 03:05
Location: Eureka California USA

Re: Beware of hddtemp and hardware RAID !!!

Postby ghmitch » Apr 1st, '13, 06:34

Well, apparently hddtemp is not the problem. It was quiet for a while and then started again. It seems to be happening when I have mcc running. Could mcc be probing my drives in some way to cause this. I just unmounted it and ran a full fsck on it and it is clean. I have changed the 3-ware card, so I know it is not the hardware. It has got to be related to something I installed that was previously installed on my Mageia 2 system. Strange.
ghmitch
 
Posts: 325
Joined: Mar 30th, '11, 03:05
Location: Eureka California USA

Re: Beware of hddtemp and hardware RAID ... or something els

Postby doktor5000 » Apr 1st, '13, 10:58

Well, depending what you run within MCC, yes. Enable the logs in MCC via the menu, and also have a look in /var/log/explanations (IIRC)
Cauldron is not for the faint of heart!
Caution: Hot, bubbling magic inside. May explode or cook your kittens!
----
Disclaimer: Beware of allergic reactions in answer to unconstructive complaint-type posts
User avatar
doktor5000
 
Posts: 18052
Joined: Jun 4th, '11, 10:10
Location: Leipzig, Germany


Return to Testing : Alpha, Beta, RC and Cauldron

Who is online

Users browsing this forum: No registered users and 1 guest

cron