#601856 gkrellm: fans are not always detected on startup

Package:
gkrellm
Source:
gkrellm
Description:
GNU Krell Monitors
Submitter:
Toni Mueller
Date:
2021-01-30 05:21:04 UTC
Severity:
normal
Tags:
#601856#5
Date:
2010-10-30 10:58:34 UTC
From:
To:
Hi,

I run gkrellm from my .xsession file, and often, gkrellm does not detect
all three fans in my computer, but only two of them. After one or two
minutes of being run, I can usually click on the fan menu where the
third fan now appears, deselected. Then I can activate the monitoring of
the fan and see the fan's status. I'd just like gkrellm to come up with
all three fans all of the time.

I have a sysfs mounted on /sys, and mbmon running "for ages".
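
One way to narrow this down is to check whether the kernel itself exposes all fan inputs under /sys at login time, or whether some only show up later. The following is a minimal sketch, assuming the fans are exported through the standard hwmon sysfs layout (`fan*_input` files under `/sys/class/hwmon`). Whether gkrellm reads these particular files on this system is an assumption, and the function names are made up for illustration.

```python
import glob
import os
import time


def list_fan_inputs(sys_root="/sys"):
    """Return the sorted paths of fan*_input files currently exposed in sysfs."""
    patterns = [
        os.path.join(sys_root, "class/hwmon/hwmon*/fan*_input"),
        # Some older drivers place the attributes one level down, under device/.
        os.path.join(sys_root, "class/hwmon/hwmon*/device/fan*_input"),
    ]
    found = set()
    for pattern in patterns:
        found.update(glob.glob(pattern))
    return sorted(found)


def watch_fans(sys_root="/sys", polls=12, interval=10.0):
    """Poll sysfs and return {path: seconds after start when first seen}."""
    first_seen = {}
    start = time.monotonic()
    for i in range(polls):
        elapsed = round(time.monotonic() - start, 1)
        for path in list_fan_inputs(sys_root):
            # setdefault keeps the earliest time at which a path was observed.
            first_seen.setdefault(path, elapsed)
        if i < polls - 1:
            time.sleep(interval)
    return first_seen
```

Calling `watch_fans()` from the same .xsession, just before gkrellm starts, would show whether a fan input first appears some time after 0.0 seconds. If all three are present from the first poll, the delay is more likely on the gkrellm side.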


Kind regards,
--Toni++

#601856#10
Date:
2010-11-03 06:12:38 UTC
From:
To:
Hello Toni,

Can you please try version 2.3.5, uploaded to experimental? From that
version, gkrellm uses libsensors to gather system info.

Regards,

#601856#15
Date:
2010-11-12 20:47:53 UTC
From:
To:
Hi,

a few days ago I installed that version. So far, although I haven't
given it extensive testing yet (i.e., logged in only a few times), it has
not failed me. It is also hard to compare the two versions, as the new
version provides me with many more sensors than before, which led me to
activate five temperature and six fan sensors (all giving different
readings). So far, it has just been working fine. :)

#601856#20
Date:
2010-11-15 08:08:42 UTC
From:
To:
hi,

That's great! :) I think that we can wait for ~1 week and if the new
gkrellm doesn't fail you, I can close this bug as fixed in the recent
version: what do you think?

Regards,

#601856#25
Date:
2010-11-15 08:11:51 UTC
From:
To:
Hi,

well, today it has failed, in exactly the same way as before.

#601856#30
Date:
2010-11-15 10:21:55 UTC
From:
To:
Hi,

that's what I hoped, too, but you should already have my message from
this morning in your mailbox, stating that the new version doesn't
really fix the problem. It only seems to make the problem appear
a little less often.

#601856#35
Date:
2010-11-15 22:57:57 UTC
From:
To:
Gaah, I was hoping it was fixed :( Anyhow, can you try to run gkrellm
passing '-d 0x80' on the command line? It will print all the debug
information for the 'sensors' category, so maybe we're lucky and it will
show what's happening.
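
Since gkrellm is started from .xsession here, its debug output would normally be lost. The following is a minimal sketch of capturing it to a file, assuming the debug messages go to stdout or stderr (an assumption). The function name, log file name, and run time are arbitrary choices for illustration; only the `-d 0x80` option comes from this thread.

```python
import subprocess
from pathlib import Path


def capture_debug_log(command, log_path, seconds):
    """Run command for at most `seconds` seconds, saving stdout and stderr to log_path."""
    log_path = Path(log_path)
    with log_path.open("wb") as log:
        # Merge stderr into stdout so the log keeps messages in order.
        proc = subprocess.Popen(command, stdout=log, stderr=subprocess.STDOUT)
        try:
            proc.wait(timeout=seconds)
        except subprocess.TimeoutExpired:
            # Still running after the time limit: stop it and keep the log so far.
            proc.terminate()
            proc.wait()
    return log_path


# Example call (not executed here):
# capture_debug_log(["gkrellm", "-d", "0x80"], "gkrellm-sensors.log", 180)
```

Three minutes should cover the "one or two minutes" after which the missing fan usually becomes selectable, so the log would include both the startup scan and anything detected later.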

Regards,

#601856#40
Date:
2011-08-24 22:05:42 UTC
From:
To:
Is this still happening? Can you please provide the information requested above?

#601856#45
Date:
2011-08-25 09:34:06 UTC
From:
To:
Hi,

yes, it still happens every so often - probably about every other day. I
have been running 3.0rc6 for about one or two months now, if that matters
at all. The requested logs are now attached, but this time, gkrellm
didn't fail to list any sensors that I had enabled.

Kind regards,
--Toni++

#601856#50
Date:
2011-08-25 16:29:34 UTC
From:
To:
forwarded 601856 gkrellm@lists.netservicesgroup.com
thanks

Dear Gkrellm developers, a Debian user reported this problem. He
confirms it still exists with 2.3.5. I asked him for the output of -d
0x80, which you can find attached. Could you please take a look?

Thanks in advance,
Sandro

#601856#57
Date:
2012-05-12 11:05:33 UTC
From:
To:
Hi,

I just stumbled over this old bug report of mine since gkrellm now
seems to lose not only some (most) fans every other day, but disks, too.
Yesterday, it would not detect two of my four disks for the life of me,
but today, I could at least enable them in the configuration menu. But
the disk temperature is now shown as zero degrees Celsius on all disks,
no matter what.


Here's a sample smartctl output from one of the disks:

# smartctl -a /dev/sda
smartctl 5.40 2010-07-12 r3124 [i686-pc-linux-gnu] (local build)
Copyright (C) 2002-10 by Bruce Allen, http://smartmontools.sourceforge.net

=== START OF INFORMATION SECTION ===
Model Family:     Seagate Barracuda 7200.12 family
Device Model:     ST31000528AS
Serial Number:    9VP25GY3
Firmware Version: CC37
User Capacity:    1,000,204,886,016 bytes
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   8
ATA Standard is:  ATA-8-ACS revision 4
Local Time is:    Sat May 12 12:55:53 2012 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x82) Offline data collection activity
                                        was completed without error.
                                        Auto Offline Data Collection: Enabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                 ( 600) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   1) minutes.
Extended self-test routine
recommended polling time:        ( 176) minutes.
Conveyance self-test routine
recommended polling time:        (   2) minutes.
SCT capabilities:              (0x103f) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   114   099   006    Pre-fail  Always       -       65266772
  3 Spin_Up_Time            0x0003   095   094   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   099   099   020    Old_age   Always       -       1504
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   084   060   030    Pre-fail  Always       -       302857693
  9 Power_On_Hours          0x0032   083   083   000    Old_age   Always       -       15063
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       752
183 Runtime_Bad_Block       0x0032   100   100   000    Old_age   Always       -       0
184 End-to-End_Error        0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Command_Timeout         0x0032   100   096   000    Old_age   Always       -       437
189 High_Fly_Writes         0x003a   100   100   000    Old_age   Always       -       0
190 Airflow_Temperature_Cel 0x0022   059   054   045    Old_age   Always       -       41 (Lifetime Min/Max 17/41)
194 Temperature_Celsius     0x0022   041   046   000    Old_age   Always       -       41 (0 10 0 0)
195 Hardware_ECC_Recovered  0x001a   037   024   000    Old_age   Always       -       65266772
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       31490700231406
241 Total_LBAs_Written      0x0000   100   253   000    Old_age   Offline      -       573809667
242 Total_LBAs_Read         0x0000   100   253   000    Old_age   Offline      -       2800604567

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
No self-tests have been logged.  [To run self-tests, use: smartctl -t]


SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.



This is on a 3.3 kernel from experimental, and with your 2.3.5 gkrellm.

If you have any other ideas about what to check, I'm interested to know.
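
For cross-checking the zero reading, the temperature the drive itself reports can be pulled out of the attribute table shown above. This is a minimal sketch that assumes the ten-column table layout of the smartctl output in this message; the function name is hypothetical, and it says nothing about how gkrellm obtains disk temperatures.

```python
def parse_smart_attribute(smartctl_output, attribute_id):
    """Return the leading integer of RAW_VALUE for one SMART attribute, or None."""
    wanted = str(attribute_id)
    for line in smartctl_output.splitlines():
        fields = line.split()
        # Attribute rows have at least ten columns:
        # ID, name, flag, value, worst, thresh, type, updated, when_failed, raw.
        if len(fields) >= 10 and fields[0] == wanted:
            try:
                return int(fields[9])
            except ValueError:
                return None
    return None


# Row copied from the smartctl output in this message.
sample = "194 Temperature_Celsius     0x0022   041   046   000    Old_age   Always       -       41 (0 10 0 0)"
print(parse_smart_attribute(sample, 194))  # prints 41
```

Fed with the output of `smartctl -A /dev/sda` (which needs root), this gives a number to compare against what gkrellm displays for the same disk.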



Kind regards,
--Toni++

#601856#64
Date:
2021-01-30 05:15:53 UTC
From:
To:
control: tags -1 +moreinfo

This bug was reported a looong time ago, but upstream recently
suggested these steps:

```
Does this still happen with recent versions (i.e. 2.3.10 or 2.3.11) in Debian?
The Debian package enabled libsensors support in 2.3.5-1, so chances
are that fan detection has been more reliable since then.

If that is not the case please try starting gkrellm as gkrellm -d 0x80.
The (extensive) sensor log output should show which sensors are seen
on startup and which sensors appear later on.
```

Can you follow up, please?