"failed to insert coretemp" no such device

Well I shutdown the notebook and the next time I tried to boot I am stuck at run level 3. I can login and poke around but the computer won’t go to run level 5 due to this module problem. (I’m doing sneakernet with a thumb drive since obviously I can’t use the forum without graphics. I am also sinning by using Windows 10.)

The system instructed me to run
systemctl status systemd-modules-load.service. Results:


● systemd-modules-load.service - Load Kernel Modules
   Loaded: loaded (/usr/lib/systemd/system/systemd-modules-load.service; static; vendor preset: disabled)
   Active: failed (Result: exit-code) since Mon 2021-05-24 00:03:37 PDT; 53min ago
     Docs: man:systemd-modules-load.service(8)
           man:modules-load.d(5)
  Process: 1549 ExecStart=/usr/lib/systemd/systemd-modules-load (code=exited, status=1/FAILURE)
 Main PID: 1549 (code=exited, status=1/FAILURE)


May 24 00:03:36 linux-yxjo systemd[1]: Starting Load Kernel Modules...
May 24 00:03:37 linux-yxjo systemd-modules-load[1549]: Failed to insert 'coretemp': No such device
May 24 00:03:37 linux-yxjo systemd[1]: systemd-modules-load.service: Main process exited, code=exited, status=1/FAILURE
May 24 00:03:37 linux-yxjo systemd[1]: Failed to start Load Kernel Modules.
May 24 00:03:37 linux-yxjo systemd[1]: systemd-modules-load.service: Unit entered failed state.
May 24 00:03:37 linux-yxjo systemd[1]: systemd-modules-load.service: Failed with result 'exit-code'.

Here is the output of the sensors command:


iwlwifi_1-virtual-0
Adapter: Virtual device
temp1:        +34.0°C  


k10temp-pci-00c3
Adapter: PCI adapter
Vcore:        +0.70 V  
Vsoc:         +0.80 V  
Tdie:         +38.5°C  
Tctl:         +38.5°C  
Icore:        +7.00 A  
Isoc:         +7.75 A  


nvme-pci-0200
Adapter: PCI adapter
Composite:    +35.9°C  (low  = -273.1°C, high = +80.8°C)
                       (crit = +80.8°C)
Sensor 1:     +35.9°C  (low  = -273.1°C, high = +65261.8°C)
Sensor 2:     +35.9°C  (low  = -273.1°C, high = +65261.8°C)


BAT0-acpi-0
Adapter: ACPI interface
in0:         +13.01 V  


thinkpad-isa-0000
Adapter: ISA adapter
fan1:           0 RPM
temp1:        +38.0°C  
temp2:         +0.0°C  
temp3:         +0.0°C  
temp4:         +0.0°C  
temp5:         +0.0°C  
temp6:         +0.0°C  
temp7:         +0.0°C  
temp8:            N/A  
temp9:         +0.0°C  
temp10:        +0.0°C  
temp11:       +66.0°C  
temp12:        +0.0°C  
temp13:        +0.0°C  
temp14:        +0.0°C  
temp15:        +0.0°C  
temp16:        +0.0°C  


amdgpu-pci-0700
Adapter: PCI adapter
vddgfx:           N/A  
vddnb:            N/A  
edge:         +38.0°C  (crit = +80.0°C, hyst =  +0.0°C)



This is my sensors3.conf file contents:


# the same hardware won't need to recreate it again and again.


chip "lm78-*" "lm79-*" "lm80-*" "lm96080-*"


    label temp1 "M/B Temp"




chip "w83792d-*"


    label in0 "VcoreA"
    label in1 "VcoreB"
    label in6 "+5V"
    label in7 "5VSB"
    label in8 "Vbat"


    set in6_min  5.0 * 0.90
    set in6_max  5.0 * 1.10
    set in7_min  5.0 * 0.90
    set in7_max  5.0 * 1.10
    set in8_min  3.0 * 0.90
    set in8_max  3.0 * 1.10




chip "w83793-*"


    label in0 "VcoreA"
    label in1 "VcoreB"
    label in7 "+5V"
    label in8 "5VSB"
    label in9 "Vbat"


    set in7_min  5.0 * 0.90
    set in7_max  5.0 * 1.10
    set in8_min  5.0 * 0.90
    set in8_max  5.0 * 1.10
    set in9_min  3.0 * 0.90
    set in9_max  3.0 * 1.10




chip "w83795g-*" "w83795adg-*"


    label in12 "+3.3V"
    label in13 "3VSB"
    label in14 "Vbat"


    set in12_min  3.3 * 0.90
    set in12_max  3.3 * 1.10
    set in13_min  3.3 * 0.90
    set in13_max  3.3 * 1.10
    set in14_min  3.0 * 0.90
    set in14_max  3.3 * 1.10




chip "via686a-*"


    label in0 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in4_min 12.0 * 0.90
    set in4_max 12.0 * 1.10




chip "adm1025-*" "ne1619-*"


    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"
    label in5 "VCC"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in5_min  3.3 * 0.90
    set in5_max  3.3 * 1.10
# Depending on how your chip is hardwired, you may or may not have
# +12V readings.
#    set in4_min 12.0 * 0.90
#    set in4_max 12.0 * 1.10


    label temp1 "CPU Temp"
    label temp2 "M/B Temp"




chip "lm87-*" "adm1024-*"


    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in4_min 12.0 * 0.90
    set in4_max 12.0 * 1.10


    label temp1 "M/B Temp"
    label temp2 "CPU Temp"




chip "it87-*" "it8712-*" "it8716-*" "it8718-*" "it8720-*"


    label in8 "Vbat"




chip "fscpos-*" "fscher-*"
#FSC "Hermes"


    label in0 "+12V"
    label in1 "+5V"
    label in2 "Vbat"


    label temp1 "CPU Temp"
    label temp2 "M/B Temp"
    label temp3 "Aux Temp"




chip "fscscy-*"
#FSC "Scylla"


    label in0 "+12V"
    label in1 "+5V"
    label in2 "+3.3V"


    label temp1 "CPU0 Temp"
    label temp2 "CPU1 Temp"
    label temp3 "M/B Temp"
    label temp4 "Aux Temp"




chip "fschds-*"
# Fujitsu Technology Solutions, "Hades"-Chip


# Temperatures
    label temp1 "CPU Temp"
    label temp2 "Super I/O Temp"
    label temp3 "System Temp"


# Fans
    label fan1 "PSU Fan"
    label fan2 "CPU Fan"
    label fan3 "System FAN2"
    label fan4 "System FAN3"
    label fan5 "System FAN4"


# Voltages
    label in0 "+12V"
    label in1 "+5V"
    label in2 "Vbat"


chip "fscsyl-*"
# Fujitsu Technology Solutions, "Syleus"-Chip


# Temperatures
    label temp1 "CPU Temp"
    label temp4 "Super I/O Temp"
    label temp5 "Northbridge Temp"


# Fans
    label fan1 "CPU Fan"
    label fan2 "System FAN2"
    label fan3 "System FAN3"
    label fan4 "System FAN4"
    label fan7 "PSU Fan"


# Voltages
    label in0 "+12V"
    label in1 "+5V"
    label in2 "Vbat"
    label in3 "+3.3V"
    label in5 "+3.3V-Aux"


chip "vt1211-*"


    label in5 "+3.3V"


    label temp2 "SIO Temp"




chip "vt8231-*"


    label in5 "+3.3V"




chip "smsc47m192-*"


    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"
    label in5 "VCC"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in4_min 12.0 * 0.90
    set in4_max 12.0 * 1.10
    set in5_min  3.3 * 0.90
    set in5_max  3.3 * 1.10


    label temp1 "SIO Temp"




chip "lm85-*" "lm85b-*" "lm85c-*" "adm1027-*" "adt7463-*" "adt7468-*" \
     "emc6d100-*" "emc6d102-*" "emc6d103-*" "emc6d103s-*" 


    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
# Depending on how your chip is hardwired, you may or may not have
# +12V readings.
#    set in4_min 12.0 * 0.90
#    set in4_max 12.0 * 1.10


    label temp2 "M/B Temp"




chip "emc6w201-*"


    label in2 "+3.3V"
    label in3 "+5V"


    label temp6 "M/B Temp"




chip "pc87365-*" "pc87366-*"


# Voltage inputs


    label in7 "3VSB"
    label in8 "VDD"
    label in9 "Vbat"
    label in10 "AVDD"


    compute in7   @*2, @/2
    compute in8   @*2, @/2
    compute in10  @*2, @/2


# These are the operating conditions as recommended by National
# Semiconductor
    set in7_min   3.0
    set in7_max   3.6
    set in8_min   3.0
    set in8_max   3.6
    set in10_min  3.0
    set in10_max  3.6
# Depending on the hardware setup, the battery voltage may or may not
# be monitored.
#    set in9_min   2.4
#    set in9_max   3.6


    label temp3 "SIO Temp"


    set temp3_min    0
    set temp3_max   70
    set temp3_crit  85




chip "adm1030-*" "adm1031-*"


    label temp1 "M/B Temp"




chip "w83627thf-*"


    label in3 "+5V"
    label in7 "5VSB"
    label in8 "Vbat"


    # Internal resistors
    compute in3  @ * (1 + 34/51), @ / (1 + 34/51)
    compute in7  @ * (1 + 34/51), @ / (1 + 34/51)


    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in7_min  5.0 * 0.90
    set in7_max  5.0 * 1.10
# The battery voltage may or may not be monitored.
#    set in8_min  3.0 * 0.90
#    set in8_max  3.0 * 1.10




chip "w83627ehf-*" "w83627dhg-*" "w83667hg-*" "nct6775-*" "nct6776-*" \
     "nct6779-*" "nct6791-*" "nct6795-*" "nct6796-*"


    label in0 "Vcore"
    label in2 "AVCC"
    label in3 "+3.3V"
    label in7 "3VSB"
    label in8 "Vbat"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  3.3 * 0.90
    set in3_max  3.3 * 1.10
    set in7_min  3.3 * 0.90
    set in7_max  3.3 * 1.10
    set in8_min  3.0 * 0.90
    set in8_max  3.3 * 1.10




chip "w83627uhg-*"


    label in2 "AVCC"
    label in3 "+5V"
    label in7 "5VSB"
    label in8 "Vbat"


    set in2_min  5.0 * 0.90
    set in2_max  5.0 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in7_min  5.0 * 0.90
    set in7_max  5.0 * 1.10
    set in8_min  3.0 * 0.90
    set in8_max  3.3 * 1.10




chip "f71805f-*"


    label in0 "+3.3V"


    set in0_min  3.3 * 0.90
    set in0_max  3.3 * 1.10




chip "f71872f-*"


    label in0 "+3.3V"
    label in9 "Vbat"
    label in10 "3VSB"


    set in0_min   3.3 * 0.90
    set in0_max   3.3 * 1.10
    set in9_min   3.0 * 0.90
    set in9_max   3.0 * 1.10
    set in10_min  3.3 * 0.90
    set in10_max  3.3 * 1.10




chip "k8temp-*"


    label temp1 "Core0 Temp"
    label temp2 "Core0 Temp"
    label temp3 "Core1 Temp"
    label temp4 "Core1 Temp"




chip "dme1737-*"


    label in0 "5VSB"
    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"
    label in5 "3VSB"
    label in6 "Vbat"


    label temp2 "SIO Temp"


    set in0_min  5.0 * 0.90
    set in0_max  5.0 * 1.10
    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in4_min 12.0 * 0.90
    set in4_max 12.0 * 1.10
    set in5_min  3.3 * 0.90
    set in5_max  3.3 * 1.10
    set in6_min  3.0 * 0.90
    set in6_max  3.0 * 1.10




chip "sch311x-*"


    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"
    label in5 "3VSB"
    label in6 "Vbat"


    label temp2 "SIO Temp"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
    set in4_min 12.0 * 0.90
    set in4_max 12.0 * 1.10
    set in5_min  3.3 * 0.90
    set in5_max  3.3 * 1.10
    set in6_min  3.0 * 0.90
    set in6_max  3.0 * 1.10




chip "sch5027-*"


    label in0 "5VSB"
    label in1 "Vcore"
    label in2 "+3.3V"
    label in5 "3VSB"
    label in6 "Vbat"


    label temp2 "SIO Temp"


    set in0_min  5.0 * 0.90
    set in0_max  5.0 * 1.10
    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in5_min  3.3 * 0.90
    set in5_max  3.3 * 1.10
    set in6_min  3.0 * 0.90
    set in6_max  3.0 * 1.10




chip "sch5127-*"


    label in2 "+3.3V"
    label in5 "3VSB"
    label in6 "Vbat"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in5_min  3.3 * 0.90
    set in5_max  3.3 * 1.10
    set in6_min  3.0 * 0.90
    set in6_max  3.0 * 1.10




chip "f71808e-*" "f71808a-*" "f71862fg-*" "f71869-*" "f71869a-*" "f71882fg-*" \
     "f71889fg-*" "f71889ed-*" "f71889a-*"


    label in0 "+3.3V"
    label in7 "3VSB"
    label in8 "Vbat"


    compute in0  @*2, @/2
    compute in7  @*2, @/2
    compute in8  @*2, @/2




chip "f71858fg-*" "f8000-*"


    label in0 "+3.3V"
    label in1 "3VSB"
    label in2 "Vbat"


    compute in0  @*2, @/2
    compute in1  @*2, @/2
    compute in2  @*2, @/2




chip "f71868a-*"


    label in0 "+3.3V"
    label in7 "3VSB"
    label in8 "Vbat"
    label in9 "5VSB"


    compute in0  @*2, @/2
    compute in7  @*2, @/2
    compute in8  @*2, @/2
    compute in9  @*3, @/3




chip "f81865f-*"


    label in0 "+3.3V"
    label in5 "3VSB"
    label in6 "Vbat"


    compute in0  @*2, @/2
    compute in5  @*2, @/2
    compute in6  @*2, @/2




chip "adt7473-*" "adt7475-*"


    label in2 "+3.3V"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10


    label temp2 "Board Temp"




chip "adt7476-*" "adt7490-*"


    label in1 "Vcore"
    label in2 "+3.3V"
    label in3 "+5V"
    label in4 "+12V"


    set in2_min  3.3 * 0.90
    set in2_max  3.3 * 1.10
    set in3_min  5.0 * 0.90
    set in3_max  5.0 * 1.10
# Depending on how your ADT7476 is hardwired, you may or may not have
# +12V readings.
#    set in4_min 12.0 * 0.90
#    set in4_max 12.0 * 1.10


    label temp2 "M/B Temp"



Results from uname -a:


Linux linux-yxjo 5.3.18-lp152.75-default #1 SMP Wed May 5 09:22:56 UTC 2021 (16c42c8) x86_64 x86_64 x86_64 GNU/Linux

I did a zypper up but that didn’t fix anything.

Suggestions?

Hi
As root user have you run;


sensors-detect --auto

As you can guess I am sort of hampered going back and force between operating systems and thumbdrive sneakernet. I ran sensors-detect without the auto option before my first post and it didn’t fix anything. However I went back and ran it per your suggestion with the --auto and that doesn’t fix it. I assume --auto means just say yes to all the questions which is what I did the first time.

I hadn’t realized that 15.2 saves the last four kernels. I tried booting with the older kernels and that didn’t make a difference.

Hi
It could be a bug (no think* hardware here) between coretemp-isa* and thinkpad-isa* you don’t have anything special in your kernel boot options?

As root user what is the output from;


fgrep -r coretemp /etc/*

Or it could not be needed for that device…

The fgrep finds nothing. Remember this notebook was working, shutdown, then the problem started on the next boot. I haven’t changed the boot options.

Here is some stuff that might prove useful. I’m just taking wild stabs here hoping this will help. First here is the journalctl where it is loading modules.


May 24 10:49:39 linux-yxjo systemd-journald[209]: Runtime journal (/run/log/journal/b4261b7617c54f38bd560604e2611e15) is 8.0M, max 1.0G, 1.0G free.
May 24 10:49:39 linux-yxjo systemd-modules-load[207]: Inserted module 'scsi_dh_alua'
May 24 10:49:39 linux-yxjo systemd-modules-load[207]: Inserted module 'scsi_dh_emc'
May 24 10:49:39 linux-yxjo systemd-modules-load[207]: Inserted module 'scsi_dh_rdac'
May 24 10:49:39 linux-yxjo systemd-modules-load[207]: Inserted module 'dm_multipath'
May 24 10:49:39 linux-yxjo systemd-modules-load[207]: Inserted module 'sg'
May 24 10:49:39 linux-yxjo systemd-modules-load[207]: Inserted module 'msr'
May 24 10:49:40 linux-yxjo systemd-modules-load[207]: Failed to insert 'coretemp': No such device

I replace coretemp with msr in the fgrep just so you have some confidence that I entered the command correctly. No fat finger mistake.


/etc/apparmor/severity.db:/dev/msr	4 10 0
/etc/protocols:# [Sridhar] Manickam R.Sridhar <msridhar@sitaranetworks.com>, September 1997.
/etc/rpm/macros.suse-ruby2.5:%rubygemsruby25() rubygem(ruby:2.5.0:%{expand:%%rubygemsx%*} %{expand:%%{rubygems%*}}
/etc/services:ohimsrv            506/tcp      # ohimsrv  [Scott_Powell]
/etc/services:ohimsrv            506/udp      # ohimsrv  [Scott_Powell]
/etc/services:msrp               2855/tcp     # MSRP over TCP (updated 2014-04-09) [RFC4976]
/etc/services:ovalarmsrv         2953/tcp     # OVALARMSRV 
/etc/services:ovalarmsrv         2953/udp     # OVALARMSRV 
/etc/services:ovalarmsrv-cmd     2954/tcp     # OVALARMSRV-CMD  [Eric_Pulsipher]
/etc/services:ovalarmsrv-cmd     2954/udp     # OVALARMSRV-CMD  [Eric_Pulsipher]
/etc/services:msr-plugin-port    3931/tcp     # MSR Plugin Port  [Ron_Steed]
/etc/services:msr-plugin-port    3931/udp     # MSR Plugin Port  [Ron_Steed]
/etc/xml/catalog.d/xmlcharent.xml:	<public publicId="ISO 8879:1986//ENTITIES Added Math Symbols: Relations//EN//XML" uri="file:///usr/share/xmlcharent/entities/iso-amsr.ent"/>
/etc/xml/catalog.d/docbook_4.xml:    <public uri="file:///usr/share/xml/docbook/schema/dtd/4.5/ent/isoamsr.ent" publicId="ISO 8879:1986//ENTITIES Added Math Symbols: Relations//EN//XML"/>
/etc/xml/catalog.d/docbook_4.xml:    <system uri="file:///usr/share/xml/docbook/schema/dtd/4.5/ent/isoamsr.ent" systemId="http://www.w3.org/2003/entities/iso8879/isoamsr.ent"/>
/etc/xml/catalog-d.xml:    <public uri="file:///usr/share/xml/docbook/schema/dtd/4.5/ent/isoamsr.ent" publicId="ISO 8879:1986//ENTITIES Added Math Symbols: Relations//EN//XML"/>
/etc/xml/catalog-d.xml:    <system uri="file:///usr/share/xml/docbook/schema/dtd/4.5/ent/isoamsr.ent" systemId="http://www.w3.org/2003/entities/iso8879/isoamsr.ent"/>
/etc/xml/catalog-d.xml:	<public publicId="ISO 8879:1986//ENTITIES Added Math Symbols: Relations//EN//XML" uri="file:///usr/share/xmlcharent/entities/iso-amsr.ent"/>



Perhaps this is a bug related to the coretemp problem. Note the word thermal.


May 24 17:49:42 linux-yxjo kernel: EDAC amd64: Node 0: DRAM ECC disabled.
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
                                    Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
                                    (Note that use of the override may cause unknown side effects.)
May 24 17:49:42 linux-yxjo kernel: iwlwifi 0000:01:00.0: Applying debug destination EXTERNAL_DRAM
May 24 17:49:42 linux-yxjo kernel: iwlwifi 0000:01:00.0: Allocated 0x00400000 bytes for firmware monitor.
May 24 17:49:42 linux-yxjo laptop-mode[1581]: Laptop mode
May 24 17:49:42 linux-yxjo laptop-mode[1584]: enabled, not active
May 24 17:49:42 linux-yxjo systemd-modules-load[1316]: Failed to insert 'coretemp': No such device
May 24 17:49:42 linux-yxjo systemd[1]: systemd-modules-load.service: Main process exited, code=exited, status=1/FAILURE
May 24 17:49:42 linux-yxjo systemd[1]: Failed to start Load Kernel Modules.
May 24 17:49:42 linux-yxjo systemd[1]: systemd-modules-load.service: Unit entered failed state.
May 24 17:49:42 linux-yxjo systemd[1]: systemd-modules-load.service: Failed with result 'exit-code'.
May 24 17:49:42 linux-yxjo kernel: iwlwifi 0000:01:00.0: base HW address: 5c:87:9c:70:90:4f
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: F17h_M10h detected (node 0).
May 24 17:49:42 linux-yxjo kernel: EDAC MC: UMC0 chip selects:
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: MC: 0:  8192MB 1:     0MB
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: MC: 2:     0MB 3:     0MB
May 24 17:49:42 linux-yxjo kernel: EDAC MC: UMC1 chip selects:
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: MC: 0:  8192MB 1:  8192MB
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: MC: 2:     0MB 3:     0MB
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: using x4 syndromes.
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: Node 0: DRAM ECC disabled.
May 24 17:49:42 linux-yxjo kernel: EDAC amd64: ECC disabled in the BIOS or no ECC capability, module will not load.
                                    Either enable ECC checking or force module loading by setting 'ecc_enable_override'.
                                    (Note that use of the override may cause unknown side effects.)
May 24 17:49:42 linux-yxjo kernel: ieee80211 phy0: Selected rate control algorithm 'iwl-mvm-rs'
May 24 17:49:42 linux-yxjo kernel: thermal thermal_zone0: failed to read out thermal zone (-61)

Hi
Well since your on AMD, then for sure coretemp isn’t needed… it’s for intel…

My HP 14-an013nr… (SLED 15.2 though)


 sensors
amdgpu-pci-0008
Adapter: PCI adapter
GPU Temperature:  +47.0°C  (crit = +120.0°C, hyst = +90.0°C)

k10temp-pci-00c3
Adapter: PCI adapter
K10 CPU Temperature:  +47.5°C  (high = +70.0°C)
                               (crit = +105.0°C, hyst = +104.0°C)

acpitz-acpi-0
Adapter: ACPI interface
ACPI CPU Temperature:  +44.0°C  (crit = +120.0°C)

iwlwifi_1-virtual-0
Adapter: Virtual device
temp1:        +31.0°C  

BAT0-acpi-0
Adapter: ACPI interface
in0:         +10.75 V  
curr1:        +1.47 A  

fam15h_power-pci-00c4
Adapter: PCI adapter
Power Adapter:   0.00 W  (interval =   0.01 s, crit =  14.98 W)

k10-temp = coretemp

unless you changed it openSUSE does not keep 4 kernels

check if you have free space

Well it is in my kernel. This is the result of locate coretemp.ko:


/lib/modules/5.3.18-lp152.41-default/kernel/drivers/hwmon/coretemp.ko
/lib/modules/5.3.18-lp152.69-default/kernel/drivers/hwmon/coretemp.ko
/lib/modules/5.3.18-lp152.72-default/kernel/drivers/hwmon/coretemp.ko
/lib/modules/5.3.18-lp152.75-default/kernel/drivers/hwmon/coretemp.ko

Is this something you blacklist, maybe in udev?

As if I would know how to change opensuse to save multiple kernels! :wink: Though it doesn’t sound like a bad idea to save one. Rolling back 4 seems a bit extreme.

Since I am doing a sneakernet I can’t easily do a du but I know I am not out of disk space.

Hi
And /sbin/modinfo coretemp shows…


/sbin/modinfo coretemp |grep description:
description:    Intel Core temperature monitor

/sbin/modinfo k10temp |grep description:
description:    AMD Family 10h+ CPU core temperature monitor

Here is an intel system running Leap 15.2…


sensors
coretemp-isa-0000
Adapter: ISA adapter
CPU Package Temperature:  +29.0°C  (high = +93.0°C, crit = +103.0°C)
CPU Core 0 Temperature:   +23.0°C  (high = +93.0°C, crit = +103.0°C)
CPU Core 1 Temperature:   +24.0°C  (high = +93.0°C, crit = +103.0°C)
CPU Core 2 Temperature:   +29.0°C  (high = +93.0°C, crit = +103.0°C)
CPU Core 3 Temperature:   +29.0°C  (high = +93.0°C, crit = +103.0°C)

uname -a
Linux oscar 5.3.18-lp152.75-default #1 SMP Wed May 5 09:22:56 UTC 2021 (16c42c8) x86_64 x86_64 x86_64 GNU/Linux

The systemd-modules-load.service does not run here…, check the service…


systemctl status systemd-modules-load.service

If it’s enabled, disable and see how a reboot goes… or you have met the condition…


 systemctl status systemd-modules-load.service
● systemd-modules-load.service - Load Kernel Modules
   Loaded: loaded (/usr/lib/systemd/system/systemd-modules-load.service; static; vendor preset: disabled)
   Active: inactive (dead)
Condition: start condition failed at Mon 2021-05-24 21:51:11 CDT; 5min ago
           ├─ ConditionDirectoryNotEmpty=|/lib/modules-load.d was not met
           ├─ ConditionDirectoryNotEmpty=|/usr/lib/modules-load.d was not met
           ├─ ConditionDirectoryNotEmpty=|/usr/local/lib/modules-load.d was not met
           ├─ ConditionDirectoryNotEmpty=|/etc/modules-load.d was not met
           ├─ ConditionDirectoryNotEmpty=|/run/modules-load.d was not met
           ├─ ConditionKernelCommandLine=|modules-load was not met
           └─ ConditionKernelCommandLine=|rd.modules-load was not met
     Docs: man:systemd-modules-load.service(8)
           man:modules-load.d(5)

Stopping systemd-modules-load.service didn’t help. I assume systemd is set up not to enter runlevel 5 unless the modules are loaded.

After much poking around, I found systemd-modules-load.service. Here are the contents of the file:


#  This file is part of systemd.
#
#  systemd is free software; you can redistribute it and/or modify it
#  under the terms of the GNU Lesser General Public License as published by
#  the Free Software Foundation; either version 2.1 of the License, or
#  (at your option) any later version.


[Unit]
Description=Load Kernel Modules
Documentation=man:systemd-modules-load.service(8) man:modules-load.d(5)
DefaultDependencies=no
Conflicts=shutdown.target
Before=sysinit.target shutdown.target
ConditionCapability=CAP_SYS_MODULE
ConditionDirectoryNotEmpty=|/lib/modules-load.d
ConditionDirectoryNotEmpty=|/usr/lib/modules-load.d
ConditionDirectoryNotEmpty=|/usr/local/lib/modules-load.d
ConditionDirectoryNotEmpty=|/etc/modules-load.d
ConditionDirectoryNotEmpty=|/run/modules-load.d
ConditionKernelCommandLine=|modules-load
ConditionKernelCommandLine=|rd.modules-load


[Service]
Type=oneshot
RemainAfterExit=yes
ExecStart=/usr/lib/systemd/systemd-modules-load
TimeoutSec=90s

Now /usr/lib/modules-load.d contains these files:


multipath.conf
sg.conf
thermald.conf
xl2tpd.conf

And thermald.conf is what calls up coretemp. You can just comment the line out.

The problem I face now is messing with systemd-modules-load.service seems to have stops the modules that used to be inserted from now being inserted. That is journalctl doesn’t have those insert lines anymore. I don’t know if that was supposed to be a one time insertion or insert the modules on every boot. Maybe there are some useful modprobes I can do.

Is there a way to simply force run level 5 to make debugging simpler?

Or have things reached the point where I should just reload the OS and try to rebuild the system? Maybe try the release candidate for 15.3?

The number of kernels retained is controlled by a setting in “etc/zypp/zypp.conf”

for example:

multiversion.kernels = latest,latest-1,running

options for the setting are detailed in the configuration file.

Hi
So if you remove thermald? (never used it)

Well I certainly never changed this but my options are latest,latest-1,latest-2,running,oldest

And now the solution. I fixed a few things that probably should have been fixed (I will elaborate) but the real problem is the program sax3. Oh and I am entering this reply from opensuse not (gasp) windows. I have reached runlevel 5.

One really annoying thing on this lenovo thinkpad is the touchpad likes to zoom when you want to move . Very annoying on google earth. I loaded sax3 to see if I could fix this problem. I did change something. To my recollection I got rid of the single finger scroll since two finger scroll is common these days.

So at some point I shut down the notebook and later tried to boot it and it wouldn’t boot. I’m not sure why the coretemp problem started. As I posted earlier just commenting out one parameter fixed that problem.

The ECC message is more of a warning than an actual bug. I added “blacklist amd64_edac_mod” to /etc/modprobe.d/50-blacklist.conf

The reason I wasn’t reaching runlevel 5 had to do with X11. Here is the relevant part of journalctl:


May 25 14:41:05 linux-yxjo sddm[2349]: Initializing...
May 25 14:41:05 linux-yxjo sddm[2349]: Starting...
May 25 14:41:05 linux-yxjo sddm[2349]: Logind interface found
May 25 14:41:05 linux-yxjo sddm[2349]: Adding new display on vt 7 ...
May 25 14:41:05 linux-yxjo sddm[2349]: Loading theme configuration from
""
May 25 14:41:05 linux-yxjo sddm[2349]: Display server starting...
May 25 14:41:05 linux-yxjo sddm[2349]: Adding cookie to "/run/sddm
{09328ca4-2733-47cb-ae8d-7f373019610e}"
May 25 14:41:05 linux-yxjo sddm[2349]: Running: /usr/bin/X -nolisten
tcp -auth /run/sddm/{09328ca4-2733-47cb-ae8d-7f373019610e} -backgroundnone -noreset -displayfd 17 -seat seat0 vt7
May 25 14:41:05 linux-yxjo sddm[2349]: Failed to read display number
from pipe
May 25 14:41:05 linux-yxjo sddm[2349]: Display server failed to start.
Exiting
May 25 14:41:05 linux-yxjo startproc[2348]: startproc: signal catched
/usr/bin/sddm: Aborted
May 25 14:41:05 linux-yxjo display-manager[2328]: Starting service
sddm..failed
May 25 14:41:05 linux-yxjo systemd[1]: display-manager.service: Control
process exited, code=exited status=1
May 25 14:41:05 linux-yxjo systemd[1]: Failed to start X Display
Manager.
-- Subject: Unit display-manager.service has failed
-- Defined-By: systemd
-- Support: https://lists.freedesktop.org/mailman/listinfo/systemd
devel
-- 
-- Unit display-manager.service has failed.
-- 
-- The result is failed.


I went into /var/log and looked at Xorg.0.log


 1363.777] (==) Using config directory: "/etc/X11/xorg.conf.d"
 1363.777] (==) Using system config directory "/usr/share/X11
xorg.conf.d"
 1363.778] Parse error on line 55 of section InputClass in file /etc/X11/xorg.conf.d/70-synaptics.conf
    The MatchIsTouchpad keyword requires a quoted string to
follow it.

Remember I had installed sax3. I certainly wouldn't have changed this file directly but the program did.


Here is line 55. I added the quotes and can now boot the system and reach runlevel 5


MatchIsTouchpad "on"

I still need to verify I haven’t broken something else along the way doing my (ahem) other fixes.

sax3 ???

I know I am not the youngest, thus this touched something in my memory. Does that really still exist? And is it still good for something?

This is definitely not standard.

Its in the OSS Repo:

zypper if sax3
Loading repository data...
Reading installed packages...


Information for package sax3:
-----------------------------
Repository     : OSS
Name           : sax3
Version        : 0.1.2-lp152.3.8
Arch           : x86_64
Vendor         : openSUSE
Installed Size : 316,4 KiB
Installed      : No
Status         : not installed
Source package : sax3-0.1.2-lp152.3.8.src
Summary        : A Graphical Configuration Tool for X
Description    : 
    A graphical utility for configuring X server settings. It can run without X with a graphical interface.

    Authors:
    _______
    Manu Gupta
    Michal Hrusecky

But not developed anymore for 5 years and the URLs for the sources are not working.
https://software.opensuse.org/package/sax3?search_term=sax3

Maybe it should be deleted from the Distribution…