Page 1 of 2 12 LastLast
Results 1 to 10 of 12

Thread: drm/915: Resetting chip after gpu hang

  1. #1

    Default drm/915: Resetting chip after gpu hang

    Laptop is a Lenovo T460s with latest Tumbleweed updates .

    Sometimes (not always) after wake up from screen lock
    the system is not responsive and only power button works.

    This issue is perfectly described here by someone on a debian /ubuntu system
    https://mail-archive.com/debian-bugs...msg438367.html

    so it probably is kernel related , the question is what can be done to avoid this hang ups on Tumbleweed ?

    Thank you for any input.

  2. #2
    Join Date
    Jun 2008
    Location
    Podunk
    Posts
    27,817
    Blog Entries
    15

    Default Re: drm/915: Resetting chip after gpu hang

    Hi
    Do you have a second system you can use to login from via ssh?

    Else are there any crashes/oops in the output from journalctl?
    Cheers Malcolm °¿° SUSE Knowledge Partner (Linux Counter #276890)
    SUSE SLE, openSUSE Leap/Tumbleweed (x86_64) | GNOME DE
    If you find this post helpful and are logged into the web interface,
    please show your appreciation and click on the star below... Thanks!

  3. #3

    Default Re: drm/915: Resetting chip after gpu hang

    Thank you for fast reply

    Yes I can log in via ssh

    I do not see any strange crashes in journalctl

    Just ran : /sbin/lspci -nnk



    Code:
    00:00.0 Host bridge [0600]: Intel Corporation Skylake Host Bridge/DRAM Registers [8086:1904] (rev 08)
            Subsystem: Lenovo Device [17aa:2233]
    00:02.0 VGA compatible controller [0300]: Intel Corporation HD Graphics 520 [8086:1916] (rev 07)
            Subsystem: Lenovo Device [17aa:2234]
            Kernel driver in use: i915
            Kernel modules: i915
    00:08.0 System peripheral [0880]: Intel Corporation Skylake Gaussian Mixture Model [8086:1911]
            Subsystem: Lenovo Device [17aa:2233]
    00:14.0 USB controller [0c03]: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller [8086:9d2f] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: xhci_hcd
            Kernel modules: xhci_pci
    00:14.2 Signal processing controller [1180]: Intel Corporation Sunrise Point-LP Thermal subsystem [8086:9d31] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: intel_pch_thermal
            Kernel modules: intel_pch_thermal
    00:16.0 Communication controller [0780]: Intel Corporation Sunrise Point-LP CSME HECI #1 [8086:9d3a] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: mei_me
            Kernel modules: mei_me
    00:17.0 SATA controller [0106]: Intel Corporation Sunrise Point-LP SATA Controller [AHCI mode] [8086:9d03] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: ahci
    00:1c.0 PCI bridge [0604]: Intel Corporation Device [8086:9d10] (rev f1)
            Kernel driver in use: pcieport
            Kernel modules: shpchp
    00:1c.2 PCI bridge [0604]: Intel Corporation Device [8086:9d12] (rev f1)
            Kernel driver in use: pcieport
            Kernel modules: shpchp
    00:1d.0 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #9 [8086:9d18] (rev f1)
            Kernel driver in use: pcieport
            Kernel modules: shpchp
    00:1f.0 ISA bridge [0601]: Intel Corporation Sunrise Point-LP LPC Controller [8086:9d48] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
    00:1f.2 Memory controller [0580]: Intel Corporation Sunrise Point-LP PMC [8086:9d21] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: intel_pmc_core
    00:1f.3 Audio device [0403]: Intel Corporation Sunrise Point-LP HD Audio [8086:9d70] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: snd_hda_intel
            Kernel modules: snd_hda_intel, snd_soc_skl
    00:1f.4 SMBus [0c05]: Intel Corporation Sunrise Point-LP SMBus [8086:9d23] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: i801_smbus
            Kernel modules: i2c_i801
    00:1f.6 Ethernet controller [0200]: Intel Corporation Ethernet Connection I219-LM [8086:156f] (rev 21)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: e1000e
            Kernel modules: e1000e
    02:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS522A PCI Express Card Reader [10ec:522a] (rev 01)
            Subsystem: Lenovo Device [17aa:2233]
            Kernel driver in use: rtsx_pci
            Kernel modules: rtsx_pci
    04:00.0 Network controller [0280]: Intel Corporation Wireless 8260 [8086:24f3] (rev 3a)
            Subsystem: Intel Corporation Device [8086:1130]
            Kernel driver in use: iwlwifi
            Kernel modules: iwlwifi
    06:00.0 3D controller [0302]: NVIDIA Corporation GM108M [GeForce 930M] [10de:1349] (rev a2)
            Subsystem: Lenovo Device [17aa:2234]
            Kernel driver in use: nouveau
            Kernel modules: nouveau

  4. #4
    Join Date
    Jun 2008
    Location
    Podunk
    Posts
    27,817
    Blog Entries
    15

    Default Re: drm/915: Resetting chip after gpu hang

    Hi
    Ahh dual graphics, you need to be running bumblebee...

    See http://forums.opensuse.org/showthread.php?t=520918
    Cheers Malcolm °¿° SUSE Knowledge Partner (Linux Counter #276890)
    SUSE SLE, openSUSE Leap/Tumbleweed (x86_64) | GNOME DE
    If you find this post helpful and are logged into the web interface,
    please show your appreciation and click on the star below... Thanks!

  5. #5

    Default Re: drm/915: Resetting chip after gpu hang

    Well , it worked for 4 months without a hitch , just using the nouveau drivers
    and no change to any configuration other than the weekly updates.

    So suddenly the system needs bumblebee ? What update introduced the problem ?

    My previous experience with bumblebee was not too good , remembering one bug
    after another , but it's been a while so things may have progressed.

    Is this the only possible solution then ?

    Thanks.

  6. #6
    Join Date
    Jun 2008
    Location
    Podunk
    Posts
    27,817
    Blog Entries
    15

    Default Re: drm/915: Resetting chip after gpu hang

    Quote Originally Posted by Darkblue2 View Post
    Well , it worked for 4 months without a hitch , just using the nouveau drivers
    and no change to any configuration other than the weekly updates.

    So suddenly the system needs bumblebee ? What update introduced the problem ?

    My previous experience with bumblebee was not too good , remembering one bug
    after another , but it's been a while so things may have progressed.

    Is this the only possible solution then ?

    Thanks.
    Hi
    Mesa, Xorg, Kernel updates possibly (the joys of running tumbleweed). Can you disable the intel gpu in the BIOS?

    If you ssh into the machine when it fails to resume, anything in the logs? It might be better to already be logged in and follow the logs with journalctl -f and see if anything happens.

    If you run the top command via ssh, any process hogging the CPU?
    Cheers Malcolm °¿° SUSE Knowledge Partner (Linux Counter #276890)
    SUSE SLE, openSUSE Leap/Tumbleweed (x86_64) | GNOME DE
    If you find this post helpful and are logged into the web interface,
    please show your appreciation and click on the star below... Thanks!

  7. #7
    Join Date
    Jun 2008
    Location
    Groningen, Netherlands
    Posts
    20,306
    Blog Entries
    14

    Default Re: drm/915: Resetting chip after gpu hang

    Quote Originally Posted by Darkblue2 View Post
    Well , it worked for 4 months without a hitch , just using the nouveau drivers
    and no change to any configuration other than the weekly updates.

    So suddenly the system needs bumblebee ? What update introduced the problem ?

    My previous experience with bumblebee was not too good , remembering one bug
    after another , but it's been a while so things may have progressed.

    Is this the only possible solution then ?

    Thanks.
    No, I use bbswitch to switch off the NVIDIA card and only use the Intel. The usage is explained on the bumblebee SDB page.
    ° Appreciate my reply? Click the star and let me know why.

    ° Perfection is not gonna happen. No way.

    http://en.opensuse.org/User:Knurpht
    http://nl.opensuse.org/Gebruiker:Knurpht

  8. #8

    Default Re: drm/915: Resetting chip after gpu hang

    As suggested by Knurpht and malcolmlewis I installed bumblebee and the time between failures was reduced but
    the system still becomes unresponsive after several hours suspension . Here a dump of part of a journalctl dump:

    dump part A

    Code:
    Nov 17]00:11:48 linux-edbc kernel: [drm] GPU HANG: ecode 9:0:0xfffffffe, in X [1579], reason: Hang on render ring, action: reset
    Nov 17 00:11:48 linux-edbc kernel: [drm] GPU hangs can indicate a bug anywhere in the entire gfx stack, including userspace.
    Nov 17 00:11:48 linux-edbc kernel: [drm] Please file a _new_ bug report on bugs.freedesktop.org against DRI -> DRM/Intel
    Nov 17 00:11:48 linux-edbc kernel: [drm] drm/i915 developers can then reassign to the right component if it's not a kernel issue.
    Nov 17 00:11:48 linux-edbc kernel: [drm] The gpu crash dump is required to analyze gpu hangs, so please always attach it.
    Nov 17 00:11:48 linux-edbc kernel: [drm] GPU crash dump saved to /sys/class/drm/card0/error
    Nov 17 00:11:48 linux-edbc kernel: drm/i915: Resetting chip after gpu hang
    Nov 17 00:11:48 linux-edbc kernel: [drm] GuC firmware load skipped
    Nov 17 00:11:50 linux-edbc kernel: [drm] RC6 on
    Nov 17 00:11:58 linux-edbc kernel: drm/i915: Resetting chip after gpu hang
    Nov 17 00:11:58 linux-edbc kernel: [drm] GuC firmware load skipped
    Nov 17 00:12:00 linux-edbc kernel: [drm] RC6 on
    Nov 17 00:12:11 linux-edbc kernel: drm/i915: Resetting chip after gpu hang
    Nov 17 00:12:11 linux-edbc kernel: [drm] GuC firmware load skipped
    Nov 17 00:12:11 linux-edbc bluetoothd[1004]: Endpoint unregistered: sender=:1.37 path=/MediaEndpoint/A2DPSource
    Nov 17 00:12:11 linux-edbc bluetoothd[1004]: Endpoint unregistered: sender=:1.37 path=/MediaEndpoint/A2DPSink
    Nov 17 00:12:11 linux-edbc polkitd[1031]: Unregistered Authentication Agent for unix-session:1 (system bus name :1.26, object path /org/kde/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
    Nov 17 00:12:12 linux-edbc kernel: [drm] RC6 on
    Nov 17 00:12:12 linux-edbc kernel: ------------[ cut here ]------------
    Nov 17 00:12:12 linux-edbc kernel: WARNING: CPU: 0 PID: 1579 at ../drivers/gpu/drm/i915/intel_display.c:13688 intel_atomic_commit_tail+0xf2d/0xf40 [i915]
    Nov 17 00:12:12 linux-edbc kernel: pipe A vblank wait timed out
    Nov 17 00:12:12 linux-edbc kernel: Modules linked in: fuse ccm cmac rfcomm af_packet nf_log_ipv6 xt_pkttype nf_log_ipv4 nf_log_common xt_LOG xt_limit msr bnep xfs libcrc32c uvcvideo btusb btrtl btbcm videobuf2_vmalloc videobuf2_memops btintel videobuf2_v4l2 videobuf2_core bluetooth videodev snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic ip6t_REJECT snd_soc_skl snd_soc_skl_ipc snd_soc_sst_ipc snd_soc_sst_dsp nf_reject_ipv6 snd_hda_ext_core snd_soc_sst_match arc4 snd_soc_core snd_compress xt_tcpudp nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw ipt_REJECT nf_reject_ipv4 intel_rapl x86_pkg_temp_thermal iptable_raw intel_powerclamp coretemp xt_CT kvm irqbypass iptable_filter crct10dif_pclmul crc32_pclmul mei_wdt ghash_clmulni_intel iTCO_wdt iTCO_vendor_support aesni_intel ip6table_mangle snd_hda_intel snd_hda_codec
    Nov 17 00:12:12 linux-edbc kernel:  snd_hda_core iwlmvm snd_hwdep aes_x86_64 lrw glue_helper ablk_helper mac80211 nf_conntrack_netbios_ns cryptd nf_conntrack_broadcast snd_pcm nf_conntrack_ipv4 nf_defrag_ipv4 thinkpad_acpi ip_tables snd_timer e1000e iwlwifi pcspkr ptp joydev pps_core rtsx_pci_ms snd xt_conntrack mei_me thermal mei fjes i2c_i801 memstick intel_pch_thermal i2c_smbus cfg80211 soundcore battery ac rfkill shpchp tpm_tis tpm_tis_core tpm nf_conntrack ip6table_filter ip6_tables x_tables uas usb_storage btrfs xor zlib_deflate hid_generic usbhid raid6_pq rtsx_pci_sdmmc mmc_core crc32c_intel serio_raw i915 rtsx_pci mfd_core i2c_algo_bit drm_kms_helper syscopyarea xhci_pci sysfillrect sysimgblt fb_sys_fops xhci_hcd usbcore drm usb_common wmi video button sg bbswitch(O)
    Nov 17 00:12:12 linux-edbc kernel: CPU: 0 PID: 1579 Comm: X Tainted: G           O    4.8.6-2-default #1
    Nov 17 00:12:12 linux-edbc kernel: Hardware name: LENOVO 20F9A00PHH/20F9A00PHH, BIOS N1CET43W (1.11 ) 04/14/2016
    Nov 17 00:12:12 linux-edbc kernel:  0000000000000000 ffffffffba3a4272 ffff984ea8773ba8 0000000000000000
    Nov 17 00:12:12 linux-edbc kernel:  ffffffffba07de2e 0000000000000000 ffff984ea8773bf8 0000000000000000
    Nov 17 00:12:12 linux-edbc kernel:  0000000000000000 0000000000000000 ffff984eb2d2b000 ffffffffba07de9f
    Nov 17 00:12:12 linux-edbc kernel: Call Trace:
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba02eefe>] dump_trace+0x5e/0x310
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba02f2cb>] show_stack_log_lvl+0x11b/0x1a0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba030001>] show_stack+0x21/0x40
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba3a4272>] dump_stack+0x5c/0x7a
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba07de2e>] __warn+0xbe/0xe0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba07de9f>] warn_slowpath_fmt+0x4f/0x60
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc0645e5d>] intel_atomic_commit_tail+0xf2d/0xf40 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc064627c>] intel_atomic_commit+0x40c/0x510 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc05744fc>] restore_fbdev_mode+0x14c/0x270 [drm_kms_helper]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc0575f9e>] drm_fb_helper_restore_fbdev_mode_unlocked+0x2e/0x70 [drm_kms_helper]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc0660744>] intel_fbdev_restore_mode+0x34/0xa0 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc05c8a0a>] i915_driver_lastclose+0xa/0x10 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc04311a9>] drm_lastclose+0x29/0x120 [drm]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc04314fb>] drm_release+0x25b/0x370 [drm]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba21ac38>] __fput+0xb8/0x1c0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba09abb9>] task_work_run+0x79/0xa0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba081952>] do_exit+0x2c2/0xb50
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba08225a>] do_group_exit+0x3a/0xa0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba0822d0>] SyS_exit_group+0x10/0x10
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba6d43f6>] entry_SYSCALL_64_fastpath+0x1e/0xa8
    Nov 17 00:12:12 linux-edbc kernel: DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x1e/0xa8
    Nov 17 00:12:12 linux-edbc kernel: 
    Nov 17 00:12:12 linux-edbc kernel: Leftover inexact backtrace:
    Nov 17 00:12:12 linux-edbc kernel: ---[ end trace 321413246605d259 ]---
    Nov 17 00:12:12 linux-edbc kernel: ------------[ cut here ]------------

  9. #9

    Default Re: drm/915: Resetting chip after gpu hang

    dump part B

    Code:
    Nov 17 00:12:12 linux-edbc kernel: WARNING: CPU: 0 PID: 1579 at  ../drivers/gpu/drm/i915/intel_display.c:13688  intel_atomic_commit_tail+0xf2d/0xf40 [i915]
    Nov 17 00:12:12 linux-edbc kernel: pipe B vblank wait timed out
    Nov 17 00:12:12 linux-edbc kernel: Modules linked in: fuse ccm cmac  rfcomm af_packet nf_log_ipv6 xt_pkttype nf_log_ipv4 nf_log_common xt_LOG  xt_limit msr bnep xfs libcrc32c uvcvideo btusb btrtl btbcm  videobuf2_vmalloc videobuf2_memops btintel videobuf2_v4l2 videobuf2_core  bluetooth videodev snd_hda_codec_hdmi snd_hda_codec_realtek  snd_hda_codec_generic ip6t_REJECT snd_soc_skl snd_soc_skl_ipc  snd_soc_sst_ipc snd_soc_sst_dsp nf_reject_ipv6 snd_hda_ext_core  snd_soc_sst_match arc4 snd_soc_core snd_compress xt_tcpudp  nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw ipt_REJECT nf_reject_ipv4  intel_rapl x86_pkg_temp_thermal iptable_raw intel_powerclamp coretemp  xt_CT kvm irqbypass iptable_filter crct10dif_pclmul crc32_pclmul mei_wdt  ghash_clmulni_intel iTCO_wdt iTCO_vendor_support aesni_intel  ip6table_mangle snd_hda_intel snd_hda_codec
    Nov 17 00:12:12 linux-edbc kernel:  snd_hda_core iwlmvm snd_hwdep  aes_x86_64 lrw glue_helper ablk_helper mac80211 nf_conntrack_netbios_ns  cryptd nf_conntrack_broadcast snd_pcm nf_conntrack_ipv4 nf_defrag_ipv4  thinkpad_acpi ip_tables snd_timer e1000e iwlwifi pcspkr ptp joydev  pps_core rtsx_pci_ms snd xt_conntrack mei_me thermal mei fjes i2c_i801  memstick intel_pch_thermal i2c_smbus cfg80211 soundcore battery ac  rfkill shpchp tpm_tis tpm_tis_core tpm nf_conntrack ip6table_filter  ip6_tables x_tables uas usb_storage btrfs xor zlib_deflate hid_generic  usbhid raid6_pq rtsx_pci_sdmmc mmc_core crc32c_intel serio_raw i915  rtsx_pci mfd_core i2c_algo_bit drm_kms_helper syscopyarea xhci_pci  sysfillrect sysimgblt fb_sys_fops xhci_hcd usbcore drm usb_common wmi  video button sg bbswitch(O)
    Nov 17 00:12:12 linux-edbc kernel: CPU: 0 PID: 1579 Comm: X Tainted: G        W  O    4.8.6-2-default #1
    Nov 17 00:12:12 linux-edbc kernel: Hardware name: LENOVO 20F9A00PHH/20F9A00PHH, BIOS N1CET43W (1.11 ) 04/14/2016
    Nov 17 00:12:12 linux-edbc kernel:  0000000000000000 ffffffffba3a4272 ffff984ea8773ba8 0000000000000000
    Nov 17 00:12:12 linux-edbc kernel:  ffffffffba07de2e 0000000000000001 ffff984ea8773bf8 00000000000000a8
    Nov 17 00:12:12 linux-edbc kernel:  0000000000000000 0000000000000004 ffff984eb1e00000 ffffffffba07de9f
    Nov 17 00:12:12 linux-edbc kernel: Call Trace:
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba02eefe>] dump_trace+0x5e/0x310
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba02f2cb>] show_stack_log_lvl+0x11b/0x1a0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba030001>] show_stack+0x21/0x40
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba3a4272>] dump_stack+0x5c/0x7a
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba07de2e>] __warn+0xbe/0xe0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba07de9f>] warn_slowpath_fmt+0x4f/0x60
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc0645e5d>] intel_atomic_commit_tail+0xf2d/0xf40 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc064627c>] intel_atomic_commit+0x40c/0x510 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc05744fc>] restore_fbdev_mode+0x14c/0x270 [drm_kms_helper]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc0575f9e>]  drm_fb_helper_restore_fbdev_mode_unlocked+0x2e/0x70 [drm_kms_helper]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc0660744>] intel_fbdev_restore_mode+0x34/0xa0 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc05c8a0a>] i915_driver_lastclose+0xa/0x10 [i915]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc04311a9>] drm_lastclose+0x29/0x120 [drm]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffc04314fb>] drm_release+0x25b/0x370 [drm]
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba21ac38>] __fput+0xb8/0x1c0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba09abb9>] task_work_run+0x79/0xa0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba081952>] do_exit+0x2c2/0xb50
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba08225a>] do_group_exit+0x3a/0xa0
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba0822d0>] SyS_exit_group+0x10/0x10
    Nov 17 00:12:12 linux-edbc kernel:  [<ffffffffba6d43f6>] entry_SYSCALL_64_fastpath+0x1e/0xa8
    Nov 17 00:12:12 linux-edbc kernel: DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x1e/0xa8
    Nov 17 00:12:12 linux-edbc kernel: 
    Nov 17 00:12:12 linux-edbc kernel: Leftover inexact backtrace:
    Nov 17 00:12:12 linux-edbc kernel: ---[ end trace 321413246605d25a ]---
    Nov 17 00:12:22 linux-edbc kernel:  [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*  [CRTC:26:pipe A] flip_done timed out
    Nov 17 00:12:33 linux-edbc kernel:  [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*  [CRTC:30:pipe B] flip_done timed out
    Nov 17 00:12:33 linux-edbc sddm-helper[1793]: [PAM] Ended.
    Nov 17 00:12:33 linux-edbc sddm[1577]: Auth: sddm-helper exited successfully
    Nov 17 00:12:33 linux-edbc sddm[1577]: Display server stopping...
    Nov 17 00:12:33 linux-edbc sddm[1577]: Display server stopped.
    Nov 17 00:12:33 linux-edbc sddm[1577]: Running display stop script  "/usr/share/sddm/scripts/Xstop"
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>   [1479312753.1161] device (wlp4s0): state change: activated ->  deactivating (reason 'connection-removed') [100 110 38]
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>   [1479312753.1171] device (wlp4s0): state change: deactivating ->  disconnected (reason 'connection-removed') [110 30 38]
    Nov 17 00:12:33 linux-edbc avahi-daemon[1018]: Withdrawing address record for fe80::4685:ff:fed1:71ee on wlp4s0.
    Nov 17 00:12:33 linux-edbc avahi-daemon[1018]: Leaving mDNS multicast  group on interface wlp4s0.IPv6 with address fe80::4685:ff:fed1:71ee.
    Nov 17 00:12:33 linux-edbc avahi-daemon[1018]: Interface wlp4s0.IPv6 no longer relevant for mDNS.
    Nov 17 00:12:33 linux-edbc sddm[1577]: Removing display ":0" ...
    Nov 17 00:12:33 linux-edbc sddm[1577]: Adding new display on vt 7 ...
    Nov 17 00:12:33 linux-edbc sddm[1577]: Display server starting...
    Nov 17 00:12:33 linux-edbc sddm[1577]: Running: /usr/bin/X -nolisten tcp  -auth /run/sddm/{d8a55dcb-151d-49db-bacb-85f0c61e05ec} -background none  -noreset -displayfd 18 vt7
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>   [1479312753.1499] dhcp4 (wlp4s0): canceled DHCP transaction, DHCP  client pid 4540
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>  [1479312753.1499] dhcp4 (wlp4s0): state changed bound -> done
    Nov 17 00:12:33 linux-edbc kernel: wlp4s0: deauthenticating from 00:11:50:26:2f:eb by local choice (Reason: 3=DEAUTH_LEAVING)
    Nov 17 00:12:33 linux-edbc avahi-daemon[1018]: Withdrawing address record for 192.168.2.7 on wlp4s0.
    Nov 17 00:12:33 linux-edbc avahi-daemon[1018]: Leaving mDNS multicast group on interface wlp4s0.IPv4 with address 192.168.2.7.
    Nov 17 00:12:33 linux-edbc avahi-daemon[1018]: Interface wlp4s0.IPv4 no longer relevant for mDNS.
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>   [1479312753.1659] device (wlp4s0): set-hw-addr: set MAC address to  FE:29:FD:1A:23:5D (scanning)
    Nov 17 00:12:33 linux-edbc kernel: IPv6: ADDRCONF(NETDEV_UP): wlp4s0: link is not ready
    Nov 17 00:12:33 linux-edbc dns-resolver[4878]: ATTENTION: You have modified /etc/resolv.conf. Leaving it untouched...
    Nov 17 00:12:33 linux-edbc dns-resolver[4880]: You can find my version in /etc/resolv.conf.netconfig
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <13>Nov 17  00:12:33 dns-resolver: ATTENTION: You have modified /etc/resolv.conf.  Leaving it untouched...
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <13>Nov 17  00:12:33 dns-resolver: You can find my version in  /etc/resolv.conf.netconfig

  10. #10

    Default Re: drm/915: Resetting chip after gpu hang

    dump part C

    Code:
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: ATTENTION: You have modified /etc/resolv.conf.  Leaving it untouched...
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: You can find my version in /etc/resolv.conf.netconfig ...
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: nisdomainname: you must be root to change the domain name
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: nisdomainname: you must be root to change the domain name
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <warn>    [1479312753.4417] dns-mgr: could not commit DNS changes: Error calling   netconfig: exited with status 20
    Nov 17 00:12:33 linux-edbc dbus[1007]: [system] Activating via systemd:   service name='org.freedesktop.nm_dispatcher'   unit='dbus-org.freedesktop.nm-dispatcher.service'
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <warn>    [1479312753.4508] sup-iface[0x141e8a0,wlp4s0]: connection disconnected   (reason -3)
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>    [1479312753.4513] device (wlp4s0): supplicant interface state:   completed -> disconnected
    Nov 17 00:12:33 linux-edbc systemd[1]: Starting Network Manager Script Dispatcher Service...
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>    [1479312753.4806] device (wlp4s0): supplicant interface state:   disconnected -> disabled
    Nov 17 00:12:33 linux-edbc dbus[1007]: [system] Successfully activated service 'org.freedesktop.nm_dispatcher'
    Nov 17 00:12:33 linux-edbc NetworkManager[1014]: <info>    [1479312753.4815] device (wlp4s0): supplicant interface state: disabled   -> disconnected
    Nov 17 00:12:33 linux-edbc systemd[1]: Started Network Manager Script Dispatcher Service.
    Nov 17 00:12:33 linux-edbc audit[1]: SERVICE_START pid=1 uid=0   auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher   comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=?   terminal=? res=success'
    Nov 17 00:12:33 linux-edbc nm-dispatcher[4941]: req:1 'down' [wlp4s0]: new request (4 scripts)
    Nov 17 00:12:33 linux-edbc nm-dispatcher[4941]: req:1 'down' [wlp4s0]: start running ordered scripts...
    Nov 17 00:12:43 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:26:pipe A] flip_done timed out
    Nov 17 00:12:43 linux-edbc audit[1]: SERVICE_STOP pid=1 uid=0   auid=4294967295 ses=4294967295 msg='unit=NetworkManager-dispatcher   comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=?   terminal=? res=success'
    Nov 17 00:12:53 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:30:pipe B] flip_done timed out
    Nov 17 00:13:03 linux-edbc kernel: SFW2-INext-DROP-DEFLT IN=enp0s31f6   OUT= MAC=50:7b:9d:ff:db:d9:00:24:8c:09:b9:37:08:00 SRC=192.168.2.6   DST=192.168.2.5 LEN=139 TOS=0x00 PREC=0x00 TTL=64 ID=59195 DF PROTO=UDP   SPT=3128 DPT=50315 LEN=119 
    Nov 17 00:13:03 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:26:pipe A] flip_done timed out
    Nov 17 00:13:05 linux-edbc kernel: SFW2-INext-DROP-DEFLT IN=enp0s31f6   OUT= MAC=50:7b:9d:ff:db:d9:00:24:8c:09:b9:37:08:00 SRC=192.168.2.6   DST=192.168.2.5 LEN=139 TOS=0x00 PREC=0x00 TTL=64 ID=60604 DF PROTO=UDP   SPT=3128 DPT=50315 LEN=119 
    Nov 17 00:13:09 linux-edbc kernel: SFW2-INext-DROP-DEFLT IN=enp0s31f6   OUT= MAC=50:7b:9d:ff:db:d9:00:24:8c:09:b9:37:08:00 SRC=192.168.2.6   DST=192.168.2.5 LEN=139 TOS=0x00 PREC=0x00 TTL=64 ID=63736 DF PROTO=UDP   SPT=3128 DPT=50315 LEN=119 
    Nov 17 00:13:11 linux-edbc kernel: usb 1-1: USB disconnect, device number 2
    Nov 17 00:13:13 linux-edbc kernel: usb 1-1: new low-speed USB device number 10 using xhci_hcd
    Nov 17 00:13:13 linux-edbc kernel: usb 1-1: New USB device found, idVendor=046d, idProduct=c077
    Nov 17 00:13:13 linux-edbc kernel: usb 1-1: New USB device strings: Mfr=1, Product=2, SerialNumber=0
    Nov 17 00:13:13 linux-edbc kernel: usb 1-1: Product: USB Optical Mouse
    Nov 17 00:13:13 linux-edbc kernel: usb 1-1: Manufacturer: Logitech
    Nov 17 00:13:13 linux-edbc kernel: input: Logitech USB Optical Mouse as   /devices/pci0000:00/0000:00:14.0/usb1/1-1/1-1:1.0/0003:046D:C077.0002/input/input21
    Nov 17 00:13:13 linux-edbc kernel: hid-generic 0003:046D:C077.0002:   input,hidraw0: USB HID v1.11 Mouse [Logitech USB Optical Mouse] on   usb-0000:00:14.0-1/input0
    Nov 17 00:13:13 linux-edbc kernel: usb 1-4: new full-speed USB device number 11 using xhci_hcd
    Nov 17 00:13:13 linux-edbc kernel: usb 1-4: device descriptor read/64, error -71
    Nov 17 00:13:13 linux-edbc kernel: usb 1-4: device descriptor read/64, error -71
    Nov 17 00:13:14 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:30:pipe B] flip_done timed out
    Nov 17 00:13:14 linux-edbc kernel: usb 1-4: new full-speed USB device number 12 using xhci_hcd
    Nov 17 00:13:14 linux-edbc kernel: usb 1-4: device descriptor read/64, error -71
    Nov 17 00:13:14 linux-edbc kernel: usb 1-4: device descriptor read/64, error -71
    Nov 17 00:13:14 linux-edbc kernel: usb 1-4: new full-speed USB device number 13 using xhci_hcd
    Nov 17 00:13:14 linux-edbc kernel: usb 1-4: Device not responding to setup address.
    Nov 17 00:13:15 linux-edbc kernel: usb 1-4: Device not responding to setup address.
    Nov 17 00:13:15 linux-edbc kernel: usb 1-4: device not accepting address 13, error -71
    Nov 17 00:13:15 linux-edbc kernel: usb 1-4: new full-speed USB device number 14 using xhci_hcd
    Nov 17 00:13:15 linux-edbc kernel: usb 1-4: Device not responding to setup address.
    Nov 17 00:13:15 linux-edbc kernel: usb 1-4: Device not responding to setup address.
    Nov 17 00:13:15 linux-edbc kernel: usb 1-4: device not accepting address 14, error -71
    Nov 17 00:13:15 linux-edbc kernel: usb usb1-port4: unable to enumerate USB device
    Nov 17 00:13:15 linux-edbc mtp-probe[4978]: checking bus 1, device 10: "/sys/devices/pci0000:00/0000:00:14.0/usb1/1-1"
    Nov 17 00:13:15 linux-edbc mtp-probe[4978]: bus: 1, device: 10 was not an MTP device
    Nov 17 00:13:24 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:26:pipe A] flip_done timed out
    Nov 17 00:13:34 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:30:pipe B] flip_done timed out
    Nov 17 00:13:44 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:26:pipe A] flip_done timed out
    Nov 17 00:13:55 linux-edbc kernel:   [drm:drm_atomic_helper_commit_cleanup_done [drm_kms_helper]] *ERROR*   [CRTC:30:pipe B] flip_done timed out
    Nov 17 00:14:03 linux-edbc kernel: BUG: unable to handle kernel NULL pointer dereference at           (null)
    Nov 17 00:14:03 linux-edbc kernel: IP: [<ffffffffba0be835>] __wake_up_common+0x25/0x80
    Nov 17 00:14:03 linux-edbc kernel: PGD 2e162e067 PUD 2e162c067 PMD 0 
    Nov 17 00:14:03 linux-edbc kernel: Oops: 0000 [#1] PREEMPT SMP
    Nov 17 00:14:03 linux-edbc kernel: Modules linked in: fuse ccm cmac   rfcomm af_packet nf_log_ipv6 xt_pkttype nf_log_ipv4 nf_log_common  xt_LOG  xt_limit msr bnep xfs libcrc32c uvcvideo btusb btrtl btbcm   videobuf2_vmalloc videobuf2_memops btintel videobuf2_v4l2  videobuf2_core  bluetooth videodev snd_hda_codec_hdmi  snd_hda_codec_realtek  snd_hda_codec_generic ip6t_REJECT snd_soc_skl  snd_soc_skl_ipc  snd_soc_sst_ipc snd_soc_sst_dsp nf_reject_ipv6  snd_hda_ext_core  snd_soc_sst_match arc4 snd_soc_core snd_compress  xt_tcpudp  nf_conntrack_ipv6 nf_defrag_ipv6 ip6table_raw ipt_REJECT  nf_reject_ipv4  intel_rapl x86_pkg_temp_thermal iptable_raw  intel_powerclamp coretemp  xt_CT kvm irqbypass iptable_filter  crct10dif_pclmul crc32_pclmul mei_wdt  ghash_clmulni_intel iTCO_wdt  iTCO_vendor_support aesni_intel  ip6table_mangle snd_hda_intel  snd_hda_codec
    Nov 17 00:14:03 linux-edbc kernel:  snd_hda_core iwlmvm snd_hwdep   aes_x86_64 lrw glue_helper ablk_helper mac80211 nf_conntrack_netbios_ns   cryptd nf_conntrack_broadcast snd_pcm nf_conntrack_ipv4 nf_defrag_ipv4   thinkpad_acpi ip_tables snd_timer e1000e iwlwifi pcspkr ptp joydev   pps_core rtsx_pci_ms snd xt_conntrack mei_me thermal mei fjes i2c_i801   memstick intel_pch_thermal i2c_smbus cfg80211 soundcore battery ac   rfkill shpchp tpm_tis tpm_tis_core tpm nf_conntrack ip6table_filter   ip6_tables x_tables uas usb_storage btrfs xor zlib_deflate hid_generic   usbhid raid6_pq rtsx_pci_sdmmc mmc_core crc32c_intel serio_raw i915   rtsx_pci mfd_core i2c_algo_bit drm_kms_helper syscopyarea xhci_pci   sysfillrect sysimgblt fb_sys_fops xhci_hcd usbcore drm usb_common wmi   video button sg bbswitch(O)
    Nov 17 00:14:03 linux-edbc kernel: CPU: 0 PID: 4832 Comm: X Tainted: G        W  O    4.8.6-2-default #1
    Nov 17 00:14:03 linux-edbc kernel: Hardware name: LENOVO 20F9A00PHH/20F9A00PHH, BIOS N1CET43W (1.11 ) 04/14/2016
    Nov 17 00:14:03 linux-edbc kernel: task: ffff984ea939e140 task.stack: ffff984e617d8000
    Nov 17 00:14:03 linux-edbc kernel: RIP: 0010:[<ffffffffba0be835>]  [<ffffffffba0be835>] __wake_up_common+0x25/0x80
    Nov 17 00:14:03 linux-edbc kernel: RSP: 0018:ffff984e617dbab8  EFLAGS: 00010082
    Nov 17 00:14:03 linux-edbc kernel: RAX: 0000000000000082 RBX: ffff984e863a4618 RCX: 0000000000000000
    Nov 17 00:14:03 linux-edbc kernel: RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff984e863a4618
    Nov 17 00:14:03 linux-edbc kernel: RBP: ffff984e863a4620 R08: 0000000000000000 R09: 0000000000002ee8
    Nov 17 00:14:03 linux-edbc kernel: R10: 0000000000045071 R11: 0000000000000005 R12: 0000000000000082
    Nov 17 00:14:03 linux-edbc kernel: R13: 0000000000000001 R14: 0000000000000003 R15: ffff984eb1f201c8
    Nov 17 00:14:03 linux-edbc kernel: FS:  00007ff5d1efba00(0000) GS:ffff984ec1400000(0000) knlGS:0000000000000000
    Nov 17 00:14:03 linux-edbc kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    Nov 17 00:14:03 linux-edbc kernel: CR2: 0000000000000000 CR3: 00000002e162a000 CR4: 00000000003406f0
    Nov 17 00:14:03 linux-edbc kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    Nov 17 00:14:03 linux-edbc kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Nov 17 00:14:03 linux-edbc kernel: Stack:
    Nov 17 00:14:03 linux-edbc kernel:  0000000000002ee8 ffff984e863a4618 ffff984e863a4610 0000000000000082
    Nov 17 00:14:03 linux-edbc kernel:  0000000000000001 ffff984eae716400 ffff984eb1f201c8 ffffffffba0bf291
    Nov 17 00:14:03 linux-edbc kernel:  ffff984eb35d5000 ffff984eb35d5000 dead000000000200 ffffffffc0431065
    Nov 17 00:14:03 linux-edbc kernel: Call Trace:
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffba0bf291>] complete_all+0x31/0x40
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc0431065>] drm_send_event_locked+0x25/0x100 [drm]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc0434094>] drm_vblank_off+0x164/0x210 [drm]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc063ea3a>] haswell_crtc_disable+0x6a/0x270 [i915]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc0645089>] intel_atomic_commit_tail+0x159/0xf40 [i915]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc064627c>] intel_atomic_commit+0x40c/0x510 [i915]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc064dcfa>] intel_crtc_restore_mode+0x3a/0xb0 [i915]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc06784d3>] intel_hdmi_set_property+0x83/0x190 [i915]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc0440ade>] drm_mode_obj_set_property_ioctl+0x14e/0x220 [drm]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc0440bde>] drm_mode_connector_property_set_ioctl+0x2e/0x40 [drm]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffc0432913>] drm_ioctl+0x1b3/0x440 [drm]
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffba22c50f>] do_vfs_ioctl+0x8f/0x5d0
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffba22cac4>] SyS_ioctl+0x74/0x80
    Nov 17 00:14:03 linux-edbc kernel:  [<ffffffffba6d43f6>] entry_SYSCALL_64_fastpath+0x1e/0xa8
    Nov 17 00:14:03 linux-edbc kernel: DWARF2 unwinder stuck at entry_SYSCALL_64_fastpath+0x1e/0xa8
    Nov 17 00:14:03 linux-edbc kernel: 
    Nov 17 00:14:03 linux-edbc kernel: Leftover inexact backtrace:
    Nov 17 00:14:03 linux-edbc kernel: Code: 66 0f 1f 44 00 00 0f 1f 44 00   00 41 57 41 56 41 89 f6 41 55 41 54 55 53 48 8d 6f 08 48 83 ec 08 89 54   24 04 48 8b 57 08 48 39 d5 <48> 8b 32 74 43 48 8d 42 e8 4c 8d 7e   e8 41 89 cd 4d 89 c4 8b 18 
    Nov 17 00:14:03 linux-edbc kernel: RIP  [<ffffffffba0be835>] __wake_up_common+0x25/0x80
    -- Reboot --
    After a hard reboot everything is ok for many hours until the next long screenlock ...

    there are no nvidia drivers installed , optirun looks ok with discreet card off , nouveau is "blacklisted" as suggested
    and bumblebee is running fine. ipv6 has been turned off . but still a reboot is needed after several hours screenlock.
    There is a 2nd monitor connected via hdmi cable , which also works fine, in case this matters.

    Hope this helps to further analyse this issue and thank you to all who help along.

Page 1 of 2 12 LastLast

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •