Random reboots and crashes

Hello,

Happy New Year, 2020 to everyone.

I am witnessing random reboots, freezes while opening either Firefox, Falkon, Seamonkey or Tor-Browser under moderate load. I am using Cinnamon DE at present and have witnessed same behavior while using LXDE, IceWM.

I see some errors in /var/log/warn -

2020-01-02T05:25:24.810055-08:00 linux-gfn3 kernel:  3980.328909] CPU: 0 PID: 7894 Comm: Chrome_InProcGp Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-02T05:41:45.498012-08:00 linux-gfn3 kernel:  4961.032960] CPU: 2 PID: 27316 Comm: Web Content Tainted: P      D    O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-03T03:20:18.529192-08:00 localhost kernel:  3935.153249] CPU: 0 PID: 13306 Comm: updatedb Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-03T03:20:18.529240-08:00 localhost kernel:  3935.153508] CPU: 0 PID: 13306 Comm: updatedb Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-03T03:20:18.529285-08:00 localhost kernel:  3935.153748] CPU: 0 PID: 13306 Comm: updatedb Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-03T03:45:20.441313-08:00 localhost kernel:  5437.088896] CPU: 3 PID: 3288 Comm: X Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-03T03:45:34.241604-08:00 localhost kernel:  5450.891114] CPU: 1 PID: 17266 Comm: c++ Tainted: P        W  O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-03T03:56:17.444688-08:00 localhost kernel:   425.608139] CPU: 2 PID: 4862 Comm: cc1plus Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-05T20:07:50.721868-08:00 localhost kernel:  1885.715093] CPU: 3 PID: 14140 Comm: sh Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-07T22:30:01.755169-08:00 localhost kernel:   921.332617] CPU: 0 PID: 407 Comm: kworker/0:2 Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-08T00:35:28.823786-08:00 localhost kernel:  2832.295179] CPU: 3 PID: 374 Comm: kworker/u8:8 Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-08T22:01:19.211553-08:00 localhost kernel:  6185.010580] CPU: 1 PID: 5987 Comm: kworker/1:0 Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-09T09:06:08.103405-08:00 localhost kernel:  7008.005934] CPU: 0 PID: 9535 Comm: bash Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:51:13.373199-08:00 localhost kernel:   136.493812] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O     4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:51:41.373223-08:00 localhost kernel:   164.497193] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:51:47.745154-08:00 localhost kernel:   170.869902] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:52:13.373207-08:00 localhost kernel:   196.498546] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:52:41.373210-08:00 localhost kernel:   224.496537] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:53:09.373226-08:00 localhost kernel:   252.494560] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:53:37.373220-08:00 localhost kernel:   280.494735] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:54:05.373206-08:00 localhost kernel:   308.495204] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:54:33.373205-08:00 localhost kernel:   336.495665] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:54:47.761183-08:00 localhost kernel:   350.883824] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:55:13.373201-08:00 localhost kernel:   376.496278] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:55:41.373199-08:00 localhost kernel:   404.496709] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1
2020-01-10T00:56:09.373184-08:00 localhost kernel:   432.497142] CPU: 3 PID: 3823 Comm: falkon Tainted: P           O L   4.12.14-lp151.28.36-default #1 openSUSE Leap 15.1


I don’t experience these issues in KDE on same machine, it runs smooth with many resource intensive applications such as clamd, amavisd-new, fail2ban, apache2, nginx, redis, grafana, influx, telegraf, prometheus, promtail, netdata, cockpit, zeek, ntopng, suricata, evebox, custom conntrack applications, etc and 2-3 browsers open at the same time.

Unfortunately, coredumpctl doesn’t catch this. Please let me know how to investigate this and fix it.

Thanks.

Warnings are just that… non-critical and not likely (but can’t be absolutely sure) related to anything show-stopping.
You need to try to find related errors, not warnings.

Journalctl has a number of ways to filter errors, by application, by process, by time range, etc.
A resource I often use to review what might be best way to filter

https://www.digitalocean.com/community/tutorials/how-to-use-journalctl-to-view-and-manipulate-systemd-logs

TSU

Some more info -

2020-01-02T05:25:24.810014-08:00 linux-gfn3 kernel:  3980.328874] BUG: unable to handle kernel NULL pointer dereference at 0000000000000001
2020-01-02T05:41:45.497989-08:00 linux-gfn3 kernel:  4961.032926] BUG: unable to handle kernel NULL pointer dereference at 0000000000000001
2020-01-03T03:45:34.241583-08:00 localhost kernel:  5450.891080] BUG: unable to handle kernel NULL pointer dereference at 0000000000000001
2020-01-03T03:56:17.444668-08:00 localhost kernel:   425.608082] BUG: unable to handle kernel NULL pointer dereference at 0000000000000001
2020-01-07T22:30:01.755146-08:00 localhost kernel:   921.332568] BUG: unable to handle kernel NULL pointer dereference at           (null)
2020-01-07T22:30:34.003083-08:00 localhost kernel:   953.582300] BUG: workqueue lockup - pool cpus=0 node=0 flags=0x0 nice=0 stuck for 32s!
2020-01-07T22:31:04.347102-08:00 localhost kernel:   983.922909] BUG: workqueue lockup - pool cpus=0 node=0 flags=0x0 nice=0 stuck for 62s!
2020-01-07T22:31:35.727309-08:00 localhost kernel:  1015.307526] BUG: workqueue lockup - pool cpus=0 node=0 flags=0x0 nice=0 stuck for 93s!
2020-01-07T22:32:05.795172-08:00 localhost kernel:  1045.372117] BUG: workqueue lockup - pool cpus=0 node=0 flags=0x0 nice=0 stuck for 124s!
2020-01-08T00:35:28.823766-08:00 localhost kernel:  2832.295142] BUG: unable to handle kernel NULL pointer dereference at 0000000000000034
2020-01-08T22:01:19.211550-08:00 localhost kernel:  6185.010563] kernel BUG at ../mm/vmalloc.c:1538!


I follow that and get back.

This issue is resolved now, I changed elevator to DEADLINE. Completely Fair Queuing (CFQ) wasn’t fair at all. Haven’t witnessed crashes tonight while running Firefox & Falkon at the same time.

Thanks for reporting back, others may benefit from this