named segfaults on openSUSE 11.2

Hi list,

I’ve just installed openSUSE 11.2 on two machines. The install procedure was excellent; however, on both machines named won’t start: when I try to start it, it segfaults with the message below. These are two totally different machines with different configurations (I even tried an empty default config, same problem).
Anyone else have this problem? And maybe found a solution? Or is it a known bug?

cheers
Stefan

Nov 14 02:37:30 mentat1 named[7894]: built with '--prefix=/usr' '--bindir=/usr/bin' '--sbindir=/usr/sbin' '--sysconfdir=/etc' '--localstatedir=/var' '--libdir=/usr/lib64' '--includedir=/usr/include/bind' '--mandir=/usr/share/man' '--infodir=/usr/share/info' '--with-openssl' '--enable-threads' '--with-libtool' '--enable-runidn' '--with-libxml2' '--with-dlz-mysql' 'CFLAGS=-fmessage-length=0 -O2 -Wall -D_FORTIFY_SOURCE=2 -fstack-protector -funwind-tables -fasynchronous-unwind-tables -g -fno-strict-aliasing' 'LDFLAGS=-L/usr/lib64'
Nov 14 02:37:30 mentat1 kernel: [ 3127.909898] named[7894]: segfault at 7fd510c5692c ip 00007fd510a35f77 sp 00007ffffbf7a3e0 error 4
Nov 14 02:37:30 mentat1 kernel: [ 3127.909937] BUG: scheduling while atomic: named/7894/0x00000002
Nov 14 02:37:30 mentat1 kernel: [ 3127.909946] Modules linked in: ip6t_LOG ipt_MASQUERADE xt_pkttype xt_TCPMSS xt_tcpudp ipt_LOG xt_limit iptable_nat nf_nat af_packet radeon drm cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq ip6t_REJECT nf_conntrack_ipv6 ip6table_raw xt_NOTRACK ipt_REJECT xt_state iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device fuse loop dm_mod snd_hda_codec_atihdmi snd_hda_codec_nvhdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec ohci1394 sr_mod snd_hwdep sg r8169 i2c_nforce2 forcedeth ieee1394 serio_raw pcspkr wmi cdrom joydev button snd_pcm snd_timer snd snd_page_alloc fan processor ide_pci_generic amd74xx ide_core ata_generic pata_amd thermal thermal_sys
Nov 14 02:37:30 mentat1 kernel: [ 3127.910079] Pid: 7894, comm: named Not tainted 2.6.31.5-0.1-desktop #1
Nov 14 02:37:30 mentat1 kernel: [ 3127.910088] Call Trace:
Nov 14 02:37:30 mentat1 kernel: [ 3127.910110] [<ffffffff81011a19>] try_stack_unwind+0x189/0x1b0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910123] [<ffffffff8101025d>] dump_trace+0xad/0x3a0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910135] [<ffffffff81011524>] show_trace_log_lvl+0x64/0x90
Nov 14 02:37:30 mentat1 kernel: [ 3127.910147] [<ffffffff81011573>] show_trace+0x23/0x40
Nov 14 02:37:30 mentat1 kernel: [ 3127.910160] [<ffffffff81551ee2>] dump_stack+0x81/0x9e
Nov 14 02:37:30 mentat1 kernel: [ 3127.910173] [<ffffffff81056f12>] __schedule_bug+0x92/0xa0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910186] [<ffffffff81552e1f>] thread_return+0x2a7/0x3c8
Nov 14 02:37:30 mentat1 kernel: [ 3127.910199] [<ffffffff8100d136>] retint_careful+0x14/0x32
Nov 14 02:37:30 mentat1 kernel: [ 3127.910216] [<00007fd510a35f77>] 0x7fd510a35f77
Nov 14 02:37:30 mentat1 kernel: [ 3127.910337] note: named[7894] exited with preempt_count 1
Nov 14 02:37:30 mentat1 kernel: [ 3127.910500] BUG: scheduling while atomic: named/7894/0x10000002
Nov 14 02:37:30 mentat1 kernel: [ 3127.910510] Modules linked in: ip6t_LOG ipt_MASQUERADE xt_pkttype xt_TCPMSS xt_tcpudp ipt_LOG xt_limit iptable_nat nf_nat af_packet radeon drm cpufreq_conservative cpufreq_userspace cpufreq_powersave acpi_cpufreq ip6t_REJECT nf_conntrack_ipv6 ip6table_raw xt_NOTRACK ipt_REJECT xt_state iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device fuse loop dm_mod snd_hda_codec_atihdmi snd_hda_codec_nvhdmi snd_hda_codec_realtek snd_hda_intel snd_hda_codec ohci1394 sr_mod snd_hwdep sg r8169 i2c_nforce2 forcedeth ieee1394 serio_raw pcspkr wmi cdrom joydev button snd_pcm snd_timer snd snd_page_alloc fan processor ide_pci_generic amd74xx ide_core ata_generic pata_amd thermal thermal_sys
Nov 14 02:37:30 mentat1 kernel: [ 3127.910627] Pid: 7894, comm: named Not tainted 2.6.31.5-0.1-desktop #1
Nov 14 02:37:30 mentat1 kernel: [ 3127.910640] Call Trace:
Nov 14 02:37:30 mentat1 kernel: [ 3127.910650] [<ffffffff81011a19>] try_stack_unwind+0x189/0x1b0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910665] [<ffffffff8101025d>] dump_trace+0xad/0x3a0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910674] [<ffffffff81011524>] show_trace_log_lvl+0x64/0x90
Nov 14 02:37:30 mentat1 kernel: [ 3127.910685] [<ffffffff81011573>] show_trace+0x23/0x40
Nov 14 02:37:30 mentat1 kernel: [ 3127.910697] [<ffffffff81551ee2>] dump_stack+0x81/0x9e
Nov 14 02:37:30 mentat1 kernel: [ 3127.910709] [<ffffffff81056f12>] __schedule_bug+0x92/0xa0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910722] [<ffffffff81552e1f>] thread_return+0x2a7/0x3c8
Nov 14 02:37:30 mentat1 kernel: [ 3127.910737] [<ffffffff81060da8>] __cond_resched+0x38/0x80
Nov 14 02:37:30 mentat1 kernel: [ 3127.910751] [<ffffffff815530dd>] _cond_resched+0x4d/0x60
Nov 14 02:37:30 mentat1 kernel: [ 3127.910764] [<ffffffff8106d90c>] put_files_struct+0x8c/0x100
Nov 14 02:37:30 mentat1 kernel: [ 3127.910778] [<ffffffff8106d9dc>] exit_files+0x5c/0x80
Nov 14 02:37:30 mentat1 kernel: [ 3127.910789] [<ffffffff8106f41e>] do_exit+0x17e/0x3c0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910801] [<ffffffff8106f6b8>] do_group_exit+0x58/0xd0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910816] [<ffffffff8108069b>] get_signal_to_deliver+0x1ab/0x350
Nov 14 02:37:30 mentat1 kernel: [ 3127.910832] [<ffffffff8100bfcf>] do_signal+0x7f/0x1f0
Nov 14 02:37:30 mentat1 kernel: [ 3127.910848] [<ffffffff8100c1af>] do_notify_resume+0x6f/0x80
Nov 14 02:37:30 mentat1 kernel: [ 3127.910864] [<ffffffff8100d19c>] retint_signal+0x48/0x8c
Nov 14 02:37:30 mentat1 kernel: [ 3127.910881] [<00007fd510a35f77>] 0x7fd510a35f77
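
A side note on reading the segfault line in the log above: the trailing "error 4" is the x86 page-fault error code, which the kernel prints in hex. The low three bits say whether the page was present, whether the access was a write, and whether it came from user mode. The small helper below is only a sketch for decoding those three bits; the function name decode_pf_error is made up for this example and is not part of any tool.

```shell
# Hypothetical helper: decode the low three bits of an x86 page-fault
# error code as printed in a kernel "segfault at ... error N" line.
# The argument is taken as hex, which is how the kernel prints it.
decode_pf_error() {
    code=$((0x$1))
    # bit 2: set = fault happened in user mode
    if [ $((code & 4)) -ne 0 ]; then mode=user; else mode=kernel; fi
    # bit 1: set = the access was a write
    if [ $((code & 2)) -ne 0 ]; then access=write; else access=read; fi
    # bit 0: set = page was present (protection violation)
    if [ $((code & 1)) -ne 0 ]; then cause="protection violation"; else cause="page not present"; fi
    echo "$mode-mode $access, $cause"
}

decode_pf_error 4   # prints: user-mode read, page not present
```

So in both logs named was reading, from user space, an address that was not mapped.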

Same here. I’ve tried without config, small config, large config, configs that have worked before. This is going to be a deal-breaker for me if I can’t get a fix soon.

Steve Rooke

Nov 14 23:06:38 willow named[19785]: starting BIND 9.6.1-P1 -t /var/lib/named -u named
Nov 14 23:06:38 willow named[19785]: built with '--prefix=/usr' '--bindir=/usr/bin' '--sbindir=/usr/sbin' '--sysconfdir=/etc' '--localstatedir=/var' '--libdir=/usr/lib64' '--includedir=/usr/include/bind' '--mandir=/usr/share/man' '--infodir=/usr/share/info' '--with-openssl' '--enable-threads' '--with-libtool' '--enable-runidn' '--with-libxml2' '--with-dlz-mysql' 'CFLAGS=-fmessage-length=0 -O2 -Wall -D_FORTIFY_SOURCE=2 -fstack-protector -funwind-tables -fasynchronous-unwind-tables -g -fno-strict-aliasing' 'LDFLAGS=-L/usr/lib64'
Nov 14 23:06:38 willow kernel: [73505.485664] named[19785]: segfault at 7fb6947fb92c ip 00007fb6945daf77 sp 00007fff4b8ae140 error 4
Nov 14 23:06:38 willow kernel: [73505.485687] BUG: scheduling while atomic: named/19785/0x00000002
Nov 14 23:06:38 willow kernel: [73505.485694] Modules linked in: ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device edd cpufreq_conservative ip6t_REJECT cpufreq_userspace nf_conntrack_ipv6 cpufreq_powersave ip6table_raw acpi_cpufreq xt_NOTRACK ipt_REJECT xt_state iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables fuse loop dm_mod snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_timer snd ohci1394 mxser snd_page_alloc floppy nvidia(P) pcspkr i2c_nforce2 forcedeth joydev ieee1394 sr_mod sg cdrom button fan processor ide_pci_generic amd74xx siimage ide_core pata_sil680 sata_sil24 pata_amd ata_generic sata_nv thermal thermal_sys [last unloaded: preloadtrace]
Nov 14 23:06:38 willow kernel: [73505.485797] CPU 3:
Nov 14 23:06:38 willow kernel: [73505.485801] Modules linked in: ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device edd cpufreq_conservative ip6t_REJECT cpufreq_userspace nf_conntrack_ipv6 cpufreq_powersave ip6table_raw acpi_cpufreq xt_NOTRACK ipt_REJECT xt_state iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables fuse loop dm_mod snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_timer snd ohci1394 mxser snd_page_alloc floppy nvidia(P) pcspkr i2c_nforce2 forcedeth joydev ieee1394 sr_mod sg cdrom button fan processor ide_pci_generic amd74xx siimage ide_core pata_sil680 sata_sil24 pata_amd ata_generic sata_nv thermal thermal_sys [last unloaded: preloadtrace]
Nov 14 23:06:38 willow kernel: [73505.485893] Pid: 19785, comm: named Tainted: P 2.6.31.5-0.1-desktop #1 System Product Name
Nov 14 23:06:38 willow kernel: [73505.485901] RIP: 0010:[<ffffffffa002b030>] [<ffffffffa002b030>] 0xffffffffa002b030
Nov 14 23:06:38 willow kernel: [73505.485913] RSP: 0018:ffff88020486bf80 EFLAGS: 00000297
Nov 14 23:06:38 willow kernel: [73505.485919] RAX: 0000000000000002 RBX: 0000000000000000 RCX: 00000000006a2bd0
Nov 14 23:06:38 willow kernel: [73505.485925] RDX: 00000000000001b6 RSI: 0000000000000241 RDI: 00000000006a2bb0
Nov 14 23:06:38 willow kernel: [73505.485931] RBP: 00007fff010adc30 R08: 0000000000000020 R09: 0000000000000020
Nov 14 23:06:38 willow kernel: [73505.485938] R10: 00000000006a2bd0 R11: 0000000000000246 R12: 0000000000000001
Nov 14 23:06:38 willow kernel: [73505.485944] R13: 00000000006a4cb0 R14: 0000000000000001 R15: 0000000000000241
Nov 14 23:06:38 willow kernel: [73505.485950] FS: 00007fb6947ab6f0(0000) GS:ffff88000909e000(0000) knlGS:0000000000000000
Nov 14 23:06:38 willow kernel: [73505.485957] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 14 23:06:38 willow kernel: [73505.485963] CR2: 00007fb6947fb92c CR3: 000000012f848000 CR4: 00000000000006e0
Nov 14 23:06:38 willow kernel: [73505.485969] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Nov 14 23:06:38 willow kernel: [73505.485976] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Nov 14 23:06:38 willow kernel: [73505.485982] Call Trace:
Nov 14 23:06:38 willow kernel: [73505.485997] Inexact backtrace:
Nov 14 23:06:38 willow kernel: [73505.485998]
Nov 14 23:06:38 willow kernel: [73505.486005] <#MC> <<EOE>>
Nov 14 23:06:38 willow kernel: [73505.486139] note: named[19785] exited with preempt_count 1
Nov 14 23:06:38 willow kernel: [73505.486362] BUG: scheduling while atomic: named/19785/0x10000002
Nov 14 23:06:38 willow kernel: [73505.486369] Modules linked in: ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device edd cpufreq_conservative ip6t_REJECT cpufreq_userspace nf_conntrack_ipv6 cpufreq_powersave ip6table_raw acpi_cpufreq xt_NOTRACK ipt_REJECT xt_state iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables fuse loop dm_mod snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_timer snd ohci1394 mxser snd_page_alloc floppy nvidia(P) pcspkr i2c_nforce2 forcedeth joydev ieee1394 sr_mod sg cdrom button fan processor ide_pci_generic amd74xx siimage ide_core pata_sil680 sata_sil24 pata_amd ata_generic sata_nv thermal thermal_sys [last unloaded: preloadtrace]
Nov 14 23:06:38 willow kernel: [73505.486673] CPU 3:
Nov 14 23:06:38 willow kernel: [73505.486684] Modules linked in: ip6t_LOG xt_tcpudp xt_pkttype ipt_LOG xt_limit snd_pcm_oss snd_mixer_oss snd_seq snd_seq_device edd cpufreq_conservative ip6t_REJECT cpufreq_userspace nf_conntrack_ipv6 cpufreq_powersave ip6table_raw acpi_cpufreq xt_NOTRACK ipt_REJECT xt_state iptable_raw iptable_filter ip6table_mangle nf_conntrack_netbios_ns nf_conntrack_ipv4 nf_conntrack nf_defrag_ipv4 ip_tables ip6table_filter ip6_tables x_tables fuse loop dm_mod snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm snd_timer snd ohci1394 mxser snd_page_alloc floppy nvidia(P) pcspkr i2c_nforce2 forcedeth joydev ieee1394 sr_mod sg cdrom button fan processor ide_pci_generic amd74xx siimage ide_core pata_sil680 sata_sil24 pata_amd ata_generic sata_nv thermal thermal_sys [last unloaded: preloadtrace]
Nov 14 23:06:38 willow kernel: [73505.486981] Pid: 19785, comm: named Tainted: P 2.6.31.5-0.1-desktop #1 System Product Name
Nov 14 23:06:38 willow kernel: [73505.486999] RIP: 0010:[<ffffffffa002b030>] [<ffffffffa002b030>] 0xffffffffa002b030
Nov 14 23:06:38 willow kernel: [73505.487015] RSP: 0018:ffff88020486bf80 EFLAGS: 00000297
Nov 14 23:06:38 willow kernel: [73505.487027] RAX: 0000000000000002 RBX: 0000000000000000 RCX: 00000000006a2bd0
Nov 14 23:06:38 willow kernel: [73505.487041] RDX: 00000000000001b6 RSI: 0000000000000241 RDI: 00000000006a2bb0
Nov 14 23:06:38 willow kernel: [73505.487055] RBP: 00007fff010adc30 R08: 0000000000000020 R09: 0000000000000020
Nov 14 23:06:38 willow kernel: [73505.487069] R10: 00000000006a2bd0 R11: 0000000000000246 R12: 0000000000000001
Nov 14 23:06:38 willow kernel: [73505.487083] R13: 00000000006a4cb0 R14: 0000000000000001 R15: 0000000000000241
Nov 14 23:06:38 willow kernel: [73505.487098] FS: 0000000000000000(0000) GS:ffff88000909e000(0000) knlGS:0000000000000000
Nov 14 23:06:38 willow kernel: [73505.487114] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Nov 14 23:06:38 willow kernel: [73505.487129] CR2: 00007fb6947fb92c CR3: 0000000001001000 CR4: 00000000000006e0
Nov 14 23:06:38 willow kernel: [73505.487142] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Nov 14 23:06:38 willow kernel: [73505.487157] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Nov 14 23:06:38 willow kernel: [73505.487171] Call Trace:
Nov 14 23:06:38 willow kernel: [73505.487186] Inexact backtrace:
Nov 14 23:06:38 willow kernel: [73505.487189]
Nov 14 23:06:58 willow kernel: [73505.487201] <#MC> <<EOE>>

Sorry to reply to my own post but it looks like it’s the 64bit version that is busted.

Steve Rooke

I’ve just tried installing the 32-bit (i586) version of bind but still get the same problem.

Cheers,
Steve Rooke

Not that I’m an expert or anything, but I see ip6 mentioned in both outputs several times.

Maybe try disabling that through YaST (the admin control panel thing) and see if it makes any difference.

Also try running the system with the wireless (if you have wireless) turned off, or unplugged if it is an external USB dongle, and see if that improves things.

I had already disabled ipv6 on both machines. I’ll try enabling it to see if that fixes the problem. I don’t want to use ipv6 on these machines though…

This looks like a kernel issue: a user-space app shouldn’t be able to cause an oops. Have you checked the Novell bugzilla to see if any such issue has been reported?

Also, do you have AppArmor or something similar enabled? Try without it.

BINGO! Thanks ken_yap: it’s indeed an apparmor issue. Running the update profile wizard found unhandled named issues. Updating the profile resolved the problem!
Thanks a lot.

Hello,

I have the same problem.
Could you tell me what updating the profile means? Is it this?


zypper up apparmor-profiles

The output:


Loading repository data...
Reading installed packages...
No update candidate for 'apparmor-profiles'.
Resolving package dependencies...

Nothing to do.

Hi Akitos,

I personally don’t use zypper, but unless I’m very much mistaken, the command you issued tells zypper to look for a newer version of the apparmor-profiles package. That’s not what you want to do.
You want to run the AppArmor update profile wizard. You can find it in YaST -> Novell AppArmor -> Update Profile Wizard (or just run "yast2 LogProf" from the command line).
It’ll tell you there are unhandled events for named. Just update the profile and named will work fine.

cheers,
Stefan

It doesn’t do anything :(

# yast2 LogProf


  No unhandled AppArmor events were found in the system log.

Hm, and you still get a segfault? Is it exactly the same as the one I posted? If not you may have another issue…

No, the problem is the same. I’ve temporarily disabled AppArmor; I don’t have time to deal with it right now.

No need to. Just learn how to audit AppArmor.

Put the named profile into complain mode:
aa-complain /usr/sbin/named

Restart named; you should see something like this in /var/log/audit/audit.log:
type=APPARMOR_DENIED msg=audit(1259658330.088:65): operation="capable" pid=26879 parent=26878 profile="/usr/sbin/named" name="sys_resource"

Hint: APPARMOR_DENIED + sys_resource
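
If the audit log is busy, the relevant lines can be filtered out instead of read by eye. The function below is a rough sketch, not an AppArmor tool: the name denied_caps is made up, it assumes the record layout shown in the sample line above (with plain ASCII double quotes, as in a real audit.log), and it also accepts type=APPARMOR_ALLOWED records on the assumption that complain mode may log under that type.

```shell
# Hypothetical helper: list the capabilities AppArmor complained about
# for one profile. Reads audit-log lines on stdin and prints each
# capability name once.
denied_caps() {
    profile="$1"
    grep -E 'type=APPARMOR_(DENIED|ALLOWED)' \
        | grep -F 'operation="capable"' \
        | grep -F "profile=\"$profile\"" \
        | sed -n 's/.*[[:space:]]name="\([^"]*\)".*/\1/p' \
        | sort -u
}

# Typical use (as root):
#   denied_caps /usr/sbin/named < /var/log/audit/audit.log
```

For the sample line above this prints sys_resource, which is the capability to add to the profile.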

Edit the profile: /etc/apparmor.d/usr.sbin.named
It has no sys_resource capability (someone forgot it? ;) ). Add it below the other capabilities:
capability sys_resource,

Now switch back to enforce mode: aa-enforce /usr/sbin/named

…and restart named. From now on it should work fine.
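
For anyone who has to repeat the profile edit on several machines, it can be scripted. This is only a sketch under assumptions: add_capability is a made-up helper name, it assumes the profile already contains at least one "capability" line to anchor on, and it does nothing if the rule is already there. Back up the profile before trying it.

```shell
# Hypothetical helper: insert "capability <name>," after the last
# existing capability line of an AppArmor profile file.
add_capability() {
    file="$1"
    cap="$2"
    # Nothing to do if the rule is already present.
    if grep -Eq "^[[:space:]]*capability[[:space:]]+$cap[[:space:]]*," "$file"; then
        return 0
    fi
    tmp=$(mktemp) || return 1
    awk -v cap="$cap" '
        { lines[NR] = $0 }
        /^[ \t]*capability[ \t]/ { last = NR }
        END {
            if (!last) exit 1              # no capability line to anchor on
            indent = lines[last]
            sub(/capability.*/, "", indent)  # keep only the leading whitespace
            for (i = 1; i <= NR; i++) {
                print lines[i]
                if (i == last) print indent "capability " cap ","
            }
        }
    ' "$file" > "$tmp" && cat "$tmp" > "$file"
    status=$?
    rm -f "$tmp"
    return $status
}

# Typical use (as root, after backing up the file):
#   add_capability /etc/apparmor.d/usr.sbin.named sys_resource
```

The file is rewritten in place with cat rather than mv so its owner and permissions stay as they were; the aa-enforce and restart steps above still have to be run afterwards.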

Best regards from Poland ;)
Jakub

Thanks
Everything works!