Re: Page fault in FreeBSD 5.3 on IBM e325, Dual AMD64 2.2GHz, 4GB RAM, ServeRAID 6M - debug logs

From: Ganbold <ganbold_at_micom.mng.net>
Date: Mon, 22 Nov 2004 15:27:56 +0800

Scott,

I didn't put debug.mpsafenet=0 in /boot/loader.conf. I compiled the kernel with:

	options DDB
	options	KDB

Here is the crash-time debug output and dmesg:

--------------------------------------------------------------------------------------------------------
Console: serial port
BIOS drive C: is disk0
BIOS 614kB/4127168kB available memory

FreeBSD/i386 bootstrap loader, Revision 1.1
(root_at_publica.ub.mng.net, Sun Nov 21 18:24:58 ULAT 2004)
Loading /boot/defaults/loader.conf
/boot/kernel/kernel text=0x2fa370 data=0x7a920+0x3b9d0 
-syms=[0x8+0x58278-+0x8+0x4b956|
KDB: debugger backends: ddb
KDB: current backend: ddb
Copyright (c) 1992-2004 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
	The Regents of the University of California. All rights reserved.
FreeBSD 5.3-STABLE #12: Mon Nov 22 12:04:57 ULAT 2004
     tsgan_at_publicc.ub.mng.net:/usr/obj/usr/src/sys/AMD
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: AMD Opteron(tm) Processor 248 (2193.17-MHz K8-class CPU)
   Origin = "AuthenticAMD"  Id = 0xf58  Stepping = 8
   Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
   AMD Features=0xe0500800<SYSCALL,NX,MMX+,LM,3DNow+,3DNow>
real memory  = 4227268608 (4031 MB)
avail memory = 4083208192 (3894 MB)
ACPI APIC Table: <PTLTD  	 APIC  >
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
MADT: Forcing active-low polarity and level trigger for SCI
ioapic0 <Version 1.1> irqs 0-23 on motherboard
ioapic1 <Version 1.1> irqs 24-27 on motherboard
ioapic2 <Version 1.1> irqs 28-31 on motherboard
acpi0: <PTLTD 	 XSDT> on motherboard
acpi0: Power Button (fixed)
unknown: I/O range not supported
unknown: I/O range not supported
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x8008-0x800b on acpi0
cpu0: <ACPI CPU> on acpi0
cpu1: <ACPI CPU> on acpi0
acpi_button0: <Power Button> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0x8080-0x80ff,0x8000-0x807f,0xcf8-0xcff iomem 0xd8000-0xdbfff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> at device 6.0 on pci0
pci1: <ACPI PCI bus> on pcib1
ohci0: <OHCI (generic) USB controller> mem 0xfc100000-0xfc100fff irq 19 at device 0.0 on pci1
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 3 ports with 3 removable, self powered
ohci1: <OHCI (generic) USB controller> mem 0xfc101000-0xfc101fff irq 19 at device 0.1 on pci1
ohci1: [GIANT-LOCKED]
usb1: OHCI version 1.0, legacy support
usb1: SMM does not respond, resetting
usb1: <OHCI (generic) USB controller> on ohci1
usb1: USB revision 1.0
uhub1: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
ukbd0: Silitek IBM USB Keyboard, rev 1.10/1.00, addr 2, iclass 3/1
kbd1 at ukbd0
uhid0: Silitek IBM USB Keyboard, rev 1.10/1.00, addr 2, iclass 3/1
pci1: <display, VGA> at device 5.0 (no driver attached)
isab0: <PCI-ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
pci0: <mass storage, ATA> at device 7.1 (no driver attached)
pci0: <bridge, PCI-unknown> at device 7.3 (no driver attached)
pcib2: <ACPI PCI-PCI bridge> at device 10.0 on pci0
pci2: <ACPI PCI bus> on pcib2
bge0: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfe000000-0xfe00ffff,0xfe010000-0xfe01ffff irq 24 at device 1.0 on pci2
miibus0: <MII bus> on bge0
brgphy0: <BCM5704 10/100/1000baseTX PHY> on miibus0
brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge0: Ethernet address: 00:0d:60:14:cc:9e
bge1: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfe020000-0xfe02ffff,0xfe030000-0xfe03ffff irq 25 at device 1.1 on pci2
miibus1: <MII bus> on bge1
brgphy1: <BCM5704 10/100/1000baseTX PHY> on miibus1
brgphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge1: Ethernet address: 00:0d:60:14:cc:9f
pci2: <mass storage, SCSI> at device 2.0 (no driver attached)
pci0: <base peripheral, interrupt controller> at device 10.1 (no driver attached)
pcib3: <ACPI PCI-PCI bridge> at device 11.0 on pci0
pci3: <ACPI PCI bus> on pcib3
pcib4: <PCI-PCI bridge> at device 3.0 on pci3
pci4: <PCI bus> on pcib4
ips0: <Adaptec ServeRAID Adapter> mem 0xfe100000-0xfe100fff irq 28 at device 8.0 on pci4
ips0: [GIANT-LOCKED]
pci0: <base peripheral, interrupt controller> at device 11.1 (no driver attached)
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A, console
orm0: <ISA Option ROMs> at iomem 0xcb000-0xcb7ff,0xc9800-0xcafff,0xc8000-0xc97ff,0xc0000-0xc7fff on isa0
atkbdc0: <Keyboard controller (i8042)> at port 0x64,0x60 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
device_attach: atkbd0 attach returned 6
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 0.976 msec
ips0: adapter type: ServeRAID 6M (marco)
ips0: logical drives: 1
ips0: Logical Drive 0: RAID5 sectors: 355481600, state OK
ipsd0: <Logical Drive> on ips0
ipsd0: Logical Drive  (173575MB)
SMP: AP CPU #1 Launched!
Mounting root from ufs:/dev/ipsd0s1a
Pre-seeding PRNG: kickstart.
Loading configuration files.
Entropy harvesting: interrupts ethernet point_to_point kickstart.
swapon: adding /dev/ipsd0s1b as swap device
Starting file system checks:
/dev/ipsd0s1a: FILE SYSTEM CLEAN; SKIPPING CHECKS
/dev/ipsd0s1a: clean, 507157 free (7565 frags, 62449 blocks, 0.7% fragmentation)
/dev/ipsd0s1f: FILE SYSTEM CLEAN; SKIPPING CHECKS
/dev/ipsd0s1f: clean, 1012200 free (40 frags, 126520 blocks, 0.0% fragmentation)
/dev/ipsd0s1d: FILE SYSTEM CLEAN; SKIPPING CHECKS
/dev/ipsd0s1d: clean, 4876181 free (45669 frags, 603814 blocks, 0.4% fragmentation)
/dev/ipsd0s1e: FILE SYSTEM CLEAN; SKIPPING CHECKS
/dev/ipsd0s1e: clean, 4965575 free (1087 frags, 620561 blocks, 0.0% fragmentation)
/dev/ipsd0s1g: FILE SYSTEM CLEAN; SKIPPING CHECKS
/dev/ipsd0s1g: clean, 53622407 free (671 frags, 6702717 blocks, 0.0% fragmentation)
Setting hostname: publica.ub.mng.net.
bge0: flags=8843<UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST> mtu 1500
	options=1a<TXCSUM,VLAN_MTU,VLAN_HWTAGGING>
	inet 202.179.0.80 netmask 0xffffffe0 broadcast 202.179.0.95
	ether 00:0d:60:14:cc:9e
	media: Ethernet 100baseTX <full-duplex>
	status: active
lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384
	inet 127.0.0.1 netmask 0xff000000
add net default: gateway 202.179.0.65
Additional routing options:.
Starting devd.
kbdcontrol: cannot open /dev/kbd1: Device busy
Mounting NFS file systems:.
Starting syslogd.
Nov 22 13:49:36 publica syslogd: kernel boot file is /boot/kernel/kernel

ELF ldconfig path: /lib /usr/lib /usr/lib/compat /usr/X11R6/lib /usr/local/lib
Starting usbd.
Starting local daemons:.
Updating motd.
Configuring syscons: blanktime.
Starting sshd.
Initial amd64 initialization:.
Additional ABI support:.
Starting cron.
Local package initialization:Starting apache.
Starting clamav_clamd.
Starting clamav_freshclam.
Starting exim.
Starting mysql.
Starting spamd.
.
Additional TCP options:.
Starting inetd.
Starting background file system checks in 60 seconds.

Mon Nov 22 13:49:41 ULAT 2004
ààààààNààààààNàNov ààNààààNoàNàNov 22 14:08:àNàààààààààààààààààààààààààààà

Fatal trap 12: page fault while in kernel mode
cpuid = 1; apic id = 01
fault virtual address	= 0x18
fault code		= supervisor read, page not present
instruction pointer	= 0x8:0xffffffff80277fc0
stack pointer	        = 0x10:0xffffffffb36ab830
frame pointer	        = 0x10:0xffffffffb36ab890
code segment		= base 0x0, limit 0xfffff, type 0x1b
			= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags	= interrupt enabled, resume, IOPL = 0
current process		= 44 (swi1: net)
[thread 100044]
Stopped at      m_copym+0x190:  incl    %ecx
db> bt
No such command
db> help
print       p           examine     x           search      set         write
w           delete      d           break       dwatch      watch       dhwatch
hwatch      step        s           continue    c           until       next
match       trace       where       call        show        ps          gdb
reset       kill        watchdog    thread      panic
db> trace
m_copym() at m_copym+0x190
tcp_output() at tcp_output+0xe91
tcp_input() at tcp_input+0x2f80
ip_input() at ip_input+0xff
netisr_processqueue() at netisr_processqueue+0x79
swi_net() at swi_net+0x14d
ithread_loop() at ithread_loop+0xde
fork_exit() at fork_exit+0x8f
fork_trampoline() at fork_trampoline+0xe
--- trap 0, rip = 0, rsp = 0xffffffffb36abd00, rbp = 0 ---
db> trace
m_copym() at m_copym+0x190
tcp_output() at tcp_output+0xe91
tcp_input() at tcp_input+0x2f80
ip_input() at ip_input+0xff
netisr_processqueue() at netisr_processqueue+0x79
swi_net() at swi_net+0x14d
ithread_loop() at ithread_loop+0xde
fork_exit() at fork_exit+0x8f
fork_trampoline() at fork_trampoline+0xe
--- trap 0, rip = 0, rsp = 0xffffffffb36abd00, rbp = 0 ---
db>
--------------------------------------------------------------------------------------------------------

It seems to me the problem is related to the network stack and threading.
Am I right? How can I solve this problem?
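
One observation on the fault address, as a rough sketch rather than a diagnosis: a fault at virtual address 0x18 inside m_copym() looks like a small field offset read through a NULL pointer. The Python snippet below models the leading fields of struct m_hdr under the amd64 LP64 ABI and checks which field sits at offset 0x18. The field list is an assumption taken from memory of sys/mbuf.h in 5.x, not from this kernel's actual headers, and the class and helper names are only illustrative.

```python
import ctypes


# Hypothetical mirror of the leading fields of struct m_hdr (FreeBSD 5.x),
# assuming the amd64 LP64 ABI: 8-byte pointers, 4-byte ints, 2-byte shorts.
# The layout is an assumption and should be checked against sys/mbuf.h.
class MHdr(ctypes.Structure):
    _fields_ = [
        ("mh_next", ctypes.c_uint64),     # struct mbuf *: next buffer in chain
        ("mh_nextpkt", ctypes.c_uint64),  # struct mbuf *: next chain in queue
        ("mh_data", ctypes.c_uint64),     # caddr_t: location of data
        ("mh_len", ctypes.c_int32),       # int: amount of data in this mbuf
        ("mh_flags", ctypes.c_int32),     # int: flags
        ("mh_type", ctypes.c_int16),      # short: type of data
    ]


FAULT_ADDRESS = 0x18  # "fault virtual address" from the trap message


def fields_at(offset):
    """Return the names of the modelled fields that start at the given offset."""
    return [name for name, _ in MHdr._fields_
            if getattr(MHdr, name).offset == offset]


matches = fields_at(FAULT_ADDRESS)
print(matches)
```

If the assumed layout is right, the only match is mh_len (m_len), which would be consistent with m_copym() walking off the end of an mbuf chain and reading m_len through a NULL mbuf pointer. That fits the trace through tcp_output(), but it does not say why the chain was shorter than expected.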

thanks in advance,

Ganbold
Received on Mon Nov 22 2004 - 06:29:06 UTC
