Re: Fatal trap 19: non-maskable interrupt trap while in kernel mode

From: Gabor MICSKO <gmicsko_at_szintezis.hu>
Date: Mon, 08 Nov 2004 17:32:50 +0100

Success!

This patch seems good. Thanks!

[...]
installed MI handler for int 11
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channel is 4.
fwohci0: EUI64 00:90:f5:00:00:32:10:37
fwohci0: Phy 1394a available S400, 1 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:90:f5:32:10:37
fwe0: MAC address: 02:90:f5:32:10:37
sbp0: <SBP-2/SCSI over FireWire> on firewire0
fwohci0: Initiate bus reset
fwohci0: node_id=0xc000ffc0, gen=1, CYCLEMASTER mode
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
[...]

Full log:

Type '?' for a list of commands, 'help' for more detailed help.
OK boot -Dh
/modules/acpi.ko text=0x42ab8 data=0x1cd4+0x878
syms=[0x4+0x6630+0x4+0x7f56]
Copyright (c) 2003, 2004 The DragonFly Project.
Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
DragonFly 1.1-Stable #3: Sun Nov  7 12:49:56 PST 2004
    root@:/usr/obj/usr/src/sys/GENERIC
TSC clock: 1804075595 Hz, i8254 clock: 1193171 Hz
CPU: Mobile AMD Athlon(tm) 64 Processor 3000+ (1804.10-MHz 686-class CPU)
  Origin = "AuthenticAMD"  Id = 0xf4a  Stepping = 10
  Features=0x78bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2>
  AMD Features=0xe0500000<<b20>,AMIE,<b29>,DSP,3DNow!>
real memory  = 267321344 (261056K bytes)
avail memory = 249200640 (243360K bytes)
Preloaded elf kernel "/kernel" at 0xc0662000.
Preloaded elf module "/modules/acpi.ko" at 0xc066221c.
Pentium Pro MTRR support enabled
md0: Malloc disk
pcibios: BIOS version 2.10
Using $PIR table, 10 entries at 0xc00fdf20
npx0: <math processor> on motherboard
npx0: INT 16 interface
Using XMM optimized bcopy/copyin/copyout
acpi0: <PTLTD   RSDT> on motherboard
installed MI handler for int 10
acpi0: Power Button (fixed)
cpu0: <ACPI CPU> on acpi0
acpi_tz0: <Thermal Zone> on acpi0
acpi_tz0: _CRT value is absurd, ignored (154.8C)
acpi_cmbat0: <Control Method Battery> on acpi0
acpi_button0: <Power Button> on acpi0
acpi_button1: <Sleep Button> on acpi0
acpi_acad0: <AC Adapter> on acpi0
acpi_lid0: <Control Method Lid Switch> on acpi0
acpi_ec0: <Embedded Controller: GPE 0xb> port 0x66,0x62 on acpi0
pcib0: <Host to PCI bridge> at pcibus 0 on motherboard
pci0: <PCI bus> on pcib0
agp0: <VIA 8385 host to PCI bridge> mem 0xe0000000-0xefffffff at device 0.0 on pci0
pcib1: <PCI to PCI bridge (vendor=1106 device=b188)> at device 1.0 on pci0
pci1: <PCI bus> on pcib1
pci1: <ATI model 4e50 graphics accelerator> at 0.0 irq 9
fwohci0: <Texas Instruments TSB43AB21/A/AI/A-EP> mem 0xd0000000-0xd0003fff,0xd0008000-0xd00087ff irq 11 at device 6.0 on pci0
installed MI handler for int 11
fwohci0: OHCI version 1.10 (ROM=1)
fwohci0: No. of Isochronous channel is 4.
fwohci0: EUI64 00:90:f5:00:00:32:10:37
fwohci0: Phy 1394a available S400, 1 ports.
fwohci0: Link S400, max_rec 2048 bytes.
firewire0: <IEEE1394(FireWire) bus> on fwohci0
fwe0: <Ethernet over FireWire> on firewire0
if_fwe0: Fake Ethernet address: 02:90:f5:32:10:37
fwe0: MAC address: 02:90:f5:32:10:37
sbp0: <SBP-2/SCSI over FireWire> on firewire0
fwohci0: Initiate bus reset
fwohci0: node_id=0xc000ffc0, gen=1, CYCLEMASTER mode
firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
firewire0: bus manager 0 (me)
re0: <RealTek 8169S Single-chip Gigabit Ethernet> port 0x1000-0x10ff mem 0xd0008800-0xd00088ff irq 5 at device 8.0 on pci0
miibus0: <MII bus> on re0
ukphy0: <Generic IEEE 802.3u media interface> on miibus0
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
re0: MAC address: 00:90:f5:32:10:37
installed MI handler for int 5
cbb0: <PCI-CardBus Bridge> at device 12.0 on pci0
cardbus0: <CardBus bus> on cbb0
pccard0: <16-bit PCCard bus> on cbb0
pci_cfgintr: 0:12 INTA routed to irq 9
installed MI handler for int 9
pci0: <unknown card> (vendor=0x1050, dev=0x8481) at 14.0 irq 5
pci0: <unknown card> (vendor=0x1050, dev=0x8482) at 14.1 irq 5
uhci0: <VIA 83C572 USB controller> port 0x1c00-0x1c1f at device 16.0 on pci0
pci_cfgintr: 0:16 INTA routed to irq 9
usb0: <VIA 83C572 USB controller> on uhci0
usb0: USB revision 1.0
uhub0: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 2 ports with 2 removable, self powered
uhub0: port error, restarting port 1
uhub0: port error, giving up port 1
uhub0: port error, restarting port 2
uhub0: port error, giving up port 2
uhci1: <VIA 83C572 USB controller> port 0x1c20-0x1c3f at device 16.1 on pci0
pci_cfgintr: 0:16 INTB routed to irq 9
usb1: <VIA 83C572 USB controller> on uhci1
usb1: USB revision 1.0
uhub1: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 2 ports with 2 removable, self powered
uhub1: port error, restarting port 1
uhub1: port error, giving up port 1
uhub1: port error, restarting port 2
uhub1: port error, giving up port 2
uhci2: <VIA 83C572 USB controller> port 0x1c40-0x1c5f at device 16.2 on pci0
pci_cfgintr: 0:16 INTC routed to irq 9
usb2: <VIA 83C572 USB controller> on uhci2
usb2: USB revision 1.0
uhub2: VIA UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub2: 2 ports with 2 removable, self powered
uhub2: port error, restarting port 1
uhub2: port error, giving up port 1
uhub2: port error, restarting port 2
uhub2: port error, giving up port 2
pci0: <USB controller> at 16.3
isab0: <PCI to ISA bridge (vendor=1106 device=3177)> at device 17.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <VIA 8235 ATA133 controller> port 0x1c60-0x1c6f at device 17.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
installed MI handler for int 14
ata1: at 0x170 irq 15 on atapci0
installed MI handler for int 15
pci0: <unknown card> (vendor=0x1106, dev=0x3059) at 17.5 irq 10
pci0: <VIA 82C686 AC97 Modem> at 17.6 irq 10
orm0: <Option ROMs> at iomem 0xc0000-0xcefff,0xcf000-0xcffff,0xd8000-0xdbfff,0xdc000-0xdffff on isa0
pmtimer0 on isa0
fdc0: ready for input in output
fdc0: cmd 3 failed at out byte 1 of 3
atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
callout_reset(0xc0552b5c) from 0xc03e58ff: callout was not initialized
callout_reset(c0468c00,c0552b5c,c03e58ff,c0552b5c) at callout_reset+0x2d
callout_reset(c0552b5c,a,c03e5864,c0552a80,ffffffff) at callout_reset+0x2d
atkbd_timeout(c0552a80) at atkbd_timeout+0x9b
atkbd_attach_unit(0,c0684920,0,1,1) at atkbd_attach_unit+0x84
atkbdattach(c604bff8,c604bff8,c604bb78,0,1) at atkbdattach+0xd9
device_probe_and_attach(c604bff8) at device_probe_and_attach+0xf2
bus_generic_attach(c604bb78,c604b860,c604bb78,0,1) at bus_generic_attach+0x16
atkbdc_attach(c604bb78,c604bb78,c06849b8,9,1) at atkbdc_attach+0x18c
device_probe_and_attach(c604bb78) at device_probe_and_attach+0xf2
isa_probe_children(c604b860) at isa_probe_children+0xde
configure(0,681c00,68f000,0,c013b45c) at configure+0x39
mi_startup(681000,0,0,0,0) at mi_startup+0x90
begin() at begin+0x43
installed MI handler for int 1
psm0: <PS/2 Mouse> irq 12 on atkbdc0
installed MI handler for int 12
psm0: model GlidePoint, device ID 0
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x100>
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
installed MI handler for int 4
sio1: can't drain, serial port might not exist, disabling
ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0
ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
ppbus0: <Parallel port bus> on ppc0
plip0: <PLIP network interface> on ppbus0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
installed MI handler for int 7
installed MI handler for int 0
system power profile changed to 'economy'
acpi_tz0: _CRT value is absurd, ignored (154.8C)
ad0: 38154MB <FUJITSU MHT2040AT> [77520/16/63] at ata0-master UDMA100
acd0: CD-RW <SAMSUNG CDRW/DVD SN-324F> at ata1-master PIO4
acpi_tz0: _CRT value is absurd, ignored (154.8C)
Mounting root from cd9660:/dev/acd0a
cd0 at ata1 bus 0 target 0 lun 0
cd0: <SAMSUNG CDRW/DVD SN-324F U204> Removable CD-ROM SCSI-0 device
cd0: 16.000MB/s transfers
cd0: cd present [123104 x 2048 byte records]
cd9660: RockRidge Extension
Loading configuration files.
.
Starting file system checks:
cd9660: /dev/acd0a: Device busy

Setting hostname: .
lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384
        inet6 ::1 prefixlen 128
        inet6 fe80::1%lo0 prefixlen 64 scopeid 0x4
        inet 127.0.0.1 netmask 0xff000000
 Additional routing options:.
Mounting NFS file systems:.
Building databases...
 nfsiodNFS access cache time=2
ELF ldconfig path: /usr/lib /usr/lib/gcc2 /usr/lib/gcc34 /usr/lib/compat /usr/local/lib
Starting local daemons:.
Updating motd.
Configuring syscons: blanktime.
Generating public/private rsa1 key pair.
Your identification has been saved in /etc/ssh/ssh_host_key.
Your public key has been saved in /etc/ssh/ssh_host_key.pub.
The key fingerprint is:
94:1f:0b:6d:63:e8:5a:15:91:13:e8:74:94:fe:1b:1f root@
Generating public/private dsa key pair.
Your identification has been saved in /etc/ssh/ssh_host_dsa_key.
Your public key has been saved in /etc/ssh/ssh_host_dsa_key.pub.
The key fingerprint is:
b9:08:92:ea:f6:3b:d8:02:22:ff:0f:98:92:d5:3e:1c root@
Generating public/private rsa key pair.
Your identification has been saved in /etc/ssh/ssh_host_rsa_key.
Your public key has been saved in /etc/ssh/ssh_host_rsa_key.pub.
The key fingerprint is:
7c:4f:08:84:59:b6:20:a7:a2:db:c6:32:29:89:7d:d8 root@
Starting sshd.
Initial i386 initialization:.
Additional ABI support:.
Local package initialization:.
Additional TCP options:.
Starting pfi...
Looking for pfi.conf on /dev/acd0c...mount_cd9660: /dev/acd0c: Device busy
 not found
Looking for pfi.conf on /dev/da0s1...mount_msdos: /dev/da0s1: Device not configured
 not found
Looking for pfi.conf on /dev/da1s1...mount_msdos: /dev/da1s1: Device not configured
 not found
Looking for pfi.conf on /dev/fd0...mount_msdos: /dev/fd0: Device not configured
 not found
Looking for pfi.conf on /dev/fd1...mount_msdos: /dev/fd1: Device not configured
 not found


On Mon, 2004-11-08 at 03:00, Hidetoshi Shimokawa wrote:
> At Sun, 7 Nov 2004 12:21:59 -0800 (PST),
> Matthew Dillon wrote:
> > 
> > 
> > :During boot, interrupts are disabled, so this shouldn't be a problem.
> > :And the interrupt vector is already initialized in fwohci_pci_attach()
> > :before fwohci_init() is called.
> > :It's a good idea that we should mask IT/IR interrupt before probing channels
> > :but it should have nothing to do with this problem.
> > :
> > :As interrupts are disabled during boot, it cannot be an ordinary interrupt;
> > :it must be an NMI. I think it's a PCI bus problem rather than RAM.
> > :
> > :Try the following patch,
> > :
> > :Index: fwohci_pci.c
> > :===================================================================
> > :RCS file: /home/dcvs/src/sys/bus/firewire/fwohci_pci.c,v
> > :retrieving revision 1.15
> > :diff -u -r1.15 fwohci_pci.c
> > :--- fwohci_pci.c	18 Jul 2004 12:37:03 -0000	1.15
> > :+++ fwohci_pci.c	7 Nov 2004 19:47:52 -0000
> > :@@ -238,6 +238,7 @@
> > : 		PCIM_CMD_SERRESPEN | PCIM_CMD_PERRESPEN;
> > : #if 1
> > : 	cmd &= ~PCIM_CMD_MWRICEN; 
> > :+	cmd &= ~(PCIM_CMD_SERRESPEN | PCIM_CMD_PERRESPEN);
> > : #endif
> > : 	pci_write_config(self, PCIR_COMMAND, cmd, 2);
> > :...
> > :
> > :I suppose their code doesn't enable above flags.
> > :...
> > :/\ Hidetoshi Shimokawa
> > :\/  simokawa_at_FreeBSD.org
> > 
> >     I think you've found it.  All the OpenBSD code does is enable the bus
> >     master bit.  It doesn't touch any of the other bits.
> > 
> >     The original FreeBSD commit associated with this issue is:
> > 
> > >revision 1.20
> > >date: 2003/03/24 03:47:36;  author: simokawa;  state: Exp;  lines: +6 -2
> > >Safe PCI configuration.
> > >- Clear PCIM_CMD_MWRICEN:
> > >        some chips seem to have problem with write invalidate.
> > >        clearing this bit fixes SBP timeout problem.
> > >
> > >Tested by: Michael Reifenberger <Michael.Reifenberger_at_Plaut.de>
> > >
> > >- Set PCIM_CMD_SERRESPEN and PCIM_CMD_PERRESPEN
> > >- Moderate value for latency timer.
> > 
> >     He doesn't explain *WHY* he is turning on SERRESPEN and PERRESPEN. 
> >     Generally, however, any device with its own on-board memory (as these
> >     devices have) is subject to parity errors on the PCI bus if that
> >     memory is not completely cleared on boot.  And that is what could be
> >     happening here.
> 
> Hmm, PERR and SERR indicate PCI bus parity errors and other fatal errors.
> I added them to detect broken hardware. This is the first report of this
> error I have ever received.
> 
> Are you sure it has something to do with clearing on-chip memory?
> Do you know how to clear them?
> 
> >     Note that in his commit message he had to turn off write-invalidate.
> >     That's a sure sign of on-chip parity checked memory not being initialized.
> 
> I thought PERR/SERR is independent of write-invalidate. Could you
> explain more?
> 
> >     I will roll another ISO with the change and post when it's ready.
> 
> Thanks,
> 
> /\ Hidetoshi Shimokawa
> \/  simokawa_at_FreeBSD.org
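
For reference, a minimal user-space sketch of what the quoted patch does to the
PCI command word. The bit values are the standard PCI command register bits and
the names follow pcireg.h; the helper name fwohci_patched_cmd() is hypothetical,
and the full set of bits OR'ed in is an assumption, since the diff context only
shows the tail of that statement.

```c
#include <assert.h>
#include <stdint.h>

/* Standard PCI command register bits (names as in pcireg.h). */
#define PCIM_CMD_MEMEN       0x0002 /* memory space enable */
#define PCIM_CMD_BUSMASTEREN 0x0004 /* bus master enable */
#define PCIM_CMD_MWRICEN     0x0010 /* memory write and invalidate enable */
#define PCIM_CMD_PERRESPEN   0x0040 /* parity error response enable */
#define PCIM_CMD_SERRESPEN   0x0100 /* SERR# enable */

/*
 * Hypothetical helper: returns the command word the patched attach
 * code would write back with pci_write_config().
 */
static uint16_t
fwohci_patched_cmd(uint16_t cmd)
{
	/* Assumed: the driver first turns these bits on. */
	cmd |= PCIM_CMD_MEMEN | PCIM_CMD_BUSMASTEREN | PCIM_CMD_MWRICEN |
	    PCIM_CMD_SERRESPEN | PCIM_CMD_PERRESPEN;

	/* Already in the driver: some chips misbehave with write invalidate. */
	cmd &= ~PCIM_CMD_MWRICEN;

	/* The patch: stop the device from reporting PERR/SERR. */
	cmd &= ~(PCIM_CMD_SERRESPEN | PCIM_CMD_PERRESPEN);

	/* Postcondition: neither error-reporting enable is left set. */
	assert((cmd & (PCIM_CMD_SERRESPEN | PCIM_CMD_PERRESPEN)) == 0);
	return cmd;
}
```

With the patch, only memory access and bus mastering stay enabled from this
group, which matches the remark in the thread that the OpenBSD code only sets
the bus master bit and leaves the error-reporting bits alone.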

-- 
Micskó Gábor
HP Accredited Platform Specialist, System Engineer (APS, ASE)
Szintézis Computer Rendszerház Rt.      
H-9021 Győr, Tihanyi Árpád út 2.
Tel: +36-96-502-216
Fax: +36-96-318-658
E-mail: gmicsko_at_szintezis.hu
Web: http://www.hup.hu/
GPG Key fingerprint: 6FA1 D8BF 3C73 0570 ED5D  82AC A3BE 4E6A CF95 5F50

Received on Mon Nov 08 2004 - 15:39:04 UTC
