mpt(4) under SMP + HTT (was: Re: Issues with mpt(4)?)

From: Ruslan Ermilov <ru_at_FreeBSD.org>
Date: Mon, 21 Apr 2003 23:15:33 +0300
An update.  Further investigation shows that this was caused by
SMP (HyperThreading) turned on in the kernel.  Disabling SMP and
HTT options gives a stable kernel.  The panic was like this:

panic: mpt_get_request: corrupted request free list
mp_lock = 0100001; cpuid = 1; lapic.id = 01000000

Sorry, no dump is available since duping is also prevented
by this panic.  Anyone has an idea on how to fix it?

On Sat, Apr 19, 2003 at 07:12:38AM +0300, Ruslan Ermilov wrote:
> One of my recently installed remote boxes started to
> boot up incredibly slow, and I dug up the following.
> What does that mean?
> 
> %%%
> Copyright (c) 1992-2003 The FreeBSD Project.
> Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
> 	The Regents of the University of California. All rights reserved.
> FreeBSD 4.8-STABLE #0: Wed Apr  9 04:29:55 PDT 2003
>     ru_at_otter.us.sunbay.com:/usr/obj/usr/src/sys/OTTER
> Timecounter "i8254"  frequency 1193182 Hz
> CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2392.05-MHz 686-class CPU)
>   Origin = "GenuineIntel"  Id = 0xf27  Stepping = 7
>   Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
>   Hyperthreading: 2 logical CPUs
> real memory  = 536805376 (524224K bytes)
> avail memory = 518684672 (506528K bytes)
> Changing APIC ID for IO APIC #0 from 0 to 2 on chip
> Changing APIC ID for IO APIC #1 from 0 to 3 on chip
> Changing APIC ID for IO APIC #2 from 0 to 4 on chip
> Programming 16 pins in IOAPIC #0
> IOAPIC #0 intpin 2 -> irq 0
> Programming 16 pins in IOAPIC #1
> Programming 16 pins in IOAPIC #2
> FreeBSD/SMP: Multiprocessor motherboard
>  cpu0 (BSP): apic id:  0, version: 0x00050014, at 0xfee00000
>  cpu1 (AP):  apic id:  1, version: 0x00050014, at 0xfee00000
>  io0 (APIC): apic id:  2, version: 0x000f0011, at 0xfec00000
>  io1 (APIC): apic id:  3, version: 0x000f0011, at 0xfec01000
>  io2 (APIC): apic id:  4, version: 0x000f0011, at 0xfec02000
> Preloaded elf kernel "kernel" at 0xc0342000.
> Pentium Pro MTRR support enabled
> Using $PIR table, 9 entries at 0xc00fc270
> npx0: <math processor> on motherboard
> npx0: INT 16 interface
> pcib0: <ServerWorks host to PCI bridge(unknown chipset)> on motherboard
> IOAPIC #1 intpin 0 -> irq 2
> IOAPIC #1 intpin 9 -> irq 3
> pci0: <PCI bus> on pcib0
> em0: <Intel(R) PRO/1000 Network Connection, Version - 1.5.31> port 0xecc0-0xecff mem 0xfe120000-0xfe13ffff irq 2 at device 2.0 on pci0
> em0:  Speed:100 Mbps  Duplex:Full
> fxp0: <Intel 82557/8/9 EtherExpress Pro/100(B) Ethernet> port 0xec80-0xecbf mem 0xfe100000-0xfe11ffff,0xfe142000-0xfe142fff irq 3 at device 4.0 on pci0
> fxp0: Ethernet address 00:02:b3:c9:7c:c9
> inphy0: <i82555 10/100 media interface> on miibus0
> inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
> pci0: <ATI Mach64-GR graphics accelerator> at 14.0
> atapci0: <ServerWorks CSB5 ATA100 controller> port 0x8b0-0x8bf,0x374-0x377,0x170-0x177,0x3f4-0x3f7,0x1f0-0x1f7 at device 15.1 on pci0
> ata0: at 0x1f0 irq 14 on atapci0
> ata1: at 0x170 irq 15 on atapci0
> pci0: <OHCI USB controller> at 15.2 irq 5
> isab0: <PCI to ISA bridge (vendor=1166 device=0225)> at device 15.3 on pci0
> isa0: <ISA bus> on isab0
> pcib1: <ServerWorks host to PCI bridge(unknown chipset)> on motherboard
> IOAPIC #1 intpin 1 -> irq 10
> IOAPIC #1 intpin 2 -> irq 11
> IOAPIC #1 intpin 13 -> irq 13
> pci1: <PCI bus> on pcib1
> ahc0: <Adaptec 3960D Ultra160 SCSI adapter> port 0xdc00-0xdcff mem 0xfcf21000-0xfcf21fff irq 10 at device 2.0 on pci1
> aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
> ahc1: <Adaptec 3960D Ultra160 SCSI adapter> port 0xd800-0xd8ff mem 0xfcf20000-0xfcf20fff irq 11 at device 2.1 on pci1
> aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs
> mpt0: <LSILogic 1030 Ultra4 Adapter> port 0xd400-0xd4ff mem 0xfcf00000-0xfcf0ffff,0xfcf10000-0xfcf1ffff irq 13 at device 4.0 on pci1
> pcib2: <Host to PCI bridge> on motherboard
> pci2: <PCI bus> on pcib2
> pcib3: <ServerWorks host to PCI bridge(unknown chipset)> on motherboard
> pci3: <PCI bus> on pcib3
> pcib4: <ServerWorks host to PCI bridge(unknown chipset)> on motherboard
> pci4: <PCI bus> on pcib4
> orm0: <Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xc8fff,0xc9000-0xccfff,0xec000-0xeffff on isa0
> atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0
> atkbd0: <AT Keyboard> irq 1 on atkbdc0
> psm0: <PS/2 Mouse> irq 12 on atkbdc0
> psm0: model Generic PS/2 mouse, device ID 0
> vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
> sc0: <System console> on isa0
> sc0: VGA <16 virtual consoles, flags=0x200>
> fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
> fdc0: FIFO enabled, 8 bytes threshold
> fd0: <1440-KB 3.5" drive> on fdc0 drive 0
> sio0 at port 0x3f8-0x3ff irq 4 on isa0
> sio0: type 16550A
> APIC_IO: Testing 8254 interrupt delivery
> APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
> APIC_IO: routing 8254 via 8259 and IOAPIC #0 intpin 0
> IP packet filtering initialized, divert enabled, rule-based forwarding enabled, default to accept, unlimited logging
> IPsec: Initialized Security Association Processing.
> SMP: AP CPU #1 Launched!
> acd0: MODE_SENSE_BIG command timeout - resetting
> ata1: resetting devices .. done
> acd0: CDROM <SAMSUNG CD-ROM SC-148C> at ata1-master PIO4
> Waiting 2 seconds for SCSI devices to settle
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020011mpt0: 
> mpt0: IOC Bus Reset Port: 0
> (probe2:mpt0:0:1:1): INQUIRY. CDB: 12 20 0 0 24 0 
> (probe2:mpt0:0:1:1): ABORTED COMMAND asc:48,0
> (probe2:mpt0:0:1:1): Initiator detected error message received
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: IOC Bus Reset Port: 0
> (probe1:mpt0:0:1:2): INQUIRY. CDB: 12 40 0 0 24 0 
> (probe1:mpt0:0:1:2): ABORTED COMMAND asc:48,0
> (probe1:mpt0:0:1:2): Initiator detected error message received
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: 	EvtLogData: IOCLogInfo: 0x110a0000
> mpt0: 	EvtLogData: Event Data:mpt0:   05020010mpt0: 
> mpt0: IOC Bus Reset Port: 0
> (probe1:mpt0:0:1:3): INQUIRY. CDB: 12 60 0 0 24 0 
> (probe1:mpt0:0:1:3): ABORTED COMMAND asc:48,0
> (probe1:mpt0:0:1:3): Initiator detected error message received
> pass2 at mpt0 bus 0 target 6 lun 0
> pass2: <SDR GEM318P 1> Fixed Processor SCSI-2 device 
> pass2: 3.300MB/s transfers
> Mounting root from ufs:/dev/da0s1a
> da0 at mpt0 bus 0 target 0 lun 0
> da0: <FUJITSU MAP3735NC 5605> Fixed Direct Access SCSI-3 device 
> da0: 320.000MB/s transfers (160.000MHz, offset 127, 16bit), Tagged Queueing Enabled
> da0: 70007MB (143374650 512 byte sectors: 255H 63S/T 8924C)
> (da0:mpt0:0:0:0): READ(06). CDB: 8 0 0 0 1 0 
> (da0:mpt0:0:0:0): ABORTED COMMAND asc:48,0
> (da0:mpt0:0:0:0): Initiator detected error message received
> %%%
> 
> -- 
> Ruslan Ermilov		Sysadmin and DBA,
> ru_at_sunbay.com		Sunbay Software AG,
> ru_at_FreeBSD.org		FreeBSD committer,
> +380.652.512.251	Simferopol, Ukraine
> 
> http://www.FreeBSD.org	The Power To Serve
> http://www.oracle.com	Enabling The Information Age



-- 
Ruslan Ermilov		Sysadmin and DBA,
ru_at_sunbay.com		Sunbay Software AG,
ru_at_FreeBSD.org		FreeBSD committer,
+380.652.512.251	Simferopol, Ukraine

http://www.FreeBSD.org	The Power To Serve
http://www.oracle.com	Enabling The Information Age

Received on Mon Apr 21 2003 - 11:15:43 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:37:04 UTC