Re: FreeBSD/Current: As of 2004/05/08 at 11:00am EST - FSCK lockup

From: Alain Hebert <ahebert_at_pubnix.net>
Date: Tue, 18 May 2004 16:10:14 -0400
Follow up on this.

    The situation is still hapenning.

    This time I also took the time to dump 0 the partitions and no 
problem.  fsck still lockup the kernel solid.

    Any ideas...

Alain Hebert wrote:

>    Hi,
>
> Situation:
>
>    The computer will lock up real tight (no keyboard light working, no 
> kernel debugger, no network, no disk) running fsck.
>    (Dual P4 Xeon 2.4Ghz, 2GB RAM, using a Promise PDC20378 SATA150 
> controller in mirror mode on two identical Seagate ST3160023AS drives)
>
>    This just happen after a lockup with XFree, thus I dont have a 
> theorie nor any infos on a way to figure out which fix/patch introduce 
> this problem.
>
> Work done:
>
>    Platform test: CPU's, Memory, Disk, Temperature, power supply.  All 
> Ok.
>
>    Was using a 2 weeks old current, updated to latest current this 
> morning, no luck.
>
>    Change the kernel from SMP to UP, no luck.
>
>    Try different method/parms with fsck, no luck
>
>    Been googling for reference or similar situation without luck.
>
> Things to do:
>
>    Have yet to try with RELENG_5_2.
>
>    use strace to see where this happen.
>
>    Add debugging code to fsck to find where this happen.
>
> Status:
>
>    The file system seems ok, all files are there, find works, no 
> obvious inconsistency, dd if=/dev/ar0s1 dump the entire disk without 
> any errors.
>
> Request:
>
>    Any hint?  This combinaison of hardware is deployed (10+ servers) 
> for months now and this is the first problem I encounter.  But this is 
> also the only one running current.
>
> Config:
>
> ----- # disklabel -r /dev/ar0s1
> # /dev/ar0s1:
> 8 partitions:
> #        size   offset    fstype   [fsize bsize bps/cpg]
>  a: 304107709  8388608    4.2BSD     2048 16384 28552
>  b:  8388608        0      swap                    c: 312496317        
> 0    unused        0     0         # "raw" part, don't edit
>
> ----- # tunefs -p /dev/ar0s1a
> tunefs: ACLs: (-a)                                         disabled
> tunefs: MAC multilabel: (-l)                               disabled
> tunefs: soft updates: (-n)                                 enabled
> tunefs: maximum blocks per file in a cylinder group: (-e)  2048
> tunefs: average file size: (-f)                            16384
> tunefs: average number of files in a directory: (-s)       64
> tunefs: minimum percentage of free space: (-m)             8%
> tunefs: optimization preference: (-o)                      time
> tunefs: volume label: (-L)                 
>
>------------------------------------------------------------------------
>
>Copyright (c) 1992-2004 The FreeBSD Project.
>Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
>	The Regents of the University of California. All rights reserved.
>FreeBSD 5.2-CURRENT #1: Sat May  8 10:56:30 EDT 2004
>    root_at_aal2.pubnix.net:/usr/src/sys/i386/compile/AAL
>Preloaded elf kernel "/boot/kernel/kernel" at 0xc0945000.
>Preloaded elf module "/boot/kernel/acpi.ko" at 0xc09451f4.
>Timecounter "i8254" frequency 1193182 Hz quality 0
>CPU: Intel(R) Xeon(TM) CPU 2.40GHz (2405.46-MHz 686-class CPU)
>  Origin = "GenuineIntel"  Id = 0xf27  Stepping = 7
>  Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
>  Hyperthreading: 2 logical CPUs
>real memory  = 2147418112 (2047 MB)
>avail memory = 2095943680 (1998 MB)
>ACPI APIC Table: <IntelR AWRDACPI>
>FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
> cpu0 (BSP): APIC ID:  0
> cpu1 (AP): APIC ID:  1
> cpu2 (AP): APIC ID:  6
> cpu3 (AP): APIC ID:  7
>ioapic0: Changing APIC ID to 4
>ioapic0 <Version 2.0> irqs 0-23 on motherboard
>Pentium Pro MTRR support enabled
>random: <entropy source, Software, Yarrow>
>npx0: [FAST]
>npx0: <math processor> on motherboard
>npx0: INT 16 interface
>acpi0: <IntelR AWRDACPI> on motherboard
>acpi0: [GIANT-LOCKED]
>pcibios: BIOS version 2.10
>acpi0: Power Button (fixed)
>Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
>acpi_timer0: <24-bit timer at 3.579545MHz> port 0x408-0x40b on acpi0
>cpu0: <ACPI CPU> port 0x530-0x537 on acpi0
>cpu1: <ACPI CPU> port 0x530-0x537 on acpi0
>cpu2: <ACPI CPU> port 0x530-0x537 on acpi0
>cpu3: <ACPI CPU> port 0x530-0x537 on acpi0
>acpi_tz0: <Thermal Zone> port 0x530-0x537 on acpi0
>acpi_button0: <Power Button> on acpi0
>acpi_button1: <Sleep Button> on acpi0
>pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
>pci0: <ACPI PCI bus> on pcib0
>agp0: <Intel 82875P host to AGP bridge> mem 0xf0000000-0xf7ffffff at device 0.0 on pci0
>agp0: Reserved 0x8000000 bytes for rid 0x10 type 3 at 0xf0000000
>pcib1: <PCI-PCI bridge> at device 1.0 on pci0
>pci1: <PCI bus> on pcib1
>pcib1: slot 0 INTA is routed to irq 16
>pci1: <display, VGA> at device 0.0 (no driver attached)
>pci1: <display> at device 0.1 (no driver attached)
>pcib2: <ACPI PCI-PCI bridge> at device 3.0 on pci0
>pcib2: could not get PCI interrupt routing table for \\_SB_.PCI0.CSAB - AE_NOT_FOUND
>pci2: <ACPI PCI bus> on pcib2
>pci2: <network, ethernet> at device 1.0 (no driver attached)
>uhci0: <Intel 82801EB (ICH5) USB controller USB-A> port 0xbc00-0xbc1f irq 16 at device 29.0 on pci0
>uhci0: Reserved 0x20 bytes for rid 0x20 type 4 at 0xbc00
>uhci0: [GIANT-LOCKED]
>usb0: <Intel 82801EB (ICH5) USB controller USB-A> on uhci0
>usb0: USB revision 1.0
>uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
>uhub0: 2 ports with 2 removable, self powered
>uhci1: <Intel 82801EB (ICH5) USB controller USB-B> port 0xb000-0xb01f irq 19 at device 29.1 on pci0
>uhci1: Reserved 0x20 bytes for rid 0x20 type 4 at 0xb000
>uhci1: [GIANT-LOCKED]
>usb1: <Intel 82801EB (ICH5) USB controller USB-B> on uhci1
>usb1: USB revision 1.0
>uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
>uhub1: 2 ports with 2 removable, self powered
>uhci2: <Intel 82801EB (ICH5) USB controller USB-C> port 0xb400-0xb41f irq 18 at device 29.2 on pci0
>uhci2: Reserved 0x20 bytes for rid 0x20 type 4 at 0xb400
>uhci2: [GIANT-LOCKED]
>usb2: <Intel 82801EB (ICH5) USB controller USB-C> on uhci2
>usb2: USB revision 1.0
>uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
>uhub2: 2 ports with 2 removable, self powered
>uhci3: <Intel 82801EB (ICH5) USB controller USB-D> port 0xb800-0xb81f irq 16 at device 29.3 on pci0
>uhci3: Reserved 0x20 bytes for rid 0x20 type 4 at 0xb800
>uhci3: [GIANT-LOCKED]
>usb3: <Intel 82801EB (ICH5) USB controller USB-D> on uhci3
>usb3: USB revision 1.0
>uhub3: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
>uhub3: 2 ports with 2 removable, self powered
>ehci0: <EHCI (generic) USB 2.0 controller> mem 0xfc100000-0xfc1003ff irq 23 at device 29.7 on pci0
>ehci0: Reserved 0x400 bytes for rid 0x10 type 3 at 0xfc100000
>ehci0: [GIANT-LOCKED]
>ehci_pci_attach: companion usb0
>ehci_pci_attach: companion usb1
>ehci_pci_attach: companion usb2
>ehci_pci_attach: companion usb3
>usb4: EHCI version 1.0
>usb4: companion controllers, 2 ports each: usb0 usb1 usb2 usb3
>usb4: <EHCI (generic) USB 2.0 controller> on ehci0
>usb4: USB revision 2.0
>uhub4: (0x8086) EHCI root hub, class 9/0, rev 2.00/1.00, addr 1
>uhub4: 8 ports with 8 removable, self powered
>pcib3: <ACPI PCI-PCI bridge> at device 30.0 on pci0
>pci3: <ACPI PCI bus> on pcib3
>fwohci0: <Texas Instruments TSB43AB22/A> mem 0xfb120000-0xfb123fff,0xfb126000-0xfb1267ff irq 20 at device 3.0 on pci3
>fwohci0: Reserved 0x800 bytes for rid 0x10 type 3 at 0xfb126000
>fwohci0: [GIANT-LOCKED]
>fwohci0: OHCI version 1.10 (ROM=1)
>fwohci0: No. of Isochronous channel is 4.
>fwohci0: EUI64 00:e0:18:00:00:43:2b:be
>fwohci0: Phy 1394a available S400, 2 ports.
>fwohci0: Link S400, max_rec 2048 bytes.
>firewire0: <IEEE1394(FireWire) bus> on fwohci0
>sbp0: <SBP-2/SCSI over FireWire> on firewire0
>fwe0: <Ethernet over FireWire> on firewire0
>if_fwe0: Fake Ethernet address: 02:e0:18:43:2b:be
>fwe0: Ethernet address: 02:e0:18:43:2b:be
>fwohci0: Initiate bus reset
>fwohci0: node_id=0xc800ffc0, gen=1, CYCLEMASTER mode
>firewire0: 1 nodes, maxhop <= 0, cable IRM = 0 (me)
>firewire0: bus manager 0 (me)
>atapci0: <Promise PDC20378 SATA150 controller> port 0x8400-0x847f,0x8000-0x800f,0x8c00-0x8c3f mem 0xfb100000-0xfb11ffff,0xfb125000-0xfb125fff irq 23 at device 4.0 on pci3
>atapci0: failed: rid 0x20 is memory, requested 4
>atapci0: Reserved 0x20000 bytes for rid 0x20 type 3 at 0xfb100000
>atapci0: Reserved 0x1000 bytes for rid 0x1c type 3 at 0xfb125000
>ata2: at 0xfb125000 on atapci0
>ata3: at 0xfb125000 on atapci0
>ata4: at 0xfb125000 on atapci0
>ahc0: <Adaptec 2940 Ultra SCSI adapter> port 0x8800-0x88ff mem 0xfb124000-0xfb124fff irq 22 at device 10.0 on pci3
>ahc0: Reserved 0x100 bytes for rid 0x10 type 4 at 0x8800
>ahc0: [GIANT-LOCKED]
>aic7880: Ultra Wide Channel A, SCSI Id=7, 16/253 SCBs
>pcib4: <PCI-PCI bridge> at device 11.0 on pci3
>pci4: <PCI bus> on pcib4
>pcib4: slot 4 INTA is routed to irq 23
>pcib4: slot 5 INTA is routed to irq 20
>fxp0: <Intel 82550 Pro/100 Ethernet> port 0x7000-0x703f mem 0xfb000000-0xfb01ffff,0xfb041000-0xfb041fff irq 23 at device 4.0 on pci4
>fxp0: Reserved 0x1000 bytes for rid 0x10 type 3 at 0xfb041000
>miibus0: <MII bus> on fxp0
>inphy0: <i82555 10/100 media interface> on miibus0
>inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
>fxp0: Ethernet address: 00:02:b3:ce:19:3f
>fxp0: [GIANT-LOCKED]
>fxp1: <Intel 82550 Pro/100 Ethernet> port 0x7400-0x743f mem 0xfb020000-0xfb03ffff,0xfb040000-0xfb040fff irq 20 at device 5.0 on pci4
>fxp1: Reserved 0x1000 bytes for rid 0x10 type 3 at 0xfb040000
>miibus1: <MII bus> on fxp1
>inphy1: <i82555 10/100 media interface> on miibus1
>inphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
>fxp1: Ethernet address: 00:02:b3:ce:19:40
>fxp1: [GIANT-LOCKED]
>ohci0: <OPTi 82C861 (FireLink) USB controller> mem 0xfb127000-0xfb127fff irq 20 at device 12.0 on pci3
>ohci0: Reserved 0x1000 bytes for rid 0x10 type 3 at 0xfb127000
>ohci0: [GIANT-LOCKED]
>usb5: OHCI version 1.0, legacy support
>usb5: <OPTi 82C861 (FireLink) USB controller> on ohci0
>usb5: USB revision 1.0
>uhub5: OPTi OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
>uhub5: 2 ports with 2 removable, self powered
>isab0: <PCI-ISA bridge> at device 31.0 on pci0
>isa0: <ISA bus> on isab0
>atapci1: <Intel ICH5 UDMA100 controller> port 0xf000-0xf00f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
>atapci1: Reserved 0x10 bytes for rid 0x20 type 4 at 0xf000
>atapci1: Reserved 0x8 bytes for rid 0x10 type 4 at 0x1f0
>atapci1: Reserved 0x1 bytes for rid 0x14 type 4 at 0x3f6
>ata0: at 0x1f0 irq 14 on atapci1
>atapci1: Reserved 0x8 bytes for rid 0x18 type 4 at 0x170
>atapci1: Reserved 0x1 bytes for rid 0x1c type 4 at 0x376
>ata1: at 0x170 irq 15 on atapci1
>atapci2: <Intel ICH5 SATA150 controller> port 0xd000-0xd00f,0xcc00-0xcc03,0xc800-0xc807,0xc400-0xc403,0xc000-0xc007 irq 18 at device 31.2 on pci0
>atapci2: Reserved 0x10 bytes for rid 0x20 type 4 at 0xd000
>atapci2: Reserved 0x8 bytes for rid 0x10 type 4 at 0xc000
>atapci2: Reserved 0x4 bytes for rid 0x14 type 4 at 0xc400
>ata5: at 0xc000 on atapci2
>atapci2: Reserved 0x8 bytes for rid 0x18 type 4 at 0xc800
>atapci2: Reserved 0x4 bytes for rid 0x1c type 4 at 0xcc00
>ata6: at 0xc800 on atapci2
>ichsmb0: <Intel 82801EB (ICH5) SMBus controller> port 0x500-0x51f irq 17 at device 31.3 on pci0
>ichsmb0: Reserved 0x20 bytes for rid 0x20 type 4 at 0x500
>ichsmb0: [GIANT-LOCKED]
>smbus0: <System Management Bus> on ichsmb0
>smb0: <SMBus generic I/O> on smbus0
>pcm0: <Intel ICH5 (82801EB)> port 0xdc00-0xdc3f,0xd800-0xd8ff mem 0xfc102000-0xfc1020ff,0xfc101000-0xfc1011ff irq 17 at device 31.5 on pci0
>pcm0: Reserved 0x200 bytes for rid 0x18 type 3 at 0xfc101000
>pcm0: Reserved 0x100 bytes for rid 0x1c type 3 at 0xfc102000
>pcm0: [GIANT-LOCKED]
>pcm0: <Analog Devices AD1985 AC97 Codec>
>fdc0: <Enhanced floppy controller (i82077, NE72065 or clone)> port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
>fdc0: FIFO enabled, 8 bytes threshold
>fd0: <1440-KB 3.5" drive> on fdc0 drive 0
>sio0 port 0x3f8-0x3ff irq 4 on acpi0
>sio0: type 16550A
>sio1 port 0x2f8-0x2ff irq 3 on acpi0
>sio1: type 16550A
>ppc0 port 0x778-0x77b,0x378-0x37f irq 7 drq 3 on acpi0
>ppc0: SMC-like chipset (ECP/EPP/PS2/NIBBLE) in COMPATIBLE mode
>ppc0: FIFO with 16/16/9 bytes threshold
>ppbus0: <Parallel port bus> on ppc0
>ppbus0: IEEE1284 device found 
>Probing for PnP devices on ppbus0:
>plip0: <PLIP network interface> on ppbus0
>lpt0: <Printer> on ppbus0
>lpt0: Interrupt-driven port
>ppi0: <Parallel I/O> on ppbus0
>atkbdc0: <Keyboard controller (i8042)> port 0x64,0x60 irq 1 on acpi0
>atkbd0: <AT Keyboard> irq 1 on atkbdc0
>kbd0 at atkbd0
>atkbd0: [GIANT-LOCKED]
>psm0: <PS/2 Mouse> irq 12 on atkbdc0
>psm0: [GIANT-LOCKED]
>psm0: model IntelliMouse, device ID 3
>orm0: <Option ROMs> at iomem 0xd5000-0xd57ff,0xcc000-0xccfff,0xc0000-0xcbfff on isa0
>pmtimer0 on isa0
>sc0: <System console> at flags 0x100 on isa0
>sc0: VGA <16 virtual consoles, flags=0x300>
>vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
>Timecounters tick every 10.000 msec
>acd0: DVDR <PIONEER DVD-RW DVR-106D> at ata0-master PIO4
>ad4: 152627MB <ST3160023AS> [310101/16/63] at ata2-master SATA150
>ad6: 152627MB <ST3160023AS> [310101/16/63] at ata3-master SATA150
>ar0: 152587MB <ATA RAID1 array> [19452/255/63] status: READY subdisks:
> disk0 READY on ad4 at ata2-master
> disk1 READY on ad6 at ata3-master
>Waiting 15 seconds for SCSI devices to settle
>SMP: AP CPU #3 Launched!
>SMP: AP CPU #1 Launched!
>SMP: AP CPU #2 Launched!
>cd1 at ata0 bus 0 target 0 lun 0
>cd1: <PIONEER DVD-RW  DVR-106D 1.07> Removable CD-ROM SCSI-0 device 
>cd1: 16.000MB/s transfers
>cd1: Attempt to query device size failed: NOT READY, Medium not present
>cd0 at ahc0 bus 0 target 0 lun 0
>cd0: <YAMAHA CRW2100S 1.0G> Removable CD-ROM SCSI-2 device 
>cd0: 10.000MB/s transfers (10.000MHz, offset 7)
>cd0: Attempt to query device size failed: NOT READY, Medium not present - tray closed
>Mounting root from ufs:/dev/ar0s1a
>WARNING: / was not properly dismounted
>WARNING: / was not properly dismounted
>IP Filter: v3.4.31 initialized.  Default = pass all, Logging = enabled
>  
>
>------------------------------------------------------------------------
>
>_______________________________________________
>freebsd-current_at_freebsd.org mailing list
>http://lists.freebsd.org/mailman/listinfo/freebsd-current
>To unsubscribe, send any mail to "freebsd-current-unsubscribe_at_freebsd.org"
>  
>

-- 
Alain Hebert                                ahebert_at_pubnix.net   
PubNIX Inc.        
P.O. Box 175       Beaconsfield, Quebec     H9W 5T7	
tel 514-990-5911   http://www.pubnix.net    fax 514-990-9443
Received on Tue May 18 2004 - 14:08:27 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:37:54 UTC