Re: panic: System call lstat returning with 1 locks held

From: Scot Hetzel <swhetzel_at_gmail.com>
Date: Fri, 18 Jan 2008 01:40:11 -0600
Previous message sent prematurely.

On 1/18/08, Scot Hetzel <swhetzel_at_gmail.com> wrote:
> On 1/17/08, Pawel Jakub Dawidek <pjd_at_freebsd.org> wrote:
> > On Wed, Jan 16, 2008 at 08:10:03PM -0600, Scot Hetzel wrote:
> > > The local CVS repository is on a ZFS filesystem.  Is anyone seeing
> > > this problem on a UFS filesystem?
> >
> > WITNESS won't work on ZFS' lock by default. Please add:
> >
> > CFLAGS+=-DDEBUG
> >
> > to sys/modules/zfs/Makefile and recompile zfs kernel module. DEBUG
> > define tells ZFS not to add NOWITNESS flag at lock initialization time.
> >
>
> I rebuilt the zfs module as suggested.
>
> When I reboot, I am now seeing 4 different lock order reversals related to ZFS:
>
> 1. This lock order reversal occurs most often:
>
> lock order reversal:
>  1st 0xffffff0001b95838 dr->dt.di.dr_mtx (dr->dt.di.dr_mtx) _at_
> /usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/dbuf.c:1866
>  2nd 0xffffff00017531c0 db->db_mtx (db->db_mtx) _at_
> /usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/dbuf.c:1888
> KDB: stack backtrace:
> db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
> witness_checkorder() at witness_checkorder+0x606
> _sx_xlock() at _sx_xlock+0x52
> dbuf_sync_list() at dbuf_sync_list+0x215
> dbuf_sync_list() at dbuf_sync_list+0x194
> dnode_sync() at dnode_sync+0x385
> dmu_objset_sync() at dmu_objset_sync+0x116
> dsl_pool_sync() at dsl_pool_sync+0x153
> spa_sync() at spa_sync+0x39e
> txg_sync_thread() at txg_sync_thread+0x17d
> fork_exit() at fork_exit+0x12a
> fork_trampoline() at fork_trampoline+0xe
> --- trap 0, rip = 0, rsp = 0xffffffffd72a3d30, rbp = 0 ---
>
> 2. This lock order reversal is similar to the one above:
>
> lock order reversal:
>  1st 0xffffff0001b00d38 dr->dt.di.dr_mtx (dr->dt.di.dr_mtx) _at_
> /usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/dbuf.c:1866
>  2nd 0xffffff0001a27760 db->db_mtx (db->db_mtx) _at_
> /usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/dbuf.c:1837
> KDB: stack backtrace:
> db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
> witness_checkorder() at witness_checkorder+0x606
> _sx_xlock() at _sx_xlock+0x52
> dbuf_sync_list() at dbuf_sync_list+0xaf
> dbuf_sync_list() at dbuf_sync_list+0x194
> dnode_sync() at dnode_sync+0x385
> dmu_objset_sync() at dmu_objset_sync+0x116
> dsl_pool_sync() at dsl_pool_sync+0x72
> spa_sync() at spa_sync+0x39e
> txg_sync_thread() at txg_sync_thread+0x17d
> fork_exit() at fork_exit+0x12a
> fork_trampoline() at fork_trampoline+0xe
> --- trap 0, rip = 0, rsp = 0xffffffffd72a3d30, rbp = 0 ---
>
3. lock order reversal related to the write syscall

lock order reversal:
1st 0xffffff00269f6500 dn->dn_mtx (dn->dn_mtx) _at_
/usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/dnode.c:874
2nd 0xffffff0026338d38 dr->dt.di.dr_mtx (dr->dt.di.dr_mtx) _at_
/usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/dnode.c:875
KDB: Stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
witness_check_order() at witness_check_order+0x606
_sx_lock() at _sx_lock+0x52
dnode_new_blkid() at dnode_new_blkid+0x15b
dbuf_dirty() at dbuf_dirty+0x7dc
dmu_write_uio() at dmu_write_uio+0x167
zfs_freebsd_write() at zfs_freebsd_write+0x9b4
VOP_WRITE_APV() at VOP_WRITE_APV+0x131
vn_write() at vn_write+0x24f
dofilewrite() at dofilewrite+0x85
kern_writev() at kern_writev+0x60
write() at write+0x54
syscall() at syscall+0x1ce
Xfast_syscall() at Xfast_syscall+0xab
--- syscall (4, FreeBSD ELF64, write), rip = 0x8009f623c, rsp =
0x7729d0, rbp = 0x772a18 ---

4. lock order reversal related to the lstat syscall

lock order reversal:
1st 0xffffff0001578058 zfsvfs->z_um_lock (zfsvfs->z_um_lock) _at_
/usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/zfs_vnops.c:2949
2nd 0xffffff001725cd0 tx->tx_sync_lock (tx->tx_sync_lock) _at_
/usr/src/sys/modules/zfs/../../contrib/opensolaris/uts/common/fs/zfs/txg.c:414
KDB: Stack backtrace:
db_trace_self_wrapper() at db_trace_self_wrapper+0x2a
witness_check_order() at witness_check_order+0x606
_sx_lock() at _sx_lock+0x52
txg_wait_open() at txg_wait_open+0x34
dmu_tx_assign() at dmu_tx_assign+0x2a5
zfs_inactive() at zfs_inactive+0x21f
zfs_freebsd_inactive() at zfs_freebsd_inactive+0x18
VOP_INACTIVE_APV() at VOP_INACTIVE_APV+0xb5
vinactive() at vinactive+0x90
vput() at vput+0x24d
namei() at namei+0x29a
kern_lstat() at kern_lstat+0x5e
lstat() at lstat+0x2a
syscall() at syscall+0x1ce
Xfast_syscall() at Xfast_syscall+0xab
--- syscall (190, FreeBSD ELF64, lstat), rip = 0x8009e87ec, rsp =
0x72f1d0, rbp = 0x72f2a8 ---

The panic didn't occur after lock order reversal 4, instead it occured
after the system had displayed lock order reversal 2.

hp010# /usr/local/etc/cvsup/update.sh &
hp010# tail -f /var/log/cvsup.log
:
<lock order reversal 1>
:
<lock order reversal 3>
:
<lock order reversal 1>
<lock order reversal 2>
:
<lock order reversal 4>
:
<lock order reversal 1>
<lock order reversal 2>
Edit CVSROOT-ports/modules,v
:
:
Edit ports/audio/sonata/pkg-plist,v
panic: System call lstat returning with 1 locks held
cpuid = 0
KDB: enter: panic
[ thread pid 996 tid 10011 ]
stopped at kdb_enter+0x3d: movq $0,0x4ad188(%rip)
db> show lockedvnods
Locked vnodes
db>

Scot
Received on Fri Jan 18 2008 - 06:40:13 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:39:26 UTC