Re: Non-responsive 8.0-RC1 (now 8.0-STABLE)

From: Elliot Finley <efinley.lists_at_gmail.com>
Date: Sat, 5 Dec 2009 17:31:43 -0700
On Sat, Dec 5, 2009 at 3:48 PM, Peter Jeremy <peterjeremy_at_acm.org> wrote:

> On 2009-Nov-30 19:13:30 +1100, Peter Jeremy <peter_at_server.vk2pj.dyndns.org>
> wrote:
> >On 2009-Nov-29 08:56:55 +0100, Thomas Backman <serenity_at_exscape.org>
> wrote:
> >>
> >>On Nov 28, 2009, at 10:22 PM, Peter Jeremy wrote:
> >>
> >>> My main server is running 8.0/amd64 from between RC1 and RC2 and I've
> >>> recently had a couple of long-duration hangs on it during which time
> >>> processes doing I/O will stop responding.
> ...
> >It actually "hung" again just after I sent the original mail.  This
> >time I managed to get console access and could check the kernel state.
> >This showed that a number of processes were blocked on ZFS locks.
> >The most commonly reported state was 'tx->tx_quiesce_done_cv)'.
>
> I've upgraded to 8-STABLE from 30-Nov and the problem is still present,
> even after disabling the boinc processes.
>
> This seems to leave race conditions inside ZFS as the only option.
>
> Has anyone else seen anything like this?
>
>
I have a machine running 7.2 that does the same thing if I don't disable ZIL
and prefetch (probably just one of them triggers the hang, just haven't had
time to see which one).  I'll be upgrading it to 8-Stable in the next week
or so and I'll see if the problem persists.  One data point that may or may
not be relevant is that the process that always triggers the hangs is istgt
(iSCSI target from ports).

Elliot
Received on Sat Dec 05 2009 - 23:31:44 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:39:58 UTC