Re: Unkillable and runaway processes

From: Pawel Jakub Dawidek <pjd_at_FreeBSD.org>
Date: Wed, 5 Sep 2007 16:17:59 +0200
On Tue, Sep 04, 2007 at 03:08:20PM +0200, Kenneth Vestergaard Schmidt wrote:
> Hello.
> 
> Our ZFS testbed is experiencing some weird problems with rsync. We run a
> nightly backup of about 1.6 TB data (that's how much is stored, not how
> much is transferred), but after the initial sync I haven't been able to
> get the machine through one full cycle.
> 
> After many hours of rsyncing data from 50+ machines, suddenly one
> rsync-process will hang, spinning on the CPU.
> 
> It switches state between CPU0, CPU1, RUN and 'zfs:(&', but doesn't
> really do anything. It can't be killed, and you can't reboot the machine
> - it'll get past syncing disks, but won't shutdown or reboot.
> 
> I can't do an 'ls' in the directory that rsync is running on - it'll
> just hang, too.
> 
> The machine is running current from August 29th.
> 
> I could use some pointers on what to do - is there some way I can debug
> this better, maybe give some better info?

Try disabling ZIL. This looks like a bug was already reported by Kris.
This was already reported to OpenSolaris.

-- 
Pawel Jakub Dawidek                       http://www.wheel.pl
pjd_at_FreeBSD.org                           http://www.FreeBSD.org
FreeBSD committer                         Am I Evil? Yes, I Am!

Received on Wed Sep 05 2007 - 12:19:23 UTC

This archive was generated by hypermail 2.4.0 : Wed May 19 2021 - 11:39:17 UTC