Re: panic on application core dump?

From: Konstantin Belousov <kostikbel_at_gmail.com>
Date: Sun, 22 Feb 2015 20:04:25 +0200
On Sun, Feb 22, 2015 at 09:34:29AM -0800, Sean Bruno wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA512
> 
> > Err.  Is it easily reproducible in your setup ? The core file vnode
> > is indeed unreferenced before notification is sent.
> > 
> > Try this.
> > 
> > diff --git a/sys/kern/kern_sig.c b/sys/kern/kern_sig.c
> > index 41da3dd..57f66b0 100644
> 
> Restarted my non-deterministic test case.  Three instances of qemu
> core dumped and the system did *not* panic.
> 
> However, this appears to be interfering with signal handling and
> reaping.  Applications seem to stall out and become
> unkillable/unreapable.  I have to reboot the system via panic/reset.
> 
What applications ?  What is the (kernel) backtrace for them ?

> sean
> 
> last pid: 41009;  load averages:  0.46,  0.29,  0.21    up 0+12:18:57  17:33:37
> 72 processes:  1 running, 69 sleeping, 2 zombie
> CPU:  0.9% user,  0.0% nice,  0.2% system,  0.0% interrupt, 98.8% idle
> Mem: 15M Active, 6675M Inact, 7201M Wired, 244K Cache, 2010M Free
> ARC: 4678M Total, 910M MFU, 3001M MRU, 6748K Anon, 70M Header, 691M Other
> Swap: 16G Total, 16G Free
> 
>   PID USERNAME    THR PRI NICE   SIZE    RES STATE   C   TIME    WCPU COMMAND
>   718 www           1  20    0 28708K  6636K kqread 12   0:08   0.70% nginx
>  1430 root          1  52    0 17180K  3448K wait   13   2:28   0.44% sh
>  1276 sbruno        1  20    0 21548K  8796K select 11   4:37   0.17% tmux
> 40832 root          1  20    0 22000K  3072K CPU7    7   0:00   0.08% top
>  1267 sbruno        1  20    0 86528K  7580K select 15   0:04   0.02% sshd
>   695 root          1  20    0 25496K  4884K select  9   0:03   0.01% ntpd
>   698 root          1  20    0 14492K  1992K select  7   0:04   0.01% powerd
>  1288 root          1  20    0 17180K  5336K zfs    14   1:13   0.00% sh
Is this the problem ^^^^ ?

If yes, do you have ddb compiled in ?  Show the output of 'show lockedvnods'
and 'show alllocks'.

> 79939 root          1  52    0 17180K  3384K wait   10   0:03   0.00% sh
>   750 root          1  20    0 24164K  5444K select  2   0:01   0.00% sendmail
>   444 unbound       1  20    0 34780K  9672K select  7   0:00   0.00% unbound
> 40727 root          1  20    0 17180K  5276K wait   14   0:00   0.00% sh
> 68893 root          1  26    0 17180K  5336K wait    3   0:00   0.00% sh
> 80937 root          1  24    0 17180K  5336K wait   13   0:00   0.00% sh
> 55102 root          1  24    0 17180K  5336K wait    3   0:00   0.00% sh
> 35713 root          1  25    0 17180K  5336K wait   10   0:00   0.00% sh
> 48828 root          1  24    0 17180K  5336K wait    6   0:00   0.00% sh
> 97473 root          1  26    0 17180K  5336K wait    0   0:00   0.00% sh
> 64113 root          1  24    0 17180K  5336K wait    5   0:00   0.00% sh
> 46980 root          1  27    0 17180K  5336K wait   12   0:00   0.00% sh
> 80439 root          1  24    0 17180K  5336K wait    7   0:00   0.00% sh
> 44960 root          1  20    0 17180K  5276K wait    3   0:00   0.00% sh
> 49661 root          1  27    0 17180K  5336K wait    8   0:00   0.00% sh
> 78496 root          1  52    0 17180K  5336K zfs     0   0:00   0.00% sh
> 69491 root          1  27    0 17180K  5336K wait    6   0:00   0.00% sh
> 75907 root          1  52    0 17180K  5336K zfs     8   0:00   0.00% sh
>   534 root          1  20    0 14524K  2200K select 12   0:00   0.00% syslogd
>   757 root          1  52    0 16620K  2296K nanslp 13   0:00   0.00% cron
>   747 root          1  20    0 59148K  6620K select  6   0:00   0.00% sshd
> 26179 root          1  31    0 23652K  4044K pause  14   0:00   0.00% csh
> 79371 root          1  20    0 17180K  5336K wait    9   0:00   0.00% sh
>  1268 sbruno        1  23    0 23652K  3736K pause   5   0:00   0.00% tcsh
>  1265 root          1  22    0 86528K  7284K select 11   0:00   0.00% sshd
>  1283 root          1  21    0 23652K  4044K pause   3   0:00   0.00% csh
>  1277 sbruno        1  23    0 23652K  3748K pause   6   0:00   0.00% tcsh
>  1282 sbruno        1  21    0 74188K  6348K wait   11   0:00   0.00% su
> 24565 sbruno        1  20    0 23652K  3740K pause   8   0:00   0.00% tcsh
>   438 root          1  20    0 13588K  4788K select 11   0:00   0.00% devd
> 26119 sbruno        1  20    0 74188K  6348K wait   13   0:00   0.00% su
>   753 smmsp         1  20    0 24164K  5392K pause   1   0:00   0.00% sendmail
Received on Sun Feb 22 2015 - 17:04:33 UTC
