Re: msk watchdog timeout

From: Pyun YongHyeon <pyunyh_at_gmail.com>
Date: Wed, 15 Oct 2008 21:34:40 +0900
On Wed, Oct 15, 2008 at 02:22:34PM +0200, Koen Martens wrote:
 > On Thu, Oct 04, 2007 at 10:13:48AM +0900, Pyun YongHyeon wrote:
 > > On Wed, Oct 03, 2007 at 01:31:32PM +0800, Kudo Chien wrote:
 > >  > > Thanks for testing. Would you submit a PR for the issue and assign
 > >  > > it to me? I'll let you know when I manage to find a clue.
 > >  > >
 > >  > OK. I've submitted a PR at
 > >  > http://www.freebsd.org/cgi/query-pr.cgi?pr=116853.
 > >  > Thank you.
 > >  > 
 > > 
 > > I've grabbed it. Thanks.
 > 
 > For what it's worth, I've been having instability issues with msk0
 > too. Included is a dmesg and pciconf output.
 > 
 > The problem occurs under load (rsyncing tens of gigabytes over a
 > gigabit link, for example). I tried configuring the switch port
 > down to 100 Mbit/s, in the hope that msk0 would be more stable. It
 > is, but it still goes down after a while with watchdog timeouts.
 > 
 > I am now running it with MSI disabled, and it appears to last
 > longer than before. But judging by what others have said on this
 > subject already, it might still go wrong after as much as a month.
 > 
 > Also, I never had these problems when the machine was still
 > on 6.x with the myk driver. The trouble started only after I
 > upgraded it to RELENG_7 this Tuesday.
 > 
 > This is a server that I need to put back into production. I could
 > give you some time on it before I do that, but that'd have to be
 > *right now*, so I guess that won't really work out.
 > 
 > I'll probably install a NIC to be used instead of the built-in
 > Yukon interface, to get back the required stability.
 > 

I'm not sure whether the 88E8050 also has a RAM buffer. Yukon
controllers seem to have silicon bugs on hardware with a RAM
buffer. msk(4) in HEAD has workaround code for the silicon bug.
Would you try the latest msk(4) from HEAD? (Just copy
if_msk.c and if_mskreg.h from HEAD to your box and rebuild the
kernel.) Also, please show me the verbose boot messages (the
msk(4)-related ones would be enough).
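
The steps above could be sketched roughly as follows. This is only
an illustration, not a tested procedure: HEAD_SRC (where a HEAD
source tree is checked out) and KERNCONF (the kernel config name)
are assumed placeholders, and update_msk and msk_lines are helper
names made up for this example. It assumes the running system's
sources live in /usr/src.

```sh
#!/bin/sh
# Assumed placeholders; adjust both to your system.
HEAD_SRC="${HEAD_SRC:-/path/to/head/src}"   # checkout of HEAD (hypothetical path)
KERNCONF="${KERNCONF:-GENERIC}"             # kernel config name (assumption)

# Back up the current driver files, copy the two files from HEAD
# over them, then rebuild and install the kernel.
update_msk() {
    cd /usr/src/sys/dev/msk || return 1
    cp if_msk.c if_msk.c.orig && cp if_mskreg.h if_mskreg.h.orig || return 1
    cp "$HEAD_SRC/sys/dev/msk/if_msk.c" \
       "$HEAD_SRC/sys/dev/msk/if_mskreg.h" . || return 1
    cd /usr/src &&
        make buildkernel KERNCONF="$KERNCONF" &&
        make installkernel KERNCONF="$KERNCONF"
}

# Print only the msk(4)-related lines (mskcN, mskN) from boot
# messages given as file arguments or on stdin.
msk_lines() {
    grep -E 'mskc?[0-9]' "$@"
}
```

After running update_msk as root and rebooting with verbose
messages (for example "boot -v" at the loader prompt), something
like "msk_lines /var/run/dmesg.boot" should give the msk(4)-related
part of the boot output.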

-- 
Regards,
Pyun YongHyeon
Received on Wed Oct 15 2008 - 10:36:43 UTC
