From: Norbert Preining on
Hi Reinette,

On Mo, 18 Jan 2010, reinette chatre wrote:
> > > Does that mean no netconsole either? Does anything show up in the logs?
> > > Is it easy to reproduce? If so, perhaps you can have increased debug at
> > > that time and hopefully something will be captured in the logs when the
> > > problem occurs.

I tried it today, but had "real work" (university job) to do. It worked,
and I found out that (up to now) it did *NOT* happen when I was only
pinging a server, but when I ssh-ed into my server it hung.

More testing tomorrow (here it is already 2am).

BTW, the logs were empty, unfortunately; it was a complete hard hang.

> > Jan 17 03:28:58 mithrandir kernel: [34535.332419] iwlagn 0000:06:00.0: BUG_ON idx doesn't match seq control idx=170, seq_idx=3466, seq=55456
> >
> > Actually many many many of these lines.
> >
>
> What you are seeing here is currently being looked into at
> http://bugzilla.intellinuxwireless.org/show_bug.cgi?id=2098 - could you
> please add your information there?

I did that, although I was not sure what information to provide.

Best wishes

Norbert
------------------------------------------------------------------------
Norbert Preining preining@{jaist.ac.jp, logic.at, debian.org}
JAIST, Japan TU Wien, Austria Debian TeX Task Force
DSA: 0x09C5B094 fp: 14DF 2E6C 0307 BE6D AD76 A9C0 D2BF 4AA3 09C5 B094
------------------------------------------------------------------------
THURNBY (n.)
A rucked-up edge of carpet or linoleum which everyone says someone
will trip over and break a leg unless it gets fixed. After a year or
two someone trips over it and breaks a leg.
--- Douglas Adams, The Meaning of Liff
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
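
[Editor's note] The iwlagn log line quoted in the message above carries
three numbers (idx, seq_idx, seq). As a minimal sketch, the Python below
decodes them using the standard 802.11 sequence-control layout (low 4 bits
are the fragment number, upper 12 bits the sequence number). The helper
name decode_seq_line and the queue size of 256 slots are assumptions made
for illustration; they are not taken from the driver source.

```python
import re

# 802.11 sequence-control field: low 4 bits = fragment number,
# upper 12 bits = sequence number.
SCTL_FRAG_MASK = 0x000F
SCTL_SEQ_MASK = 0xFFF0

# Matches the three fields as they appear in the quoted log line.
LOG_PATTERN = re.compile(r"idx=(\d+), seq_idx=(\d+), seq=(\d+)")


def decode_seq_line(line, queue_size=256):
    """Decode idx/seq_idx/seq from one log line (hypothetical helper).

    queue_size=256 is an assumption about the TX queue depth.
    """
    match = LOG_PATTERN.search(line)
    if match is None:
        raise ValueError("no idx/seq_idx/seq fields found")
    idx, seq_idx, seq = (int(group) for group in match.groups())
    sequence_number = (seq & SCTL_SEQ_MASK) >> 4
    return {
        "idx": idx,
        "seq_idx": seq_idx,
        "seq": seq,
        "sequence_number": sequence_number,
        "fragment": seq & SCTL_FRAG_MASK,
        # Slot the frame would map to under the assumed queue size.
        "expected_idx": sequence_number % queue_size,
    }


LINE = ("iwlagn 0000:06:00.0: BUG_ON idx doesn't match seq control "
        "idx=170, seq_idx=3466, seq=55456")

if __name__ == "__main__":
    for key, value in decode_seq_line(LINE).items():
        print(f"{key}: {value}")
```

Under these assumptions the logged seq_idx is simply the sequence number
extracted from seq, and the reported idx differs from the slot that the
sequence number would map to, which is consistent with the "doesn't match"
wording of the message.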
From: Norbert Preining on
Hi Reinette,

On Fr, 15 Jan 2010, reinette chatre wrote:
> > I am having repeatable complete hard lockups on my laptop with 2.6.33-rc4.
> > 2.6.32.3 works fine.
> >
> > I believe that it is related to the network, because sometimes I can
> > actually log in (GNOME session), and as soon as I do something network
> > related it suddenly hangs hard, not even SysRq working anymore.
> >
> > Interestingly it only happens at a specific AP where the ESSID is
> > hidden (at work). At home I can work without any problems (ESSID not
> > hidden).
> >
> > Unfortunately I cannot set up a serial console or similar.
>
> Does that mean no netconsole either? Does anything show up in the logs?
> Is it easy to reproduce? If so, perhaps you can have increased debug at
> that time and hopefully something will be captured in the logs when the
> problem occurs.

Ok, I can confirm that setting up the network is not the problem, nor
is pinging other hosts. But ssh-ing into another server
made it go boom. From the screenshot I attach it looks like something
in the TCP code (which would explain why it does not happen with pings);
below that I see tcp_data_snd_check.

I managed to switch in time to a console with tail -f syslog before
it hard locked up. The log files are empty, but I got a screenshot photo
which has some hopefully useful information. I cannot scroll up or down
anymore ...

If you want me to create a bug report, or if you create one in bugzilla,
I can also upload it there, but I attach it for now.

Best wishes

Norbert
------------------------------------------------------------------------
Norbert Preining preining@{jaist.ac.jp, logic.at, debian.org}
JAIST, Japan TU Wien, Austria Debian TeX Task Force
DSA: 0x09C5B094 fp: 14DF 2E6C 0307 BE6D AD76 A9C0 D2BF 4AA3 09C5 B094
------------------------------------------------------------------------
SOLENT (adj.)
Descriptive of the state of serene self-knowledge reached through
drink.
--- Douglas Adams, The Meaning of Liff
From: reinette chatre on
Hi Norbert,

On Mon, 2010-01-18 at 21:47 -0800, Norbert Preining wrote:
> On Fr, 15 Jan 2010, reinette chatre wrote:
> > > I am having repeatable complete hard lockups on my laptop with 2.6.33-rc4.
> > > 2.6.32.3 works fine.
> > >
> > > I believe that it is related to the network, because sometimes I can
> > > actually log in (GNOME session), and as soon as I do something network
> > > related it suddenly hangs hard, not even SysRq working anymore.
> > >
> > > Interestingly it only happens at a specific AP where the ESSID is
> > > hidden (at work). At home I can work without any problems (ESSID not
> > > hidden).
> > >
> > > Unfortunately I cannot set up a serial console or similar.
> >
> > Does that mean no netconsole either? Does anything show up in the logs?
> > Is it easy to reproduce? If so, perhaps you can have increased debug at
> > that time and hopefully something will be captured in the logs when the
> > problem occurs.
>
> Ok, I can confirm that setting up the network is not the problem, nor
> is it pinging other hosts. But ssh-ing into another server
> made it go boom. From the screenshot I attach it looks like something
> in TCP code (that explains why it does not happen in pings), below
> I see tcp_data_snd_check
>
> I managed to switch in time to a console with tail -f syslog before
> it hard locked up. The log files are empty, but I got a screenshot photo
> which has some hopefully useful information. I cannot scroll up or down
> anymore ...
>
> If you want me to create a bug report or you create one in bugzilla,
> I can also upload it there, but I attach it for now.

I see that it fails in skb_pull after being called from one of the RX
handlers. Let's add Johannes.

Johannes, does anything perhaps look familiar to you in this trace?

Thank you

Reinette


From: Johannes Berg on
On Tue, 2010-01-19 at 09:01 -0800, reinette chatre wrote:

> > If you want me to create a bug report or you create one in bugzilla,
> > I can also upload it there, but I attach it for now.
>
> I see that it fails in skb_pull after being called from one of the RX
> handlers. Let's add Johannes.
>
> Johannes, does anything perhaps look familiar to you in this trace?

Sorry, no, seems weird. The trace is not very useful unfortunately, is
this with CONFIG_FRAME_POINTER?

johannes
From: Norbert Preining on
On Di, 19 Jan 2010, Johannes Berg wrote:
> Sorry, no, seems weird. The trace is not very useful unfortunately, is
> this with CONFIG_FRAME_POINTER?

# CONFIG_FRAME_POINTER is not set

Do you need it?

Is anything else needed in the .config?

Best wishes

Norbert
------------------------------------------------------------------------
Norbert Preining preining@{jaist.ac.jp, logic.at, debian.org}
JAIST, Japan TU Wien, Austria Debian TeX Task Force
DSA: 0x09C5B094 fp: 14DF 2E6C 0307 BE6D AD76 A9C0 D2BF 4AA3 09C5 B094
------------------------------------------------------------------------
GREAT TOSSON (n.)
A fat book containing four words and six cartoons which cost £6.95.
--- Douglas Adams, The Meaning of Liff
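
[Editor's note] The exchange above turns on whether CONFIG_FRAME_POINTER is
enabled in the kernel .config. As a minimal sketch, the Python below
reports the state of an option from .config text, using the two line forms
that kernel configs use: "CONFIG_X=value" for an enabled option and
"# CONFIG_X is not set" for a disabled one. The function name config_state
and the sample text are hypothetical and only for illustration.

```python
def config_state(config_text, option):
    """Return the state of a kernel config option (hypothetical helper).

    Returns the value string (e.g. "y" or "m") if the option is set,
    "not set" if it is explicitly disabled, and "absent" if the option
    does not appear in the text at all.
    """
    disabled_marker = f"# {option} is not set"
    enabled_prefix = f"{option}="
    for raw_line in config_text.splitlines():
        line = raw_line.strip()
        if line == disabled_marker:
            return "not set"
        if line.startswith(enabled_prefix):
            return line[len(enabled_prefix):]
    return "absent"


# Sample text for illustration; the first line is the one quoted in the
# message above, the second is an assumed example of an enabled option.
SAMPLE_CONFIG = """\
# CONFIG_FRAME_POINTER is not set
CONFIG_EXAMPLE_OPTION=y
"""

if __name__ == "__main__":
    for name in ("CONFIG_FRAME_POINTER", "CONFIG_EXAMPLE_OPTION"):
        print(name, "->", config_state(SAMPLE_CONFIG, name))
```

To check a real configuration, the text of the .config file from the
kernel build directory could be read and passed in as config_text.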