From: Sachin Sant on
While executing LTP Controller tests(memcg regression) on
a POWER6 box came across this following OOPS.

Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a child
Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB
Memory cgroup out of memory: kill process 9140 (memcg_test_1) score 3 or a child
Killed process 9140 (memcg_test_1) vsz:3456kB, anon-rss:448kB, file-rss:1088kB
Unable to handle kernel paging request for data at address 0x720072007200720
Faulting instruction address: 0xc00000000015b778
Oops: Kernel access of bad area, sig: 11 [#2]
SMP NR_CPUS=1024 NUMA pSeries
last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod
NIP: c00000000015b778 LR: c00000000015b740 CTR: 0000000000000000
REGS: c000000009812ff0 TRAP: 0300 Tainted: G D (2.6.35-rc2-autotest)
MSR: 8000000000009032 <EE,ME,IR,DR> CR: 44004424 XER: 00000001
DAR: 0720072007200720, DSISR: 0000000040000000
TASK = c000000005fb1100[9155] 'umount' THREAD: c000000009810000 CPU: 0
GPR00: 0000000000000000 c000000009813270 c000000000d3d7a0 0000000000000000
GPR04: 0000000000008050 0000000000160000 0000000000000027 c00000000f2c6870
GPR08: 00000000000006a5 c000000000b16870 c000000000cf0140 000000000e7b0000
GPR12: 0000000024004428 c000000007440000 0000000000008000 fffffffffffff000
GPR16: 0000000000000000 c0000000098138f0 000000000000002d 0000000000000027
GPR20: 0000000000000000 0000000000000027 0000000000000000 c000000007063138
GPR24: ffffffffffffffff 0000000000000000 c00000000019bafc c00000000e02e000
GPR28: 0000000000000001 0000000000008050 c000000000ca6b00 0720072007200720
NIP [c00000000015b778] .kmem_cache_alloc+0xb0/0x13c
LR [c00000000015b740] .kmem_cache_alloc+0x78/0x13c
Call Trace:
[c000000009813270] [c00000000015b740] .kmem_cache_alloc+0x78/0x13c (unreliable)
[c000000009813310] [c00000000019bafc] .alloc_buffer_head+0x2c/0x78
[c000000009813390] [c00000000019c99c] .alloc_page_buffers+0x60/0x114
[c000000009813450] [c00000000019ca78] .create_empty_buffers+0x28/0x140
[c0000000098134e0] [c00000000019f2ec] .__block_prepare_write+0xe4/0x4f0
[c000000009813610] [c00000000019f94c] .block_write_begin_newtrunc+0xa8/0x120
[c0000000098136d0] [c00000000019fea0] .block_write_begin+0x34/0x8c
[c000000009813770] [c00000000022b458] .ext3_write_begin+0x13c/0x298
[c000000009813880] [c000000000117500] .generic_file_buffered_write+0x13c/0x320
[c0000000098139b0] [c000000000119c80] .__generic_file_aio_write+0x378/0x3dc
[c000000009813ab0] [c000000000119d68] .generic_file_aio_write+0x84/0xfc
[c000000009813b60] [c00000000016e460] .do_sync_write+0xac/0x10c
[c000000009813ce0] [c00000000016f204] .vfs_write+0xd0/0x1dc
[c000000009813d80] [c00000000016f418] .SyS_write+0x58/0xa0
[c000000009813e30] [c0000000000085b4] syscall_exit+0x0/0x40
Instruction dump:
38600000 409e0090 38000000 8b8d0212 980d0212 e96d0040 e93b0000 7ce95a14
7fe9582a 2fbf0000 419e0014 e81b001a <7c1f002a> 7c09592a 4800001c 7f46d378
---[ end trace f24cb0cb5729d2bb ]---

And few more of these. Previous snapshot release 2.6.35-rc1-git5(6c5de280b6...)
was good.

Thanks
-Sachin


--

---------------------------------
Sachin Sant
IBM Linux Technology Center
India Systems and Technology Labs
Bangalore, India
---------------------------------

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
From: Al Viro on
On Sun, Jun 06, 2010 at 08:36:54PM +0530, Sachin Sant wrote:

> And few more of these. Previous snapshot release 2.6.35-rc1-git5(6c5de280b6...)
> was good.

That's very odd, since
; git diff --stat 6c5de280b6..v2.6.35-rc2
Makefile | 2 +-
drivers/gpu/drm/i915/intel_display.c | 9 +++++++
fs/ext4/inode.c | 40 +++++++++++++++++++--------------
fs/ext4/move_extent.c | 3 ++
4 files changed, 36 insertions(+), 18 deletions(-)
;
and nothing of that looks like good candidates...
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
From: Markus Trippelsdorf on

>> And few more of these. Previous snapshot release 2.6.35-rc1-git5(6c5de280b6...)
>> was good.
>
>That's very odd, since
>; git diff --stat 6c5de280b6..v2.6.35-rc2
> Makefile | 2 +-
> drivers/gpu/drm/i915/intel_display.c | 9 +++++++
> fs/ext4/inode.c | 40 +++++++++++++++++++--------------
> fs/ext4/move_extent.c | 3 ++
> 4 files changed, 36 insertions(+), 18 deletions(-)
>
>and nothing of that looks like good candidates...

I may have the same problem on my machine.
(See also the thread: ext4 2.6.35-rc2 regression (ext4: Make sure the MOVE_EXT ioctl...))

general protection fault: 0000 [#1] SMP
last sysfs file: /sys/devices/pci0000:00/0000:00:11.0/host2/target2:0:0/2:0:0:0/block/sdb/size
CPU 2
Pid: 1683, comm: iptables-restor Not tainted 2.6.35-rc2-00033-gcc1f375 #46 M4A78T-E/System Product Name
RIP: 0010:[<ffffffff810cc6e6>] [<ffffffff810cc6e6>] kmem_cache_alloc+0x59/0xda
RSP: 0018:ffff88011c993d78 EFLAGS: 00010002
RAX: 0000000000000000 RBX: 0720072007200720 RCX: ffffffff810bd4c9
RDX: 00007f076cee3000 RSI: 00000000000000d0 RDI: ffff88011fc01800
RBP: ffff88011c993db8 R08: ffff880001b13f48 R09: 0000000000000000
R10: ffff88011d387c00 R11: ffff88011c983930 R12: ffff88011fc01800
R13: 0000000000000202 R14: 00000000000000d0 R15: 00000000000000d0
FS: 00007f076dc43700(0000) GS:ffff880001b00000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 00007f8595d364f8 CR3: 000000011b8b0000 CR4: 00000000000006e0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process iptables-restor (pid: 1683, threadinfo ffff88011c992000, task ffff88011ec09610)
Stack:
ffff88011d387c10 ffff88011c983930 ffff88011c993d98 000000000000fffa
<0> ffff88011d387bd0 00007f076cee3000 ffff88011f77ea40 0000000000000000
<0> ffff88011c993e08 ffffffff810bd4c9 ffff88011b8f5cc0 ffffffff810bd639
Call Trace:
[<ffffffff810bd4c9>] __split_vma+0x33/0x18d
[<ffffffff810bd639>] ? vma_merge+0x16/0x1fc
[<ffffffff810bdc01>] split_vma+0x23/0x28
[<ffffffff810bf572>] mprotect_fixup+0x146/0x54c
[<ffffffff810befff>] ? do_mmap_pgoff+0x2a4/0x2fe
[<ffffffff810bfaf0>] sys_mprotect+0x178/0x1f4
[<ffffffff8102b93b>] system_call_fastpath+0x16/0x1b
Code: 65 4c 8b 04 25 88 d4 00 00 48 8b 07 49 01 c0 49 8b 18 48 85 db 75 10 83 ca ff 44 89 f6 e8 58 fa ff ff 48 89 c3 eb 0b 48 63 47 18 <48> 8b 04 03 49 89 00 41 55 9d 48 85 db 74 15 41 81 e6 00 80 00
RIP [<ffffffff810cc6e6>] kmem_cache_alloc+0x59/0xda
RSP <ffff88011c993d78>
---[ end trace e2fb1ccd3cb9dd77 ]---
--
Markus
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
From: Maciej Rutecki on
I created a Bugzilla entry at
https://bugzilla.kernel.org/show_bug.cgi?id=16178
for your bug report, please add your address to the CC list in there, thanks!

On niedziela, 6 czerwca 2010 o 17:06:54 Sachin Sant wrote:
> While executing LTP Controller tests(memcg regression) on
> a POWER6 box came across this following OOPS.
>
> Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a
> child Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB,
> file-rss:1088kB Memory cgroup out of memory: kill process 9140
> (memcg_test_1) score 3 or a child Killed process 9140 (memcg_test_1)
> vsz:3456kB, anon-rss:448kB, file-rss:1088kB Unable to handle kernel paging
> request for data at address 0x720072007200720 Faulting instruction
> address: 0xc00000000015b778
> Oops: Kernel access of bad area, sig: 11 [#2]
> SMP NR_CPUS=1024 NUMA pSeries
> last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
> Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom
> sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP:
> c00000000015b778 LR: c00000000015b740 CTR: 0000000000000000
> REGS: c000000009812ff0 TRAP: 0300 Tainted: G D
> (2.6.35-rc2-autotest) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 44004424
> XER: 00000001
> DAR: 0720072007200720, DSISR: 0000000040000000
> TASK = c000000005fb1100[9155] 'umount' THREAD: c000000009810000 CPU: 0
> GPR00: 0000000000000000 c000000009813270 c000000000d3d7a0 0000000000000000
> GPR04: 0000000000008050 0000000000160000 0000000000000027 c00000000f2c6870
> GPR08: 00000000000006a5 c000000000b16870 c000000000cf0140 000000000e7b0000
> GPR12: 0000000024004428 c000000007440000 0000000000008000 fffffffffffff000
> GPR16: 0000000000000000 c0000000098138f0 000000000000002d 0000000000000027
> GPR20: 0000000000000000 0000000000000027 0000000000000000 c000000007063138
> GPR24: ffffffffffffffff 0000000000000000 c00000000019bafc c00000000e02e000
> GPR28: 0000000000000001 0000000000008050 c000000000ca6b00 0720072007200720
> NIP [c00000000015b778] .kmem_cache_alloc+0xb0/0x13c
> LR [c00000000015b740] .kmem_cache_alloc+0x78/0x13c
> Call Trace:
> [c000000009813270] [c00000000015b740] .kmem_cache_alloc+0x78/0x13c
> (unreliable) [c000000009813310] [c00000000019bafc]
> .alloc_buffer_head+0x2c/0x78 [c000000009813390] [c00000000019c99c]
> .alloc_page_buffers+0x60/0x114 [c000000009813450] [c00000000019ca78]
> .create_empty_buffers+0x28/0x140 [c0000000098134e0] [c00000000019f2ec]
> .__block_prepare_write+0xe4/0x4f0 [c000000009813610] [c00000000019f94c]
> .block_write_begin_newtrunc+0xa8/0x120 [c0000000098136d0]
> [c00000000019fea0] .block_write_begin+0x34/0x8c [c000000009813770]
> [c00000000022b458] .ext3_write_begin+0x13c/0x298 [c000000009813880]
> [c000000000117500] .generic_file_buffered_write+0x13c/0x320
> [c0000000098139b0] [c000000000119c80]
> .__generic_file_aio_write+0x378/0x3dc [c000000009813ab0]
> [c000000000119d68] .generic_file_aio_write+0x84/0xfc [c000000009813b60]
> [c00000000016e460] .do_sync_write+0xac/0x10c
> [c000000009813ce0] [c00000000016f204] .vfs_write+0xd0/0x1dc
> [c000000009813d80] [c00000000016f418] .SyS_write+0x58/0xa0
> [c000000009813e30] [c0000000000085b4] syscall_exit+0x0/0x40
> Instruction dump:
> 38600000 409e0090 38000000 8b8d0212 980d0212 e96d0040 e93b0000 7ce95a14
> 7fe9582a 2fbf0000 419e0014 e81b001a <7c1f002a> 7c09592a 4800001c 7f46d378
> ---[ end trace f24cb0cb5729d2bb ]---
>
> And few more of these. Previous snapshot release
> 2.6.35-rc1-git5(6c5de280b6...) was good.
>
> Thanks
> -Sachin
>

--
Maciej Rutecki
http://www.maciek.unixy.pl
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
From: KAMEZAWA Hiroyuki on
On Thu, 10 Jun 2010 22:00:57 +0200
Maciej Rutecki <maciej.rutecki(a)gmail.com> wrote:

> I created a Bugzilla entry at
> https://bugzilla.kernel.org/show_bug.cgi?id=16178
> for your bug report, please add your address to the CC list in there, thanks!
>

Hmm... It seems a panic in SLUB or SLAB.
Is .config available ?

-Kame


> On niedziela, 6 czerwca 2010 o 17:06:54 Sachin Sant wrote:
> > While executing LTP Controller tests(memcg regression) on
> > a POWER6 box came across this following OOPS.
> >
> > Memory cgroup out of memory: kill process 9139 (memcg_test_1) score 3 or a
> > child Killed process 9139 (memcg_test_1) vsz:3456kB, anon-rss:448kB,
> > file-rss:1088kB Memory cgroup out of memory: kill process 9140
> > (memcg_test_1) score 3 or a child Killed process 9140 (memcg_test_1)
> > vsz:3456kB, anon-rss:448kB, file-rss:1088kB Unable to handle kernel paging
> > request for data at address 0x720072007200720 Faulting instruction
> > address: 0xc00000000015b778
> > Oops: Kernel access of bad area, sig: 11 [#2]
> > SMP NR_CPUS=1024 NUMA pSeries
> > last sysfs file: /sys/devices/system/cpu/cpu1/cache/index1/shared_cpu_map
> > Modules linked in: quota_v2 quota_tree ipv6 fuse loop dm_mod sr_mod cdrom
> > sg sd_mod crc_t10dif ibmvscsic scsi_transport_srp scsi_tgt scsi_mod NIP:
> > c00000000015b778 LR: c00000000015b740 CTR: 0000000000000000
> > REGS: c000000009812ff0 TRAP: 0300 Tainted: G D
> > (2.6.35-rc2-autotest) MSR: 8000000000009032 <EE,ME,IR,DR> CR: 44004424
> > XER: 00000001
> > DAR: 0720072007200720, DSISR: 0000000040000000
> > TASK = c000000005fb1100[9155] 'umount' THREAD: c000000009810000 CPU: 0
> > GPR00: 0000000000000000 c000000009813270 c000000000d3d7a0 0000000000000000
> > GPR04: 0000000000008050 0000000000160000 0000000000000027 c00000000f2c6870
> > GPR08: 00000000000006a5 c000000000b16870 c000000000cf0140 000000000e7b0000
> > GPR12: 0000000024004428 c000000007440000 0000000000008000 fffffffffffff000
> > GPR16: 0000000000000000 c0000000098138f0 000000000000002d 0000000000000027
> > GPR20: 0000000000000000 0000000000000027 0000000000000000 c000000007063138
> > GPR24: ffffffffffffffff 0000000000000000 c00000000019bafc c00000000e02e000
> > GPR28: 0000000000000001 0000000000008050 c000000000ca6b00 0720072007200720
> > NIP [c00000000015b778] .kmem_cache_alloc+0xb0/0x13c
> > LR [c00000000015b740] .kmem_cache_alloc+0x78/0x13c
> > Call Trace:
> > [c000000009813270] [c00000000015b740] .kmem_cache_alloc+0x78/0x13c
> > (unreliable) [c000000009813310] [c00000000019bafc]
> > .alloc_buffer_head+0x2c/0x78 [c000000009813390] [c00000000019c99c]
> > .alloc_page_buffers+0x60/0x114 [c000000009813450] [c00000000019ca78]
> > .create_empty_buffers+0x28/0x140 [c0000000098134e0] [c00000000019f2ec]
> > .__block_prepare_write+0xe4/0x4f0 [c000000009813610] [c00000000019f94c]
> > .block_write_begin_newtrunc+0xa8/0x120 [c0000000098136d0]
> > [c00000000019fea0] .block_write_begin+0x34/0x8c [c000000009813770]
> > [c00000000022b458] .ext3_write_begin+0x13c/0x298 [c000000009813880]
> > [c000000000117500] .generic_file_buffered_write+0x13c/0x320
> > [c0000000098139b0] [c000000000119c80]
> > .__generic_file_aio_write+0x378/0x3dc [c000000009813ab0]
> > [c000000000119d68] .generic_file_aio_write+0x84/0xfc [c000000009813b60]
> > [c00000000016e460] .do_sync_write+0xac/0x10c
> > [c000000009813ce0] [c00000000016f204] .vfs_write+0xd0/0x1dc
> > [c000000009813d80] [c00000000016f418] .SyS_write+0x58/0xa0
> > [c000000009813e30] [c0000000000085b4] syscall_exit+0x0/0x40
> > Instruction dump:
> > 38600000 409e0090 38000000 8b8d0212 980d0212 e96d0040 e93b0000 7ce95a14
> > 7fe9582a 2fbf0000 419e0014 e81b001a <7c1f002a> 7c09592a 4800001c 7f46d378
> > ---[ end trace f24cb0cb5729d2bb ]---
> >
> > And few more of these. Previous snapshot release
> > 2.6.35-rc1-git5(6c5de280b6...) was good.
> >
> > Thanks
> > -Sachin
> >
>
> --
> Maciej Rutecki
> http://www.maciek.unixy.pl
>
> --
> To unsubscribe, send a message with 'unsubscribe linux-mm' in
> the body to majordomo(a)kvack.org. For more info on Linux MM,
> see: http://www.linux-mm.org/ .
> Don't email: <a href=mailto:"dont(a)kvack.org"> email(a)kvack.org </a>
>

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo(a)vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/