Hi
We habe to DSS v6 Machines running with iSCSI Failover
Every few days we get some error messages:
Code:
2009/09/08 03:23:01 kernel:CPU 1
2009/09/08 03:23:01 kernel:Pid: 1175, comm: bonding Tainted: G D 2.6.27.10-oe64-00000-g9b2116f #12
2009/09/08 03:23:01 kernel:RIP: 0010:[<ffffffff802961b8>] [<ffffffff802961b8>] filp_close+0x18/0xa0
2009/09/08 03:23:01 kernel:RSP: 0000:ffff8800ad9d9f48 EFLAGS: 00010246
2009/09/08 03:23:01 kernel:RAX: fffffffffffffff7 RBX: 0008ab5abe171500 RCX: 0000000000000003
2009/09/08 03:23:01 kernel:RDX: 0000000000000000 RSI: ffff88013e73a9c0 RDI: 0008ab5abe171500
2009/09/08 03:23:01 kernel:RBP: 0000000000000003 R08: 0000000000000003 R09: 00000000ffdfead8
2009/09/08 03:23:01 kernel:R10: ffff8800ad9d8000 R11: 0000000000000000 R12: 0000000000000000
2009/09/08 03:23:01 kernel:R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
2009/09/08 03:23:01 kernel:FS: 0000000000000000(0000) GS:ffff88013f871dc0(0063) knlGS:00000000f7d666c0
2009/09/08 03:23:01 kernel:CS: 0010 DS: 002b ES: 002b CR0: 000000008005003b
2009/09/08 03:23:01 kernel:CR2: 0000000009f66268 CR3: 000000007fb73000 CR4: 00000000000006a0
2009/09/08 03:23:01 kernel:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2009/09/08 03:23:01 kernel:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2009/09/08 03:23:01 kernel:Process bonding (pid: 1175, threadinfo ffff8800ad9d8000, task ffff88013b798b50)
2009/09/08 03:23:01 kernel:Stack: ffff88013e73a9c0 0000000000000003 0008ab5abe171500 ffffffff802962cc
2009/09/08 03:23:01 kernel:0000000000000003 00000000ffdfead8 0000000000000000 ffffffff80228282
2009/09/08 03:23:01 kernel:0000000000000000 0000000000000000 0000000000000000 0000000000000000
2009/09/08 03:23:01 kernel:Call Trace:
2009/09/08 03:23:01 kernel:[<ffffffff802962cc>] ? sys_close+0x8c/0xf0
2009/09/08 03:23:01 kernel:[<ffffffff80228282>] ? ia32_sysret+0x0/0xa
2009/09/08 03:23:01 kernel:
2009/09/08 03:23:01 kernel:
2009/09/08 03:23:01 kernel:Code: ff 66 90 89 f2 be 41 02 00 00 e9 f4 fd ff ff 66 66 66 90 48 83 ec 18 48 89 1c 24 48 89 6c 24 08 48 89 fb 4c 89 64 24 10 45 31 e4 <48> 83 7f 28 00 48 89 f5 74 50 48 8b 47 20 48 85 c0 75 35 48 89
2009/09/08 03:23:01 kernel:RSP <ffff8800ad9d9f48>
2009/09/08 03:23:01 kernel:PGD 203067 PUD 204067 PMD 0
2009/09/08 03:23:01 kernel:CPU 1
2009/09/08 03:23:01 kernel:Pid: 1175, comm: bonding Tainted: G D 2.6.27.10-oe64-00000-g9b2116f #12
2009/09/08 03:23:01 kernel:RIP: 0010:[<ffffffff802961b8>] [<ffffffff802961b8>] filp_close+0x18/0xa0
2009/09/08 03:23:01 kernel:RSP: 0000:ffff8800ad9d9db8 EFLAGS: 00010246
2009/09/08 03:23:01 kernel:RAX: ffff8800bdbd2810 RBX: ffffffffffffad80 RCX: 0000000000000003
2009/09/08 03:23:01 kernel:RDX: ffff8800bdbd2800 RSI: ffff88013e73a9c0 RDI: ffffffffffffad80
2009/09/08 03:23:01 kernel:RBP: 0000000000000002 R08: ffff8800ad9d8000 R09: 0000000000000000
2009/09/08 03:23:01 kernel:R10: 0000000000000200 R11: 0000000022100000 R12: 0000000000000000
2009/09/08 03:23:01 kernel:R13: ffff88013e73a9c0 R14: 0000000000000001 R15: 0000000000000001
2009/09/08 03:23:01 kernel:FS: 0000000000000000(0000) GS:ffff88013f871dc0(0000) knlGS:0000000000000000
2009/09/08 03:23:01 kernel:CS: 0010 DS: 002b ES: 002b CR0: 000000008005003b
2009/09/08 03:23:01 kernel:CR2: ffffffffffffada8 CR3: 000000012a5e8000 CR4: 00000000000006a0
2009/09/08 03:23:01 kernel:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2009/09/08 03:23:01 kernel:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2009/09/08 03:23:01 kernel:Process bonding (pid: 1175, threadinfo ffff8800ad9d8000, task ffff88013b798b50)
2009/09/08 03:23:01 kernel:Stack: 0000000000000001 0000000000000002 ffff8801299ad200 ffffffff80237390
2009/09/08 03:23:01 kernel:000000000000000b ffff88013b798b50 0000000000000000 0000000000000000
2009/09/08 03:23:01 kernel:0000000000000000 ffffffff80237a9b ffff88013df8e000 ffff880000000000
2009/09/08 03:23:01 kernel:Call Trace:
2009/09/08 03:23:01 kernel:[<ffffffff80237390>] ? put_files_struct+0x70/0xc0
2009/09/08 03:23:01 kernel:[<ffffffff80237a9b>] ? do_exit+0x17b/0x8b0
2009/09/08 03:23:01 kernel:[<ffffffff8069cf27>] ? oops_end+0x87/0x90
2009/09/08 03:23:01 kernel:[<ffffffff8069cab9>] ? error_exit+0x0/0x51
2009/09/08 03:23:01 kernel:[<ffffffff802961b8>] ? filp_close+0x18/0xa0
2009/09/08 03:23:01 kernel:[<ffffffff802962cc>] ? sys_close+0x8c/0xf0
2009/09/08 03:23:01 kernel:[<ffffffff80228282>] ? ia32_sysret+0x0/0xa
2009/09/08 03:23:01 kernel:
2009/09/08 03:23:01 kernel:
2009/09/08 03:23:01 kernel:Code: ff 66 90 89 f2 be 41 02 00 00 e9 f4 fd ff ff 66 66 66 90 48 83 ec 18 48 89 1c 24 48 89 6c 24 08 48 89 fb 4c 89 64 24 10 45 31 e4 <48> 83 7f 28 00 48 89 f5 74 50 48 8b 47 20 48 85 c0 75 35 48 89
2009/09/08 03:23:01 kernel:RSP <ffff8800ad9d9db8>
Code:
2009/09/08 04:54:02 kernel:CPU 4
2009/09/08 04:54:02 kernel:Pid: 14719, comm: check_cs Tainted: G D 2.6.27.10-oe64-00000-g9b2116f #12
2009/09/08 04:54:02 kernel:RIP: 0010:[<ffffffff802961b8>] [<ffffffff802961b8>] filp_close+0x18/0xa0
2009/09/08 04:54:02 kernel:RSP: 0000:ffff8800bdae7f48 EFLAGS: 00010246
2009/09/08 04:54:02 kernel:RAX: ffffffffffffffef RBX: 004000001a010045 RCX: 0000000000000004
2009/09/08 04:54:02 kernel:RDX: 0000000000000000 RSI: ffff88013d1243c0 RDI: 004000001a010045
2009/09/08 04:54:02 kernel:RBP: 0000000000000004 R08: 0000000000000004 R09: 00000000fffc9488
2009/09/08 04:54:02 kernel:R10: ffff8800bdae6000 R11: 0000000000000000 R12: 0000000000000000
2009/09/08 04:54:02 kernel:R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
2009/09/08 04:54:02 kernel:FS: 0000000000000000(0000) GS:ffff88013f8d49c0(0063) knlGS:00000000f7e306c0
2009/09/08 04:54:02 kernel:CS: 0010 DS: 002b ES: 002b CR0: 000000008005003b
2009/09/08 04:54:02 kernel:CR2: 00000000080905e0 CR3: 00000000bdba3000 CR4: 00000000000006a0
2009/09/08 04:54:02 kernel:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2009/09/08 04:54:02 kernel:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2009/09/08 04:54:02 kernel:Process check_cs (pid: 14719, threadinfo ffff8800bdae6000, task ffff88013d7b3390)
2009/09/08 04:54:02 kernel:Stack: ffff88013d1243c0 0000000000000004 004000001a010045 ffffffff802962cc
2009/09/08 04:54:02 kernel:0000000000000004 00000000fffc9488 0000000000000000 ffffffff80228282
2009/09/08 04:54:02 kernel:0000000000000000 0000000000000000 0000000000000000 0000000000000000
2009/09/08 04:54:02 kernel:Call Trace:
2009/09/08 04:54:02 kernel:[<ffffffff802962cc>] ? sys_close+0x8c/0xf0
2009/09/08 04:54:02 kernel:[<ffffffff80228282>] ? ia32_sysret+0x0/0xa
2009/09/08 04:54:02 kernel:
2009/09/08 04:54:02 kernel:
2009/09/08 04:54:02 kernel:Code: ff 66 90 89 f2 be 41 02 00 00 e9 f4 fd ff ff 66 66 66 90 48 83 ec 18 48 89 1c 24 48 89 6c 24 08 48 89 fb 4c 89 64 24 10 45 31 e4 <48> 83 7f 28 00 48 89 f5 74 50 48 8b 47 20 48 85 c0 75 35 48 89
2009/09/08 04:54:02 kernel:RSP <ffff8800bdae7f48>
2009/09/08 04:54:02 kernel:CPU 4
2009/09/08 04:54:02 kernel:Pid: 14719, comm: check_cs Tainted: G D 2.6.27.10-oe64-00000-g9b2116f #12
2009/09/08 04:54:02 kernel:RIP: 0010:[<ffffffff802961b8>] [<ffffffff802961b8>] filp_close+0x18/0xa0
2009/09/08 04:54:02 kernel:RSP: 0000:ffff8800bdae7db8 EFLAGS: 00010246
2009/09/08 04:54:02 kernel:RAX: ffff88013b352010 RBX: db5abe1715001600 RCX: 0000000000000032
2009/09/08 04:54:02 kernel:RDX: ffff88013b352000 RSI: ffff88013d1243c0 RDI: db5abe1715001600
2009/09/08 04:54:02 kernel:RBP: 0000000000000002 R08: ffff88013d8e8ff0 R09: 000013519eba9e10
2009/09/08 04:54:02 kernel:R10: ffff88002802baf0 R11: ffffffff803fde80 R12: 0000000000000000
2009/09/08 04:54:02 kernel:R13: ffff88013d1243c0 R14: 0000000000000001 R15: 0000000000000001
2009/09/08 04:54:02 kernel:FS: 0000000000000000(0000) GS:ffff88013f8d49c0(0000) knlGS:0000000000000000
2009/09/08 04:54:02 kernel:CS: 0010 DS: 002b ES: 002b CR0: 000000008005003b
2009/09/08 04:54:02 kernel:CR2: 00000000080905e0 CR3: 000000012ae23000 CR4: 00000000000006a0
2009/09/08 04:54:02 kernel:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
2009/09/08 04:54:02 kernel:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
2009/09/08 04:54:02 kernel:Process check_cs (pid: 14719, threadinfo ffff8800bdae6000, task ffff88013d7b3390)
2009/09/08 04:54:02 kernel:Stack: 0000000000000001 0000000000000002 ffff88013e0f4740 ffffffff80237390
2009/09/08 04:54:02 kernel:000000000000000b ffff88013d7b3390 0000000000000000 0000000000000000
2009/09/08 04:54:02 kernel:0000000000000000 ffffffff80237a9b ffff88013df8e000 ffff880000000000
2009/09/08 04:54:02 kernel:Call Trace:
2009/09/08 04:54:02 kernel:[<ffffffff80237390>] ? put_files_struct+0x70/0xc0
2009/09/08 04:54:02 kernel:[<ffffffff80237a9b>] ? do_exit+0x17b/0x8b0
2009/09/08 04:54:02 kernel:[<ffffffff8069cf27>] ? oops_end+0x87/0x90
2009/09/08 04:54:02 kernel:[<ffffffff8069cab9>] ? error_exit+0x0/0x51
2009/09/08 04:54:02 kernel:[<ffffffff802961b8>] ? filp_close+0x18/0xa0
2009/09/08 04:54:02 kernel:[<ffffffff802962cc>] ? sys_close+0x8c/0xf0
2009/09/08 04:54:02 kernel:[<ffffffff80228282>] ? ia32_sysret+0x0/0xa
2009/09/08 04:54:02 kernel:
2009/09/08 04:54:02 kernel:
2009/09/08 04:54:02 kernel:Code: ff 66 90 89 f2 be 41 02 00 00 e9 f4 fd ff ff 66 66 66 90 48 83 ec 18 48 89 1c 24 48 89 6c 24 08 48 89 fb 4c 89 64 24 10 45 31 e4 <48> 83 7f 28 00 48 89 f5 74 50 48 8b 47 20 48 85 c0 75 35 48 89
This is the primary machine, the second takes over the shares and the primary system hangs. We have to do a reset to get this machine online again.
Whereīs the problem?
Thanks