Aneurin Price
2014-03-14 13:43:38 UTC
Hello everyone,
I have an OI system which has today had four^W five panics in a row,
and I'm hoping somebody might be able to help me investigate what's
going on. The information I have so far is pasted at the end of this
mail. This machine was upgraded from OIa8 to a9 on Wednesday, and at
the same time I installed an L2ARC cache device. Since copying this
information I have removed the cache device, though, and the panic has
just happened again, so I *guess* the L2ARC is probably not the cause.
Next I'll try booting into the old BE, just in case.
Can anyone give me any pointers on how to debug this further, or what
information might be useful, bearing in mind that this system now only
runs for a few minutes at a time?
Thanks for your time,
Nye
--
[13:25:47]***@openindiana:~$ grep -i panic /var/adm/messages
Mar 14 12:24:32 openindiana ^Mpanic[cpu1]/thread=ffffff002e26bc40:
Mar 14 12:31:26 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002e26b640
addr=408 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 12:31:26 openindiana savecore: [ID 906182 auth.error] Panic
crashdump pending on dump device but dumpadm -n in effect; run
savecore(1M) manually to extract. Image UUID
6bc392ff-4b14-e595-d465-838f7e5ef5f3.
Mar 14 12:31:49 openindiana DESC: The system has rebooted after a
kernel panic. Refer to http://illumos.org/msg/SUNOS-8000-KL for more
information.
Mar 14 12:31:49 openindiana IMPACT: There may be some performance
impact while the panic is copied to the savecore directory. Disk
space usage by panics can be substantial.
Mar 14 12:31:49 openindiana Use 'fmdump -Vp -u
6bc392ff-4b14-e595-d465-838f7e5ef5f3' to view more panic detail.
Please refer to the knowledge article for additional information.
Mar 14 12:37:00 openindiana ^Mpanic[cpu6]/thread=ffffff002e131c40:
Mar 14 12:42:37 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002e130f00
addr=40 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 12:42:38 openindiana savecore: [ID 786448 auth.error] Panic
crashdump pending on dump device but dumpadm -n in effect; run
savecore(1M) manually to extract. Image UUID
80249880-02ff-64c2-9632-99797216da0a.
Mar 14 12:42:50 openindiana DESC: The system has rebooted after a
kernel panic. Refer to http://illumos.org/msg/SUNOS-8000-KL for more
information.
Mar 14 12:42:50 openindiana IMPACT: There may be some performance
impact while the panic is copied to the savecore directory. Disk
space usage by panics can be substantial.
Mar 14 12:42:50 openindiana Use 'fmdump -Vp -u
80249880-02ff-64c2-9632-99797216da0a' to view more panic detail.
Please refer to the knowledge article for additional information.
Mar 14 13:07:29 openindiana ^Mpanic[cpu7]/thread=ffffff002f67ec40:
Mar 14 13:13:17 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002f67df00
addr=40 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 13:13:18 openindiana savecore: [ID 738693 auth.error] Panic
crashdump pending on dump device but dumpadm -n in effect; run
savecore(1M) manually to extract. Image UUID
c9c783ba-268d-c503-b8e7-990cefd266ee.
Mar 14 13:13:31 openindiana DESC: The system has rebooted after a
kernel panic. Refer to http://illumos.org/msg/SUNOS-8000-KL for more
information.
Mar 14 13:13:31 openindiana IMPACT: There may be some performance
impact while the panic is copied to the savecore directory. Disk
space usage by panics can be substantial.
Mar 14 13:13:31 openindiana Use 'fmdump -Vp -u
c9c783ba-268d-c503-b8e7-990cefd266ee' to view more panic detail.
Please refer to the knowledge article for additional information.
Mar 14 13:18:15 openindiana ^Mpanic[cpu6]/thread=ffffff002e26bc40:
Mar 14 13:23:51 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002e26b640
addr=408 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 13:23:52 openindiana savecore: [ID 338254 auth.error] Panic
crashdump pending on dump device but dumpadm -n in effect; run
savecore(1M) manually to extract. Image UUID
c8a94bad-1e9a-4b36-ab47-fc5ac363ada8.
Mar 14 13:24:04 openindiana DESC: The system has rebooted after a
kernel panic. Refer to http://illumos.org/msg/SUNOS-8000-KL for more
information.
Mar 14 13:24:04 openindiana IMPACT: There may be some performance
impact while the panic is copied to the savecore directory. Disk
space usage by panics can be substantial.
Mar 14 13:24:04 openindiana Use 'fmdump -Vp -u
c8a94bad-1e9a-4b36-ab47-fc5ac363ada8' to view more panic detail.
Please refer to the knowledge article for additional information.
[13:25:59]***@openindiana:~$ fmdump -Vp -u c8a94bad-1e9a-4b36-ab47-fc5ac363ada8
TIME UUID SUNW-MSG-ID
Mar 14 2014 13:24:04.456339000 c8a94bad-1e9a-4b36-ab47-fc5ac363ada8
SUNOS-8000-KL
TIME CLASS ENA
Mar 14 13:23:52.9293 ireport.os.sunos.panic.dump_pending_on_device
0x0000000000000000
nvlist version: 0
version = 0x0
class = list.suspect
uuid = c8a94bad-1e9a-4b36-ab47-fc5ac363ada8
code = SUNOS-8000-KL
diag-time = 1394803444 221917
de = fmd:///module/software-diagnosis
fault-list-sz = 0x1
fault-list = (array of embedded nvlists)
(start fault-list[0])
nvlist version: 0
version = 0x0
class = defect.sunos.kernel.panic
certainty = 0x64
asru =
sw:///:path=/var/crash/openindiana/.c8a94bad-1e9a-4b36-ab47-fc5ac363ada8
resource =
sw:///:path=/var/crash/openindiana/.c8a94bad-1e9a-4b36-ab47-fc5ac363ada8
savecore-succcess = 0
os-instance-uuid = c8a94bad-1e9a-4b36-ab47-fc5ac363ada8
panicstr = BAD TRAP: type=e (#pf Page fault)
rp=ffffff002e26b640 addr=408 occurred in module "zfs" due to a NULL
pointer dereference
panicstack = unix:die+dd () | unix:trap+17db () |
unix:cmntrap+e6 () | zfs:dva_get_dsize_sync+5c () |
zfs:bp_get_dsize_sync+28 () | zfs:dsl_dataset_block_kill+4e () |
zfs:free_blocks+10d () | zfs:dnode_sync+21e () |
zfs:dmu_objset_sync_dnodes+80 () | zfs:dmu_objset_sync+1bd () |
zfs:dsl_dataset_sync+51 () | zfs:dsl_pool_sync+99 () |
zfs:spa_sync+373 () | zfs:txg_sync_thread+27b () | unix:thread_start+8
() |
crashtime = 1394803096
panic-time = 14 March 2014 13:18:16 GMT GMT
(end fault-list[0])
fault-status = 0x1
severity = Major
__ttl = 0x1
__tod = 0x532302f4 0x1b332e38
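
The savecore lines above show two different fault addresses (408 and 40), which is easy to miss in the wrapped output. As a rough sketch only, the script below tallies panics by module and fault address. The function name tally_panics and the SAMPLE text are made up for illustration; SAMPLE is just the four "reboot after panic" entries copied from the grep output above, and the regular expression assumes that exact message wording.

```python
import re
from collections import Counter

# Matches the savecore "reboot after panic" message as it appears in the
# grep output above. The wording is assumed from this one log only.
PANIC_RE = re.compile(
    r'reboot after panic: BAD TRAP: type=(\w+) .*? addr=([0-9a-f]+) '
    r'occurred in module "([^"]+)"'
)


def tally_panics(log_text):
    """Count panics per (module, fault address) pair.

    Whitespace is collapsed first so that entries wrapped across several
    lines (as in a pasted mail) still match.
    """
    flat = " ".join(log_text.split())
    return Counter((m.group(3), m.group(2)) for m in PANIC_RE.finditer(flat))


# Hypothetical sample input: the four entries from the mail above.
SAMPLE = """
Mar 14 12:31:26 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002e26b640
addr=408 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 12:42:37 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002e130f00
addr=40 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 13:13:17 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002f67df00
addr=40 occurred in module "zfs" due to a NULL pointer dereference
Mar 14 13:23:51 openindiana savecore: [ID 570001 auth.error] reboot
after panic: BAD TRAP: type=e (#pf Page fault) rp=ffffff002e26b640
addr=408 occurred in module "zfs" due to a NULL pointer dereference
"""

if __name__ == "__main__":
    for (module, addr), count in sorted(tally_panics(SAMPLE).items()):
        print(f"module={module} addr=0x{addr} count={count}")
```

On a live system the same function could be given the contents of /var/adm/messages instead of SAMPLE. It only summarises what is already in the log and does not replace extracting the dump with savecore(1M).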