Server crashes regularly...

patrickb
Posts: 9
Registered: 25.03.2009 12:47:28

Post by patrickb » 05.06.2009 10:47:03

If this is the wrong subforum, please move it accordingly.

I have a Sarge box here that regularly crashes at night. I had already narrowed it down to a nightly cron job, which I then disabled; now the problem is back, at irregular intervals. The crash ends in a dump screen (error messages scrolling past).

/var/log/messages says:

Jun  2 00:14:48 vader kernel: printk: 109438 messages suppressed.
Jun  2 00:14:48 vader kernel: oom-killer: gfp_mask=0x2d0, order=0
Jun  2 00:14:48 vader kernel:  <b013fdb1> out_of_memory+0x28/0x93  <b0140ce9> __alloc_pages+0x1e5/0x26a
Jun  2 00:14:48 vader kernel:  <b015268e> kmem_getpages+0x2f/0x7f  <b0153157> cache_grow+0xa7/0x137
Jun  2 00:14:48 vader kernel:  <b0153327> cache_alloc_refill+0x140/0x183  <b01534f4> kmem_cache_alloc+0x31/0x3a
Jun  2 00:14:48 vader kernel:  <f94aedaf> kmem_zone_alloc+0x46/0x8a [xfs]  <f94aedfd> kmem_zone_zalloc+0xa/0x39 [xfs]
Jun  2 00:14:48 vader kernel:  <f9490c3f> xfs_iread+0x33/0x1c5 [xfs]  <f947ca48> xfs_da_brelse+0x6f/0x91 [xfs]
Jun  2 00:14:48 vader kernel:  <f948eead> xfs_iget_core+0x1fa/0x53a [xfs]  <f94aedaf> kmem_zone_alloc+0x46/0x8a [xfs]
Jun  2 00:14:48 vader kernel:  <f948f295> xfs_iget+0xa8/0x130 [xfs]  <f94a6381> xfs_dir_lookup_int+0x53/0xaa [xfs]
Jun  2 00:14:48 vader kernel:  <f94aad42> xfs_lookup+0x48/0x71 [xfs]  <f94b49b7> xfs_vn_lookup+0x2b/0x60 [xfs]
Jun  2 00:14:48 vader kernel:  <b0162273> __lookup_hash+0x70/0x89  <b016354a> do_unlinkat+0x58/0xed
Jun  2 00:14:50 vader kernel:  <b0102b4b> syscall_call+0x7/0xb
Jun  2 00:14:50 vader kernel: oom-killer: gfp_mask=0xd0, order=0
Jun  2 00:14:50 vader kernel:  <b013fdb1> out_of_memory+0x28/0x93  <b0140ce9> __alloc_pages+0x1e5/0x26a
Jun  2 00:14:50 vader kernel:  <b015268e> kmem_getpages+0x2f/0x7f  <b0153157> cache_grow+0xa7/0x137
Jun  2 00:14:50 vader kernel:  <b0153327> cache_alloc_refill+0x140/0x183  <b01534f4> kmem_cache_alloc+0x31/0x3a
Jun  2 00:14:50 vader kernel:  <b015676e> get_empty_filp+0x42/0x108  <b0162137> __path_lookup_intent_open+0x13/0x6f
Jun  2 00:14:50 vader kernel:  <b01621a2> path_lookup_open+0xf/0x13  <b016285a> open_namei+0x86/0x58f
Jun  2 00:14:50 vader kernel:  <f94b6e62> xfs_fs_clear_inode+0x8a/0x9c [xfs]  <b01550d4> do_filp_open+0x1c/0x31
Jun  2 00:14:50 vader kernel:  <b0155214> get_unused_fd+0x53/0xa2  <b015531a> do_sys_open+0x3c/0xb1
Jun  2 00:14:50 vader kernel:  <b01553a5> sys_open+0x16/0x18  <b0102b4b> syscall_call+0x7/0xb
Jun  2 00:14:50 vader kernel: Mem-info:
Jun  2 00:14:50 vader kernel: DMA per-cpu:
Jun  2 00:14:50 vader kernel: cpu 0 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 0 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 1 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 1 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 2 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 2 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 3 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 3 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: DMA32 per-cpu: empty
Jun  2 00:14:50 vader kernel: Normal per-cpu:
Jun  2 00:14:50 vader kernel: cpu 0 hot: high 186, batch 31 used:43
Jun  2 00:14:50 vader kernel: cpu 0 cold: high 62, batch 15 used:48
Jun  2 00:14:50 vader kernel: cpu 1 hot: high 186, batch 31 used:18
Jun  2 00:14:50 vader kernel: cpu 1 cold: high 62, batch 15 used:56
Jun  2 00:14:50 vader kernel: cpu 2 hot: high 186, batch 31 used:166
Jun  2 00:14:50 vader kernel: cpu 2 cold: high 62, batch 15 used:57
Jun  2 00:14:50 vader kernel: cpu 3 hot: high 186, batch 31 used:22
Jun  2 00:14:50 vader kernel: cpu 3 cold: high 62, batch 15 used:11
Jun  2 00:14:50 vader kernel: HighMem per-cpu:
Jun  2 00:14:50 vader kernel: cpu 0 hot: high 186, batch 31 used:123
Jun  2 00:14:50 vader kernel: cpu 0 cold: high 62, batch 15 used:3
Jun  2 00:14:50 vader kernel: cpu 1 hot: high 186, batch 31 used:30
Jun  2 00:14:50 vader kernel: cpu 1 cold: high 62, batch 15 used:2
Jun  2 00:14:50 vader kernel: cpu 2 hot: high 186, batch 31 used:20
Jun  2 00:14:50 vader kernel: cpu 2 cold: high 62, batch 15 used:14
Jun  2 00:14:50 vader kernel: cpu 3 hot: high 186, batch 31 used:22
Jun  2 00:14:50 vader kernel: cpu 3 cold: high 62, batch 15 used:7
Jun  2 00:14:50 vader kernel: Free pages:     2013756kB (2004764kB HighMem)
Jun  2 00:14:50 vader kernel: Active:48752 inactive:5186 dirty:0 writeback:4 unstable:0 free:503439 slab:279122 mapped:46006 pagetables:302
Jun  2 00:14:50 vader kernel: DMA free:4604kB min:60kB low:72kB high:88kB active:8kB inactive:4kB present:16384kB pages_scanned:2 all_unreclaimable? yes
Jun  2 00:14:50 vader kernel: lowmem_reserve[]: 0 0 1136 4080
Jun  2 00:14:50 vader kernel: DMA32 free:0kB min:0kB low:0kB high:0kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no
Jun  2 00:14:50 vader kernel: lowmem_reserve[]: 0 0 1136 4080
Jun  2 00:14:50 vader kernel: Normal free:4388kB min:4280kB low:5348kB high:6420kB active:264kB inactive:132kB present:1163264kB pages_scanned:703 all_unrecl
Jun  2 00:14:50 vader kernel: lowmem_reserve[]: 0 0 0 23552
Jun  2 00:14:50 vader kernel: HighMem free:2004764kB min:512kB low:3284kB high:6060kB active:194736kB inactive:20608kB present:3014656kB pages_scanned:0 all_
Jun  2 00:14:50 vader kernel: lowmem_reserve[]: 0 0 0 0
Jun  2 00:14:50 vader kernel: DMA: 55*4kB 174*8kB 35*16kB 2*32kB 1*64kB 0*128kB 1*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 4604kB
Jun  2 00:14:50 vader kernel: DMA32: empty
Jun  2 00:14:50 vader kernel: Normal: 19*4kB 193*8kB 9*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 4388kB
Jun  2 00:14:50 vader kernel: HighMem: 3261*4kB 2725*8kB 7068*16kB 7792*32kB 6177*64kB 3812*128kB 1869*256kB 434*512kB 23*1024kB 0*2048kB 0*4096kB = 2004764k
Jun  2 00:14:50 vader kernel: Swap cache: add 1477, delete 1389, find 15/32, race 0+0
Jun  2 00:14:50 vader kernel: Free swap  = 1046804kB
Jun  2 00:14:50 vader kernel: Total swap = 1052248kB
Jun  2 00:14:50 vader kernel: Free swap:       1046804kB
Jun  2 00:14:50 vader kernel: 1048576 pages of RAM
Jun  2 00:14:50 vader kernel: 753664 pages of HIGHMEM
Jun  2 00:14:50 vader kernel: 206130 reserved pages
Jun  2 00:14:50 vader kernel: 30459 pages shared
Jun  2 00:14:50 vader kernel: 88 pages swap cached
Jun  2 00:14:50 vader kernel: 0 pages dirty
Jun  2 00:14:50 vader kernel: 4 pages writeback
Jun  2 00:14:50 vader kernel: 46006 pages mapped
Jun  2 00:14:50 vader kernel: 279122 pages slab
Jun  2 00:14:50 vader kernel: 302 pages pagetables
Jun  2 00:14:50 vader kernel: Mem-info:
Jun  2 00:14:50 vader kernel: DMA per-cpu:
Jun  2 00:14:50 vader kernel: cpu 0 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 0 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 1 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 1 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 2 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 2 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 3 hot: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: cpu 3 cold: high 0, batch 1 used:0
Jun  2 00:14:50 vader kernel: DMA32 per-cpu: empty
Jun  2 00:14:50 vader kernel: Normal per-cpu:
Jun  2 00:14:50 vader kernel: cpu 0 hot: high 186, batch 31 used:43
Jun  2 00:14:50 vader kernel: cpu 0 cold: high 62, batch 15 used:48
Jun  2 00:14:50 vader kernel: cpu 1 hot: high 186, batch 31 used:27
Jun  2 00:14:50 vader kernel: cpu 1 cold: high 62, batch 15 used:56
Jun  2 00:14:50 vader kernel: cpu 2 hot: high 186, batch 31 used:166
Jun  2 00:14:50 vader kernel: cpu 2 cold: high 62, batch 15 used:57
Jun  2 00:14:50 vader kernel: cpu 3 hot: high 186, batch 31 used:22
Jun  2 00:14:50 vader kernel: cpu 3 cold: high 62, batch 15 used:11
Jun  2 00:14:50 vader kernel: HighMem per-cpu:
Jun  2 00:14:50 vader kernel: cpu 0 hot: high 186, batch 31 used:123
Jun  2 00:14:50 vader kernel: cpu 0 cold: high 62, batch 15 used:3
Jun  2 00:14:50 vader kernel: cpu 1 hot: high 186, batch 31 used:180
Jun  2 00:14:50 vader kernel: cpu 1 cold: high 62, batch 15 used:2
Jun  2 00:14:50 vader kernel: cpu 2 hot: high 186, batch 31 used:20
Jun  2 00:14:50 vader kernel: cpu 2 cold: high 62, batch 15 used:14
Jun  2 00:14:50 vader kernel: cpu 3 hot: high 186, batch 31 used:22
Jun  2 00:14:50 vader kernel: cpu 3 cold: high 62, batch 15 used:7
Jun  2 00:14:50 vader kernel: Free pages:     2048104kB (2039112kB HighMem)
Jun  2 00:14:50 vader kernel: Active:39992 inactive:5217 dirty:8 writeback:0 unstable:0 free:512026 slab:279120 mapped:37131 pagetables:282
Jun  2 00:14:51 vader kernel: DMA free:4612kB min:60kB low:72kB high:88kB active:0kB inactive:4kB present:16384kB pages_scanned:29 all_unreclaimable? yes
Jun  2 00:14:51 vader kernel: lowmem_reserve[]: 0 0 1136 4080
Jun  2 00:14:51 vader kernel: DMA32 free:0kB min:0kB low:0kB high:0kB active:0kB inactive:0kB present:0kB pages_scanned:0 all_unreclaimable? no
Jun  2 00:14:51 vader kernel: lowmem_reserve[]: 0 0 1136 4080
Jun  2 00:14:51 vader kernel: Normal free:4380kB min:4280kB low:5348kB high:6420kB active:180kB inactive:256kB present:1163264kB pages_scanned:767 all_unrecl
Jun  2 00:14:51 vader kernel: lowmem_reserve[]: 0 0 0 23552
Jun  2 00:14:51 vader kernel: HighMem free:2039112kB min:512kB low:3284kB high:6060kB active:159796kB inactive:20608kB present:3014656kB pages_scanned:0 all_
Jun  2 00:14:51 vader kernel: lowmem_reserve[]: 0 0 0 0
Jun  2 00:14:51 vader kernel: DMA: 57*4kB 174*8kB 35*16kB 2*32kB 1*64kB 0*128kB 1*256kB 0*512kB 0*1024kB 1*2048kB 0*4096kB = 4612kB
Jun  2 00:14:51 vader kernel: DMA32: empty
Jun  2 00:14:51 vader kernel: Normal: 19*4kB 192*8kB 9*16kB 0*32kB 1*64kB 0*128kB 0*256kB 1*512kB 0*1024kB 1*2048kB 0*4096kB = 4380kB
Jun  2 00:14:51 vader kernel: HighMem: 4172*4kB 2951*8kB 7150*16kB 7810*32kB 6269*64kB 3889*128kB 1891*256kB 439*512kB 26*1024kB 0*2048kB 0*4096kB = 2039112k
Jun  2 00:14:51 vader kernel: Swap cache: add 1477, delete 1461, find 15/32, race 0+0
Jun  2 00:14:51 vader kernel: Free swap  = 1047148kB
Jun  2 00:14:51 vader kernel: Total swap = 1052248kB
Jun  2 00:14:51 vader kernel: Free swap:       1047148kB
Jun  2 00:14:51 vader kernel: 1048576 pages of RAM
Jun  2 00:14:51 vader kernel: 753664 pages of HIGHMEM
Jun  2 00:14:51 vader kernel: 206130 reserved pages
Jun  2 00:14:51 vader kernel: 30043 pages shared
Jun  2 00:14:51 vader kernel: 16 pages swap cached
Jun  2 00:14:51 vader kernel: 10 pages dirty
Jun  2 00:14:51 vader kernel: 0 pages writeback
Jun  2 00:14:51 vader kernel: 37131 pages mapped
Jun  2 00:14:51 vader kernel: 279120 pages slab
Jun  2 00:14:51 vader kernel: 282 pages pagetables
Does anyone have an idea?
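
When reading the dump above, it can help to convert the page counts to kB. What follows is only a minimal sketch, assuming 4 kB pages (which would be consistent with the `1048576 pages of RAM` line). The helper name `pages_to_kb` is made up for illustration; the numbers are copied from the first Mem-info block.

```shell
#!/bin/sh
# Assumption: 4 kB pages. This is not stated in the log itself.
PAGE_KB=4

# Hypothetical helper: convert a page count from the dump to kB.
pages_to_kb() {
    echo $(( $1 * PAGE_KB ))
}

# Values copied from the first Mem-info block in the log.
slab_kb=$(pages_to_kb 279122)      # "279122 pages slab"
normal_present_kb=1163264          # "Normal ... present:1163264kB"

echo "slab: ${slab_kb} kB"
echo "Normal zone present: ${normal_present_kb} kB"
echo "slab as share of Normal zone: $(( slab_kb * 100 / normal_present_kb ))%"
```

Under that assumption the slab comes to 1116488 kB, about 95% of the size of the Normal zone, while HighMem still reports roughly 2 GB free. If slab memory is taken from low memory, as is usual on 32-bit kernels with HighMem, this would fit an OOM kill despite plenty of free RAM. Nothing in the thread confirms that reading, though.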

Danielx
Posts: 6419
Registered: 14.08.2003 17:52:23

Re: Server crashes regularly...

Post by Danielx » 05.06.2009 11:50:20

OOM = Out of Memory
The OOM killer doesn't normally run otherwise.
Could that be what is happening on your machine?

Regards,
Daniel
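
To see how often the OOM killer actually fired, the log can be searched for the `oom-killer:` lines quoted in the first post. This is a rough sketch; the function names are made up for illustration, and because the log itself reports `messages suppressed`, any count is probably a lower bound.

```shell
#!/bin/sh
# Hypothetical helper: print how many oom-killer invocations a log file contains.
count_oom_events() {
    # grep -c prints the number of matching lines (0 if there are none).
    grep -c 'oom-killer:' "$1"
}

# Hypothetical helper: print the timestamp (first three syslog fields)
# of every oom-killer invocation.
list_oom_times() {
    awk '/oom-killer:/ { print $1, $2, $3 }' "$1"
}

# Usage, with the path quoted in the thread (adjust if your syslog writes elsewhere):
#   count_oom_events /var/log/messages
#   list_oom_times /var/log/messages
```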

patrickb
Posts: 9
Registered: 25.03.2009 12:47:28

Re: Server crashes regularly...

Post by patrickb » 05.06.2009 12:15:13

No, that's just it: I upgraded to 4 GB specifically because of this, without success.

For now I'm running this job every night:

while true; do w | grep load | logger; free -l -t | logger; echo "" | logger; sleep 5; done

Let's see what that shows. The Nagios instance running in the background doesn't log anything unusual either, though: only about 40%, then bang.
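
A possible variant of the loop above also records slab usage, which `free` does not show. This is only a sketch: it assumes the kernel exposes `LowFree`, `HighFree` and `Slab` in `/proc/meminfo` (the first two normally appear only on HighMem kernels), and the function names and the `memwatch` tag are invented for this example.

```shell
#!/bin/sh
# Hypothetical helper: print selected fields of a /proc/meminfo-style file
# on one line. Fields the kernel does not provide are simply absent.
meminfo_summary() {
    awk '/^(LowFree|HighFree|Slab):/ { printf "%s%s%s%s", sep, $1, $2, $3; sep = " " }
         END { print "" }' "${1:-/proc/meminfo}"
}

# Hypothetical loop: write one summary line to syslog every 5 seconds.
# Defined here but not started; call watch_mem to run it.
watch_mem() {
    while true; do
        meminfo_summary | logger -t memwatch
        sleep 5
    done
}
```

Keeping each sample on a single line should make it easier to grep the last entries before a crash out of the syslog.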
