[gpfsug-discuss] Hanging file-systems

Simon Thompson S.J.Thompson at bham.ac.uk
Tue Nov 27 20:02:14 GMT 2018


Yes, but we’d upgraded all out HPC client nodes to 5.0.2-1 last week as well when this first happened …

Unless it’s necessary to upgrade the NSD servers as well for this?

Simon

From: <gpfsug-discuss-bounces at spectrumscale.org> on behalf of "TOMP at il.ibm.com" <TOMP at il.ibm.com>
Reply-To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Date: Tuesday, 27 November 2018 at 19:48
To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Subject: Re: [gpfsug-discuss] Hanging file-systems

"paging to disk" sometimes means mmap as well - there were several issues around that recently as well.


Regards,

Tomer Perry
Scalable I/O Development (Spectrum Scale)
email: tomp at il.ibm.com
1 Azrieli Center, Tel Aviv 67021, Israel
Global Tel:    +1 720 3422758
Israel Tel:      +972 3 9188625
Mobile:         +972 52 2554625




From:        Skylar Thompson <skylar2 at uw.edu>
To:        gpfsug-discuss at spectrumscale.org
Date:        27/11/2018 20:28
Subject:        Re: [gpfsug-discuss] Hanging file-systems
Sent by:        gpfsug-discuss-bounces at spectrumscale.org
________________________________



Despite its name, kswapd isn't directly involved in paging to disk; it's
the kernel process that's involved in finding committed memory that can be
reclaimed for use (either immediately, or possibly by flushing dirty pages
to disk). If kswapd is using a lot of CPU, it's a sign that the kernel is
spending a lot of time to find free pages to allocate to processes.

On Tue, Nov 27, 2018 at 05:53:58PM +0000, Simon Thompson wrote:
> Thanks Sven ???
>
> We found a node with kswapd running 100% (and swap was off)???
>
> Killing that node made access to the FS spring into life.
>
> Simon
>
> From: <gpfsug-discuss-bounces at spectrumscale.org> on behalf of "oehmes at gmail.com" <oehmes at gmail.com>
> Reply-To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
> Date: Tuesday, 27 November 2018 at 16:14
> To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
> Subject: Re: [gpfsug-discuss] Hanging file-systems
>
> 1. are you under memory pressure or even worse started swapping .

> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=mLPyKeOa1gNDrORvEXBgMw&m=sgaWNOJnHka2HBtMNNXBur4p2KbQ8q786tWza40tcLQ&s=CWkCUHu4-uwZQ6r1x_VFAGqQ5FFSBGXMSVa5t2pk424&e=


--
-- Skylar Thompson (skylar2 at u.washington.edu)
-- Genome Sciences Department, System Administrator
-- Foege Building S046, (206)-685-7354
-- University of Washington School of Medicine
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
https://urldefense.proofpoint.com/v2/url?u=http-3A__gpfsug.org_mailman_listinfo_gpfsug-2Ddiscuss&d=DwICAg&c=jf_iaSHvJObTbx-siA1ZOg&r=mLPyKeOa1gNDrORvEXBgMw&m=sgaWNOJnHka2HBtMNNXBur4p2KbQ8q786tWza40tcLQ&s=CWkCUHu4-uwZQ6r1x_VFAGqQ5FFSBGXMSVa5t2pk424&e=




-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20181127/fdc9de07/attachment.htm>


More information about the gpfsug-discuss mailing list