[gpfsug-discuss] WG: Spectrum Scale with RHEL7.6 kernel 3.10.0-957.21.2

KG spectrumscale at kiranghag.com
Thu Jun 13 17:55:07 BST 2019


Hi

As per the flash -
https://www-01.ibm.com/support/docview.wss?uid=ibm10887213&myns=s033&mynp=OCSTXKQY&mync=E&cm_sp=s033-_-OCSTXKQY-_-E
this bug doesnt appear if SELinux is disabled.

If customer is willing to disable SELinux, will it be ok to upgrade (or
stay on upgraded level and avoid downgrade)?

On Tue, Jun 11, 2019 at 9:24 PM Felipe Knop <knop at us.ibm.com> wrote:

> Renar,
>
> With the change below, which is a retrofit of a change deployed in newer
> kernels, an inconsistency has taken place between the GPFS kernel
> portability layer and the kernel proper. A known result of that
> inconsistency is a kernel crash. One known sequence leading to the crash
> involves the mkdir() call.
>
> We are working on an official notification on the issue.
>
> Felipe
>
> ----
> Felipe Knop knop at us.ibm.com
> GPFS Development and Security
> IBM Systems
> IBM Building 008
> 2455 South Rd, Poughkeepsie, NY 12601
> (845) 433-9314 T/L 293-9314
>
>
>
> [image: Inactive hide details for "Grunenberg, Renar" ---06/11/2019
> 08:28:07 AM---Hallo Felipe, can you explain is this a generic Probl]"Grunenberg,
> Renar" ---06/11/2019 08:28:07 AM---Hallo Felipe, can you explain is this a
> generic Problem in rhel or only a scale related. Are there a
>
> From: "Grunenberg, Renar" <Renar.Grunenberg at huk-coburg.de>
> To: gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
> Date: 06/11/2019 08:28 AM
> Subject: [EXTERNAL] Re: [gpfsug-discuss] WG: Spectrum Scale with RHEL7.6
> kernel 3.10.0-957.21.2
> Sent by: gpfsug-discuss-bounces at spectrumscale.org
> ------------------------------
>
>
>
> Hallo Felipe,
> can you explain is this a generic Problem in rhel or only a scale related.
> Are there any cicumstance already available? We ask redhat but have no
> points that this are know to them?
>
> Regards Renar
>
> Renar Grunenberg
> Abteilung Informatik - Betrieb
>
> HUK-COBURG
> Bahnhofsplatz
> 96444 Coburg
> Telefon: 09561 96-44110
> Telefax: 09561 96-44104
> E-Mail: Renar.Grunenberg at huk-coburg.de
> Internet: www.huk.de
>
> ------------------------------
> HUK-COBURG Haftpflicht-Unterstützungs-Kasse kraftfahrender Beamter
> Deutschlands a. G. in Coburg
> Reg.-Gericht Coburg HRB 100; St.-Nr. 9212/101/00021
> Sitz der Gesellschaft: Bahnhofsplatz, 96444 Coburg
> Vorsitzender des Aufsichtsrats: Prof. Dr. Heinrich R. Schradin.
> Vorstand: Klaus-Jürgen Heitmann (Sprecher), Stefan Gronbach, Dr. Hans Olav
> Herøy, Dr. Jörg Rheinländer (stv.), Sarah Rössler, Daniel Thomas.
> ------------------------------
> Diese Nachricht enthält vertrauliche und/oder rechtlich geschützte
> Informationen.
> Wenn Sie nicht der richtige Adressat sind oder diese Nachricht irrtümlich
> erhalten haben,
> informieren Sie bitte sofort den Absender und vernichten Sie diese
> Nachricht.
> Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Nachricht
> ist nicht gestattet.
>
> This information may contain confidential and/or privileged information.
> If you are not the intended recipient (or have received this information
> in error) please notify the
> sender immediately and destroy this information.
> Any unauthorized copying, disclosure or distribution of the material in
> this information is strictly forbidden.
> ------------------------------
>
> *Von:* gpfsug-discuss-bounces at spectrumscale.org <
> gpfsug-discuss-bounces at spectrumscale.org> *Im Auftrag von *Felipe Knop
> *Gesendet:* Montag, 10. Juni 2019 15:43
> *An:* gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
> *Betreff:* Re: [gpfsug-discuss] WG: Spectrum Scale with RHEL7.6 kernel
> 3.10.0-957.21.2
>
> Renar,
>
> Thanks. Of the changes below, it appears that
>
> * security: double-free attempted in security_inode_init_security()
> (BZ#1702286)
>
> was the one that ended up triggering the problem. Our investigations now
> show that RHEL kernels >= 3.10.0-957.19.1 are impacted.
>
>
> Felipe
>
> ----
> Felipe Knop *knop at us.ibm.com* <knop at us.ibm.com>
> GPFS Development and Security
> IBM Systems
> IBM Building 008
> 2455 South Rd, Poughkeepsie, NY 12601
> (845) 433-9314 T/L 293-9314
>
>
>
> [image: Inactive hide details for "Grunenberg, Renar" ---06/10/2019
> 08:43:27 AM---Hallo Felipe, here are the change list:]"Grunenberg, Renar"
> ---06/10/2019 08:43:27 AM---Hallo Felipe, here are the change list:
>
> From: "Grunenberg, Renar" <*Renar.Grunenberg at huk-coburg.de*
> <Renar.Grunenberg at huk-coburg.de>>
> To: "'gpfsug-discuss at spectrumscale.org'" <
> *gpfsug-discuss at spectrumscale.org* <gpfsug-discuss at spectrumscale.org>>
> Date: 06/10/2019 08:43 AM
> Subject: [EXTERNAL] [gpfsug-discuss] WG: Spectrum Scale with RHEL7.6
> kernel 3.10.0-957.21.2
> Sent by: *gpfsug-discuss-bounces at spectrumscale.org*
> <gpfsug-discuss-bounces at spectrumscale.org>
> ------------------------------
>
>
>
>
> Hallo Felipe,
>
> here are the change list:
> RHBA-2019:1337 kernel bug fix update
>
>
> Summary:
>
> Updated kernel packages that fix various bugs are now available for Red
> Hat Enterprise Linux 7.
>
> The kernel packages contain the Linux kernel, the core of any Linux
> operating system.
>
> This update fixes the following bugs:
>
> * Mellanox CX-5 MAC learning with OVS H/W offload not working (BZ#1686292)
>
> * RHEL7.4 NFS4.1 client and server repeated SEQUENCE / TEST_STATEIDs with
> SEQUENCE Reply has SEQ4_STATUS_RECALLABLE_STATE_REVOKED set - NFS server
> should return NFS4ERR_DELEG_REVOKED or NFS4ERR_BAD_STATEID for revoked
> delegations (BZ#1689811)
>
> * PANIC: "BUG: unable to handle kernel paging request" in the mtip32xx
> mtip_init_cmd_header routine (BZ#1689929)
>
> * The nvme cli delete-ns command hangs indefinitely. (BZ#1690519)
>
> * drm/nouveau: nv50 - Graphics become sluggish or frozen for nvidia Pascal
> cards (Regression from 1584963) - Need to flush fb writes when rewinding
> push buffer (BZ#1690761)
>
> * [CEE/SD] Ceph+NFS server crashed and rebooted due to CephFS kernel
> client issue (BZ#1692266)
>
> * [Mellanox OVS offload] tc fails to calculate the checksum in case vlan
> trunk and header rewrite (BZ#1693110)
>
> * aio O_DIRECT writes to non-page-aligned file locations on ext4 can
> result in the overlapped portion of the page containing zeros (BZ#1693561)
>
> * [HP WS 7.6 bug] Audio driver does not recognize multi function audio
> jack microphone input (BZ#1693562)
>
> * XFS returns ENOSPC when using extent size hint with space still
> available (BZ#1693796)
>
> * OVN requires IPv6 to be enabled (BZ#1694981)
>
> * breaks DMA API for non-GPL drivers (BZ#1695511)
>
> * ovl_create can return positive retval and crash the host (BZ#1696292)
>
> * ceph: append mode is broken for sync/direct write (BZ#1696595)
>
> * Problem building module due to -EXPORT_SYMBOL_GPL/-EXPORT_SYMBOL
> (BZ#1697241)
>
> * Failed to load kpatch module after install the rpm package occasionally
> on ppc64le (BZ#1697867)
>
> * [Hyper-V][RHEL7] Stop suppressing PCID bit (BZ#1697940)
>
> * Resizing an online EXT4 filesystem on a loopback device hangs
> (BZ#1698110)
>
> * dm table: propagate BDI_CAP_STABLE_WRITES (BZ#1699722)
>
> * [ESXi][RHEL7.6]After upgrade to kernel-3.10.0-957.el7, system is unable
> to discover newly added VMware LSI Logic SAS virtual disks without a
> reboot. (BZ#1699723)
>
> * kernel: zcrypt: fix specification exception on z196 at ap probe
> (BZ#1700706)
>
> * XFS: Metadata corruption detected at xfs_attr3_leaf_write_verify()
> (BZ#1701293)
>
> * stime showed huge values related to wrong calculation of time deltas
> (L3:) (BZ#1701743)
>
> * Kernel panic due to NULL pointer dereference at
> sysfs_do_create_link_sd.isra.2+0x34 while loading [ipmi_si] module using
> hard-coded device (BZ#1701991)
>
> * IPv6 ECMP modulo N hashing inefficient when X^2 rt6i_nsiblings
> (BZ#1702282)
>
> * security: double-free attempted in security_inode_init_security()
> (BZ#1702286)
>
> * Missing wakeup leaves task stuck waiting in blk_queue_enter()
> (BZ#1702921)
>
> * Satellite Capsule sync triggers several XFS corruptions (BZ#1702922)
>
> * BUG: SELinux doesn't handle NFS crossmnt well (BZ#1702923)
>
> * md_clear flag missing from /proc/cpuinfo on late microcode update
> (BZ#1712993)
>
> * MDS mitigations are not enabled after double microcode update
> (BZ#1712998)
>
> * WARNING: CPU: 0 PID: 0 at kernel/jump_label.c:90
> __static_key_slow_dec+0xa6/0xb0 (BZ#1713004)
>
> Users of kernel are advised to upgrade to these updated packages, which
> fix these bugs.
>
> Full details and references:
>
> *https://access.redhat.com/errata/RHBA-2019:1337?sc_cid=701600000006NHXAA2*
> <https://access.redhat.com/errata/RHBA-2019:1337?sc_cid=701600000006NHXAA2>
>
> Revision History:
>
> Issue Date: 2019-06-04
> Updated: 2019-06-04
>
> Regards Renar
>
> Renar Grunenberg
> Abteilung Informatik - Betrieb
>
> HUK-COBURG
> Bahnhofsplatz
> 96444 Coburg
>
> Telefon: 09561 96-44110
> Telefax: 09561 96-44104
> E-Mail: *Renar.Grunenberg at huk-coburg.de* <Renar.Grunenberg at huk-coburg.de>
> Internet: *www.huk.de* <http://www.huk.de>
>
> ------------------------------
>
> HUK-COBURG Haftpflicht-Unterstützungs-Kasse kraftfahrender Beamter
> Deutschlands a. G. in Coburg
> Reg.-Gericht Coburg HRB 100; St.-Nr. 9212/101/00021
> Sitz der Gesellschaft: Bahnhofsplatz, 96444 Coburg
> Vorsitzender des Aufsichtsrats: Prof. Dr. Heinrich R. Schradin.
> Vorstand: Klaus-Jürgen Heitmann (Sprecher), Stefan Gronbach, Dr. Hans Olav
> Herøy, Dr. Jörg Rheinländer (stv.), Sarah Rössler, Daniel Thomas.
> ------------------------------
>
> Diese Nachricht enthält vertrauliche und/oder rechtlich geschützte
> Informationen.
> Wenn Sie nicht der richtige Adressat sind oder diese Nachricht irrtümlich
> erhalten haben,
> informieren Sie bitte sofort den Absender und vernichten Sie diese
> Nachricht.
> Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Nachricht
> ist nicht gestattet.
>
> This information may contain confidential and/or privileged information.
> If you are not the intended recipient (or have received this information
> in error) please notify the
> sender immediately and destroy this information.
> Any unauthorized copying, disclosure or distribution of the material in
> this information is strictly forbidden.
> ------------------------------
>
>
> *Von:* *gpfsug-discuss-bounces at spectrumscale.org*
> <gpfsug-discuss-bounces at spectrumscale.org> [
> *mailto:gpfsug-discuss-bounces at spectrumscale.org*
> <gpfsug-discuss-bounces at spectrumscale.org>] *Im Auftrag von *Felipe Knop
> *Gesendet:* Montag, 10. Juni 2019 06:41
> *An:* gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org*
> <gpfsug-discuss at spectrumscale.org>>
> *Betreff:* Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6 kernel
> 3.10.0-957.21.2
>
> Hi,
>
> Though we are still learning what workload results in the problem, it
> appears that even minimal I/O on the file system may cause the OS to crash.
> One pattern that we saw was 'mkdir'. There is a chance that the DR site was
> not yet impacted because no I/O workload has been run there. In that case,
> rolling back to the prior kernel level (one which has been tested before)
> may be advisable.
>
> Felipe
>
> ----
> Felipe Knop *knop at us.ibm.com* <knop at us.ibm.com>
> GPFS Development and Security
> IBM Systems
> IBM Building 008
> 2455 South Rd, Poughkeepsie, NY 12601
> (845) 433-9314 T/L 293-9314
>
>
>
> [image: Inactive hide details for KG ---06/09/2019 09:38:55 AM---One of my
> customer already upgraded their DR site. Is rollback advised]KG
> ---06/09/2019 09:38:55 AM---One of my customer already upgraded their DR
> site. Is rollback advised? They will be running from DR
>
> From: KG <*spectrumscale at kiranghag.com* <spectrumscale at kiranghag.com>>
> To: gpfsug main discussion list <*gpfsug-discuss at spectrumscale.org*
> <gpfsug-discuss at spectrumscale.org>>
> Date: 06/09/2019 09:38 AM
> Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with RHEL7.6
> kernel 3.10.0-957.21.2
> Sent by: *gpfsug-discuss-bounces at spectrumscale.org*
> <gpfsug-discuss-bounces at spectrumscale.org>
> ------------------------------
>
>
>
>
>
> One of my customer already upgraded their DR site.
>
> Is rollback advised? They will be running from DR site for a day in
> another week.
>
> On Sat, Jun 8, 2019, 03:37 Felipe Knop <*knop at us.ibm.com*
> <knop at us.ibm.com>> wrote:
>
>    Zach,
>
>             This appears to be affecting all Scale versions, including
>             5.0.2 -- but only when moving to the new 3.10.0-957.21.2 kernel.
>             (3.10.0-957 is not impacted)
>
>             Felipe
>
>             ----
>             Felipe Knop *knop at us.ibm.com* <knop at us.ibm.com>
>             GPFS Development and Security
>             IBM Systems
>             IBM Building 008
>             2455 South Rd, Poughkeepsie, NY 12601
>             (845) 433-9314 T/L 293-9314
>
>
>
>             Zachary Mance ---06/07/2019 05:51:37 PM---Which versions of
>             Spectrum Scale versions are you referring to? 5.0.2-3?
>             ---------------------------
>
>             From: Zachary Mance <*zmance at ucar.edu* <zmance at ucar.edu>>
>             To: gpfsug main discussion list <
>             *gpfsug-discuss at spectrumscale.org*
>             <gpfsug-discuss at spectrumscale.org>>
>             Date: 06/07/2019 05:51 PM
>             Subject: [EXTERNAL] Re: [gpfsug-discuss] Spectrum Scale with
>             RHEL7.6 kernel 3.10.0-957.21.2
>             Sent by: *gpfsug-discuss-bounces at spectrumscale.org*
>             <gpfsug-discuss-bounces at spectrumscale.org>
>             ------------------------------
>
>
>
>
>
>             Which versions of Spectrum Scale versions are you referring
>             to? 5.0.2-3?
>
>             ---------------------------------------------------------------------------------------------------------------
>             Zach Mance *zmance at ucar.edu* <zmance at ucar.edu> (303) 497-1883
>             HPC Data Infrastructure Group / CISL / NCAR
>             ---------------------------------------------------------------------------------------------------------------
>
>
>
>             On Fri, Jun 7, 2019 at 3:45 PM Felipe Knop <*knop at us.ibm.com*
>             <knop at us.ibm.com>> wrote:
>                All,
>
>                                     There have been reported issues
>                                     (including kernel crashes) on Spectrum Scale with the latest RHEL7.6 kernel
>                                     3.10.0-957.21.2. Please consider delaying upgrades to this kernel until
>                                     further information is provided.
>
>                                     Thanks,
>
>                                     Felipe
>
>                                     ----
>                                     Felipe Knop *knop at us.ibm.com*
>                                     <knop at us.ibm.com>
>                                     GPFS Development and Security
>                                     IBM Systems
>                                     IBM Building 008
>                                     2455 South Rd, Poughkeepsie, NY 12601
>                                     (845) 433-9314 T/L 293-9314
>
>
>
>                                     _______________________________________________
>                                     gpfsug-discuss mailing list
>                                     gpfsug-discuss at *spectrumscale.org*
>                                     <http://spectrumscale.org>
> *http://gpfsug.org/mailman/listinfo/gpfsug-discuss*
>                                     <http://gpfsug.org/mailman/listinfo/gpfsug-discuss>
>                                     _______________________________________________
>                                     gpfsug-discuss mailing list
>                                     gpfsug-discuss at *spectrumscale.org*
>                                     <http://spectrumscale.org>
> *http://gpfsug.org/mailman/listinfo/gpfsug-discuss*
>                                     <http://gpfsug.org/mailman/listinfo/gpfsug-discuss>
>
>             _______________________________________________
>             gpfsug-discuss mailing list
>             gpfsug-discuss at *spectrumscale.org*
>             <http://spectrumscale.org>
> *http://gpfsug.org/mailman/listinfo/gpfsug-discuss*
>             <http://gpfsug.org/mailman/listinfo/gpfsug-discuss>*[attachment
>             "graycol.gif" deleted by Felipe Knop/Poughkeepsie/IBM] *
>             _______________________________________________
>             gpfsug-discuss mailing list
>             gpfsug-discuss at spectrumscale.org
> *http://gpfsug.org/mailman/listinfo/gpfsug-discuss*
>             <http://gpfsug.org/mailman/listinfo/gpfsug-discuss>
>
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> *http://gpfsug.org/mailman/listinfo/gpfsug-discuss*
> <http://gpfsug.org/mailman/listinfo/gpfsug-discuss>
>
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss
>
>
>
> _______________________________________________
> gpfsug-discuss mailing list
> gpfsug-discuss at spectrumscale.org
> http://gpfsug.org/mailman/listinfo/gpfsug-discuss
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20190613/47a7b102/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: graycol.gif
Type: image/gif
Size: 105 bytes
Desc: not available
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20190613/47a7b102/attachment.gif>


More information about the gpfsug-discuss mailing list