[gpfsug-discuss] CES ON RHEL7.3

Sobey, Richard A r.sobey at imperial.ac.uk
Wed Dec 7 09:13:23 GMT 2016


I admit I didn’t do a whole lot of troubleshooting. We don’t run NFS so I can’t speak about that.

Initially the server looked like it came back ok, albeit “Node starting up..” was observed in the output of mmlscluster –ces. At that time I was not sure if that was a) expected behaviour and/or b) related to GPFS 4.2.1-2.

Once the node went back into service I had no complaints from customers that they faced any connectivity issues. The next morning I shut down a second CES node in order to upgrade it, but I observed that the first once went into a failed state (might have been a nasty coincidence!):

[root at icgpfs-ces1 yum.repos.d]# mmces state show -a
NODE                                     AUTH          AUTH_OBJ      NETWORK       NFS           OBJ           SMB           CES
icgpfs-ces1                              FAILED        DISABLED      HEALTHY       DISABLED      DISABLED      DEPEND        STARTING
icgpfs-ces2                              DEPEND        DISABLED      SUSPENDED     DEPEND        DEPEND        DEPEND        DEPEND
icgpfs-ces3                              HEALTHY       DISABLED      HEALTHY       DISABLED      DISABLED      HEALTHY       HEALTHY
icgpfs-ces4                              HEALTHY       DISABLED      HEALTHY       DISABLED      DISABLED      HEALTHY       HEALTHY

(Where ICGPFS-CES1 was the node running 7.3).

Also in mmces event show –N icgpfs-ces1 –time day the following error was logged about twice per minute:

icgpfs-ces1                              2016-12-06 06:32:04.968269 GMT        wnbd_restart              INFO       WINBINDD process was not running. Trying to start it

I moved the CES IP from icgpfs-ces2 to icgpfs-ces3 prior to suspending –ces2.

It was about that point I decided to abandon the planned upgrade of –ces2, resume the node and then suspend –ces1.

Attempts to downgrade the Kernel/OS/redhat-release RPM back to 7.2 worked well, except when I tried to start CES again and the node reported “Node failed”. I then rebuilt it completely, restored it to the cluster and it appears to be fine.

Sorry I can’t be any more specific than that but I hope it helps.

Thanks
Richard

From: gpfsug-discuss-bounces at spectrumscale.org [mailto:gpfsug-discuss-bounces at spectrumscale.org] On Behalf Of Ravi K Komanduri
Sent: 07 December 2016 06:46
To: r.sobey at inperial.ac.uk
Cc: gpfsug-discuss at spectrumscale.org
Subject: Re: [gpfsug-discuss] CES ON RHEL7.3

Sobey,

Could you mention the problems that you have faced on CES env for RH 7.3.  Is it related to the Kernel or in Ganesha environment ?

Your thoughts/inputs would help us in fixing the same.

Currently working on the CES environment on RH 7.3 support side.

With Regards,
Ravi K Komanduri
GPFS team
IBM



From:        "Sobey, Richard A" <r.sobey at imperial.ac.uk<mailto:r.sobey at imperial.ac.uk>>
To:        "'gpfsug-discuss at spectrumscale.org'" <gpfsug-discuss at spectrumscale.org<mailto:gpfsug-discuss at spectrumscale.org>>
Date:        12/07/2016 11:59 AM
Subject:        [gpfsug-discuss] CES ON RHEL7.3
Sent by:        gpfsug-discuss-bounces at spectrumscale.org<mailto:gpfsug-discuss-bounces at spectrumscale.org>
________________________________



A word of wisdom: do not try and run CES on RHEL 7.3 ☺Although it appears to work, a few things break and it becomes a bit unpredictable as I found out the hard way. I didn’t intend to run 7.3 of course as I knew it wasn’t supported.

Richard_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20161207/ff7e463b/attachment.htm>


More information about the gpfsug-discuss mailing list