[gpfsug-discuss] Remote cluster mount failing

Felipe Knop knop at us.ibm.com
Mon Sep 12 06:17:05 BST 2016


There is a chance the problem might be related to an upgrade from 3.5 to 
4.1, or perhaps a remote mount between versions 3.5 and 4.1. It would be 
useful to know details related to any such migration and different 
releases when the PMR is opened.

Thanks,

  Felipe

----
Felipe Knop                                     knop at us.ibm.com
GPFS Development and Security
IBM Systems
IBM Building 008
2455 South Rd, Poughkeepsie, NY 12601
(845) 433-9314  T/L 293-9314





From:   Yuri L Volobuev/Austin/IBM at IBMUS
To:     gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>
Date:   09/09/2016 12:30 PM
Subject:        Re: [gpfsug-discuss] Remote cluster mount failing
Sent by:        gpfsug-discuss-bounces at spectrumscale.org



It could be "easy" in the end, e.g. regenerating the key ("mmauth genkey 
new") may fix the issue. Figuring out exactly what is going wrong is messy 
though, and requires looking at a number of debug data points, something 
that's awkward to do on a public mailing list. I don't think you want to 
post certificates et al on a mailing list. The PMR channel is more 
appropriate for this kind of thing.

yuri

"Simon Thompson (Research Computing - IT Services)" ---09/09/2016 07:37:52 
AM---That’s sorta what I was expecting. Though I was hoping someone might 
have said 'oh just run mmchconf

From: "Simon Thompson (Research Computing - IT Services)" 
<S.J.Thompson at bham.ac.uk>
To: gpfsug main discussion list <gpfsug-discuss at spectrumscale.org>, 
Date: 09/09/2016 07:37 AM
Subject: Re: [gpfsug-discuss] Remote cluster mount failing
Sent by: gpfsug-discuss-bounces at spectrumscale.org



That’s sorta what I was expecting. Though I was hoping someone might have 
said 'oh just run mmchconfig ....' or something easy.

PMR on its way in.

Thanks!

Simon

From: <gpfsug-discuss-bounces at spectrumscale.org> on behalf of Yuri L 
Volobuev <volobuev at us.ibm.com>
Reply-To: "gpfsug-discuss at spectrumscale.org" <
gpfsug-discuss at spectrumscale.org>
Date: Wednesday, 7 September 2016 at 17:58
To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>
Subject: Re: [gpfsug-discuss] Remote cluster mount failing
It's unclear what's wrong. I'd have two main suspects: (1) TLS protocol 
version confusion, due to a difference in GSKit version and/or 
configuration (e.g. NIST SP800 compliance) on two sides (2) firewall. TLS 
issues are usually messy and tedious to work though. I'd recommend opening 
a PMR to facilitate debug data collection and analysis. A lot of gory 
detail may be needed to figure out what's going on.

yuri

"Simon Thompson (Research Computing - IT Services)" ---09/07/2016 05:37:11 
AM---Hi All, I'm trying to get some multi cluster thing working between 
two of our GPFS

From: "Simon Thompson (Research Computing - IT Services)" <
S.J.Thompson at bham.ac.uk>
To: "gpfsug-discuss at spectrumscale.org" <gpfsug-discuss at spectrumscale.org>, 

Date: 09/07/2016 05:37 AM
Subject: [gpfsug-discuss] Remote cluster mount failing
Sent by: gpfsug-discuss-bounces at spectrumscale.org



Hi All,

I'm trying to get some multi cluster thing working between two of our GPFS
clusters.

In the "client" cluster, when trying to mount the "remote" cluster, I get:

# mmmount gpfs
Wed 7 Sep 13:33:06 BST 2016: mmmount: Mounting file systems ...
mount: mount /dev/gpfs on /gpfs failed: Connection timed out
mmmount: Command failed. Examine previous error messages to determine
cause.


And in the log file:
Wed Sep 7 13:33:07.481 2016: [N] The client side TLS handshake with node
10.0.0.182 was cancelled: connection reset by peer (return code 420).
Wed Sep 7 13:33:07.486 2016: [N] The client side TLS handshake with node
10.0.0.181 was cancelled: connection reset by peer (return code 420).
Wed Sep 7 13:33:07.487 2016: [E] Failed to join remote cluster
GPFS_STORAGE.CLUSTER
Wed Sep 7 13:33:07.488 2016: [W] Command: err 78: mount
GPFS_STORAGE.CLUSTER:gpfs
Wed Sep 7 13:33:07.489 2016: Connection timed out

In the remote cluster, I see:

Wed Sep 7 13:33:07.487 2016: [W] The TLS handshake with node 10.0.0.222
failed with error 447 (server side).
Wed Sep 7 13:33:07.488 2016: [X] Connection from 10.10.0.35 <c0p174>
refused, authentication failed
Wed Sep 7 13:33:07.489 2016: [E] Killing connection from 10.10.0.35, err
703
Wed Sep 7 13:33:07.490 2016: Operation not permitted



Weirdly though on other nodes in the client cluster this succeeds fine and
can mount, so I think I got all the bits in the mmauth and mmremotecluster
configured correctly.

Any suggestions?

Thanks

Simon

_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss


_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss

[attachment "graycol.gif" deleted by Yuri L Volobuev/Austin/IBM] 
_______________________________________________
gpfsug-discuss mailing list
gpfsug-discuss at spectrumscale.org
http://gpfsug.org/mailman/listinfo/gpfsug-discuss



-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20160912/039e0b74/attachment.htm>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/gif
Size: 105 bytes
Desc: not available
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20160912/039e0b74/attachment.gif>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: image/gif
Size: 105 bytes
Desc: not available
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20160912/039e0b74/attachment-0001.gif>


More information about the gpfsug-discuss mailing list