[gpfsug-discuss] How to ignore ib_rdma_nic_unrecognized event on nodes where an IB link is not used.
Ran Pergamin
rpergamin at ddn.com
Wed May 29 12:54:46 BST 2019
Hi All,
My customer has some nodes in the cluster which current have their second IB port disabled.
Spectrum scale 4.2.3 update 13.
Port 1 is defined in verbs port, yet sysmoncon monitor and reports error on port 2 despite not being used.
I found an old listing claiming it will be solved in in 4.2.3-update5, yet nothing in 4.2.3-update7 release notes, about it.
https://www.spectrumscale.org/pipermail/gpfsug-discuss/2018-January/004395.html
Filters in sensor file say filters are not support + apply to ALL nodes, so no relevant where I need to ignore it.
Any idea how can I disable the check of sensor on mlx4_0/2 on some of the nodes ?
Node name: cff003-ib0.chemfarm
Node status: DEGRADED
Status Change: 2019-05-29 12:29:49
Component Status Status Change Reasons
-------------------------------------------------------------------------------------------------------------------------------------------------
GPFS TIPS 2019-05-29 12:29:48 gpfs_pagepool_small
NETWORK DEGRADED 2019-05-29 12:29:49 ib_rdma_link_down(mlx4_0/2), ib_rdma_nic_down(mlx4_0/2), ib_rdma_nic_unrecognized(mlx4_0/2)
ib0 HEALTHY 2019-05-29 12:29:49 -
mlx4_0/1 HEALTHY 2019-05-29 12:29:49 -
mlx4_0/2 FAILED 2019-05-29 12:29:49 ib_rdma_link_down, ib_rdma_nic_down, ib_rdma_nic_unrecognized
FILESYSTEM HEALTHY 2019-05-29 12:29:48 -
apps HEALTHY 2019-05-29 12:29:48 -
data HEALTHY 2019-05-29 12:29:48 -
PERFMON HEALTHY 2019-05-29 12:29:33 -
THRESHOLD HEALTHY 2019-05-29 12:29:18 -
Thanks !
Regards,
Ran
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://gpfsug.org/pipermail/gpfsug-discuss_gpfsug.org/attachments/20190529/e100c48c/attachment.htm>
More information about the gpfsug-discuss
mailing list