Why does ssh fail on the first attempt, then succeed seemingly at random after several "No route to host" and "Connection timed out" errors?
# clush -bw node049 date
node049: ssh: connect to host node049 port 22: No route to host
clush: node049: exited with exit code 255
# clush -bw node049 date
node049: ssh: connect to host node049 port 22: No route to host
clush: node049: exited with exit code 255
# clush -bw node049 date
node049: ssh: connect to host node049 port 22: No route to host
clush: node049: exited with exit code 255
# clush -bw node049 date
node049: ssh: connect to host node049 port 22: Connection timed out
clush: node049: exited with exit code 255
# clush -bw node049 date
---------------
node049
---------------
Mon Jan 27 04:40:58 CET 2020
#
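To put a number on how often the connection fails, the attempts can be repeated in a loop. This is only a minimal sketch: the helper name `count_failures` is hypothetical, and it just runs whatever command it is given and counts non-zero exit codes.

```shell
# Hypothetical helper: run a command N times and report how many attempts failed.
# Usage: count_failures N command [args...]
count_failures() {
    attempts=$1
    shift
    failed=0
    i=0
    while [ "$i" -lt "$attempts" ]; do
        # Discard the command's output; only its exit status matters here.
        "$@" >/dev/null 2>&1 || failed=$((failed + 1))
        i=$((i + 1))
    done
    echo "$failed/$attempts attempts failed"
}
```

For example, `count_failures 20 ssh -o BatchMode=yes -o ConnectTimeout=5 node049 true` would try the node 20 times; the timeout value and attempt count are arbitrary choices.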
I have 50-plus nodes, and this happens at random across the cluster.
I see many messages like these on the nodes:
Jan 27 04:44:27 justime049 kernel: net_ratelimit: 72 callbacks suppressed
Jan 27 04:44:32 justime049 kernel: net_ratelimit: 66 callbacks suppressed
Jan 27 04:44:37 justime049 kernel: net_ratelimit: 252 callbacks suppressed
Jan 27 04:44:42 justime049 kernel: net_ratelimit: 2455 callbacks suppressed
Jan 27 04:44:47 justime049 kernel: net_ratelimit: 2799 callbacks suppressed
Jan 27 04:44:52 justime049 kernel: net_ratelimit: 3895 callbacks suppressed
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:44:52 justime049 rsyslogd: action 'action 0' suspended, next retry is Mon Jan 27 04:45:22 2020 [v8.24.0-34.el7 try http://www.rsyslog.com/e/2007 ]
Jan 27 04:44:57 justime049 kernel: net_ratelimit: 2339 callbacks suppressed
Jan 27 04:45:02 justime049 kernel: net_ratelimit: 2662 callbacks suppressed
Jan 27 04:45:07 justime049 kernel: net_ratelimit: 2450 callbacks suppressed
Jan 27 04:45:12 justime049 kernel: net_ratelimit: 2406 callbacks suppressed
Jan 27 04:45:17 justime049 kernel: net_ratelimit: 2401 callbacks suppressed
Jan 27 04:45:22 justime049 kernel: net_ratelimit: 2917 callbacks suppressed
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' resumed (module 'builtin:omfwd') [v8.24.0-34.el7 try http://www.rsyslog.com/e/2359 ]
Jan 27 04:45:22 justime049 rsyslogd: action 'action 0' suspended, next retry is Mon Jan 27 04:45:52 2020 [v8.24.0-34.el7 try http://www.rsyslog.com/e/2007 ]
Jan 27 04:45:27 justime049 kernel: net_ratelimit: 4646 callbacks suppressed
Jan 27 04:45:32 justime049 kernel: net_ratelimit: 3588 callbacks suppressed
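To gauge how many kernel messages are being dropped, the suppressed counts in lines like the above can be totalled. A rough sketch, assuming the log lines arrive on standard input; the function name `sum_suppressed` is hypothetical.

```shell
# Hypothetical helper: sum the "N callbacks suppressed" counts read from stdin.
sum_suppressed() {
    awk '/net_ratelimit: [0-9]+ callbacks suppressed/ {
             # The count is the field right after the "net_ratelimit:" token.
             for (i = 1; i <= NF; i++)
                 if ($i == "net_ratelimit:") total += $(i + 1)
         }
         END { print total + 0 }'
}
```

It could be fed from the system log, for instance `sum_suppressed < /var/log/messages` (the log path is an assumption and may differ on a given system).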
There is no firewall and no iptables running:
# firewall-cmd --state
not running
# systemctl stop iptables
Failed to stop iptables.service: Unit iptables.service not loaded.