停止hadoop集群,运行命令
$ sh stop-all.sh
出现提示:
no resourcemanager to stophost10: no nodemanager to stophost9: no nodemanager to stopno proxyserver to stop
添加日志重新操作:
export HADOOP_ROOT_LOGGER=DEBUG,console
18/11/13 11:07:16 DEBUG util.PerformanceAdvisory: Falling back to shell based18/11/13 11:07:16 DEBUG security.JniBasedUnixGroupsMappingWithFallback: Group mapping impl=org.apache.hadoop.security.ShellBasedUnixGroupsMapping18/11/13 11:07:16 DEBUG security.Groups: Group mapping impl=org.apache.hadoop.security.JniBasedUnixGroupsMappingWithFallback; cacheTimeout=300000; warningDeltaMs=500018/11/13 11:07:16 DEBUG security.UserGroupInformation: hadoop login18/11/13 11:07:16 DEBUG security.UserGroupInformation: hadoop login commit18/11/13 11:07:16 DEBUG security.UserGroupInformation: using local user:UnixPrincipal: udap18/11/13 11:07:16 DEBUG security.UserGroupInformation: Using user: "UnixPrincipal: udap" with name udap18/11/13 11:07:16 DEBUG security.UserGroupInformation: User entry: "udap"18/11/13 11:07:16 DEBUG security.UserGroupInformation: UGI loginUser:udap (auth:SIMPLE) 18/11/13 11:07:16 DEBUG security.UserGroupInformation: PrivilegedAction as:udap (auth:SIMPLE) from:org.apache.hadoop.hdfs.tools.GetConf.run(GetConf.java:314) stopping yarn daemons no resourcemanager to stop host10: no nodemanager to stop host9: no nodemanager to stop no proxyserver to stop
没什么报错信息,这不科学,所以观察脚本,发现错误在寻找pid的时候没找到,默认pid存在:
if [ "$HADOOP_PID_DIR" = "" ]; then HADOOP_PID_DIR=/tmpfi
当tmp目录被删除时,此时就无法运行脚本停止进程,所以只能手动一台台停止进程了,如果要避免错误应该是启动脚本中将“HADOOP_PID_DIR”配置到不删除目录