Troubleshooting Linux Container Executor
A list of numeric error codes communicated by the container-executor to the NodeManager that appear in the /var/log/hadoop-yarn NodeManager log.
|
Numeric Code |
Name |
Description |
|---|---|---|
|
1 |
INVALID_ARGUMENT_NUMBER |
|
|
2 |
INVALID_USER_NAME |
The user passed to the container-executor does not exist. |
|
3 |
INVALID_COMMAND_PROVIDED |
The container-executor does not recognize the command it was asked to run. |
|
5 |
INVALID_NM_ROOT |
The passed NodeManager root does not match the configured NodeManager root
( |
|
6 |
SETUID_OPER_FAILED |
Either could not read the local groups database, or could not set UID or GID |
|
7 |
UNABLE_TO_EXECUTE_CONTAINER_SCRIPT |
The container-executor could not run the container launcher script. |
|
8 |
UNABLE_TO_SIGNAL_CONTAINER |
The container-executor could not signal the container it was passed. |
|
9 |
INVALID_CONTAINER_PID |
The PID passed to the container-executor was negative or 0. |
|
18 |
OUT_OF_MEMORY |
The container-executor couldn't allocate enough memory while reading the container-executor.cfg file, or while getting the paths for the container launcher script or credentials files. |
|
20 |
INITIALIZE_USER_FAILED |
Couldn't get, stat, or secure the per-user NodeManager directory. |
|
21 |
UNABLE_TO_BUILD_PATH |
The container-executor couldn't concatenate two paths, most likely because it ran out of memory. |
|
22 |
INVALID_CONTAINER_EXEC_PERMISSIONS |
The container-executor binary does not have the correct permissions set. |
|
24 |
INVALID_CONFIG_FILE |
The container-executor.cfg file is missing, malformed, or has incorrect permissions. |
|
25 |
SETSID_OPER_FAILED |
Could not set the session ID of the forked container. |
|
26 |
WRITE_PIDFILE_FAILED |
Failed to write the value of the PID of the launched container to the PID file of the container. |
|
255 |
Unknown Error |
This error has several possible causes. Some common causes are:
|
|
Numeric Code |
Name |
Description |
|---|---|---|
|
0 |
SUCCESS |
Container has finished succesfully. |
|
-1000 |
INVALID |
Initial value of the container exit code. A container that does not have a COMPLETED state will always return this status. |
|
-100 |
ABORTED |
Containers killed by the framework, either due to being released by the application or being 'lost' due to node failures, for example. |
|
-101 |
DISKS_FAILED |
Container exited due to local disks issues in the NodeManager node. This occurs when the number of good nodemanager-local-directories or nodemanager-log-directories drops below the health threshold. |
|
-102 |
PREEMPTED |
Containers preempted by the framework. This does not count towards a container failure in most applications. |
|
-103 |
KILLED_EXCEEDED_VMEM |
Container terminated because of exceeding allocated virtual memory limit. |
|
-104 |
KILLED_EXCEEDED_PMEM |
Container terminated because of exceeding allocated physical memory limit. |
|
-105 |
KILLED_BY_APPMASTER |
Container was terminated on request of the application master. |
|
-106 |
KILLED_BY_RESOURCEMANAGER |
Container was terminated by the resource manager. |
|
-107 |
KILLED_AFTER_APP_COMPLETION |
Container was terminated after the application finished. |
