egse.process
cgse-common
This code is part of the cgse-common
package.
egse.process
¶
This module provides functions and classes to work with processes and sub-processes.
We combine two great packages, the subprocess
and the psutils
packages, to
provide a simplified, robust and user-friendly interface to work with sub-processes.
The classes and functions are optimized to work with processes within the framework
of the cgse
, so we do not intend to be fully generic. If you need that, we recommend
to use the subprocess
and psutil
packages directly.
The main class provided is the SubProcess
which by default, starts a sub-process
in the background and detached from the parent process. That means there is no
communication between the parent and the subprocess through pipes. Most (if not all)
processes in the cgse
framework communicate with ZeroMQ messages in different
protocols like REQ-REP, PUSH-PULL and ROUTER-DEALER
Another useful class is the ProcessStatus
. This class provides status information
like memory and CPU usage for the running process. Additionally, it will generate
and update metrics that can be queried by the Prometheus timeseries database.
Classes:
Name | Description |
---|---|
ProcessStatus |
The ProcessStatus is basically a dataclass that contains the status information of a running |
SubProcess |
A SubProcess that is usually started by the ProcessManager. |
Functions:
Name | Description |
---|---|
get_process_info |
Loops over all running processes and tries to match each item in 'items' to the command line |
is_process_running |
Check if there is any running process that contains the given items in its |
list_processes |
Returns and optionally prints the processes that match the given criteria in items. |
list_zombies |
Returns a list of zombie processes. |
ProcessStatus
¶
ProcessStatus(*, metrics_prefix=None)
The ProcessStatus is basically a dataclass that contains the status information of a running process.
The available information is the following:
- pid: the process identifier
- uptime: the process up-time as a floating point number expressed in seconds
- uuid: the UUID1 for this process
- memory info: memory information on the process
- cpu usage, percentage and count (number of physical cores)
The status will always be updated before returning or printing.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
metrics_prefix
|
Optional[str]
|
the prefix that identifies the process for which these metrics are gathered. |
None
|
Methods:
Name | Description |
---|---|
as_dict |
Returns all process information as a dictionary. |
update |
Updates those values that change during execution, like memory usage, number of |
update_metrics |
Updates the metrics that are taken from the psutils module. |
as_dict
¶
as_dict()
Returns all process information as a dictionary.
This runs the update()
method first to bring the numbers up-to-date.
update
¶
update()
Updates those values that change during execution, like memory usage, number of connections, ...
This call will also update the metrics!
Returns:
Type | Description |
---|---|
ProcessStatus
|
the ProcessStatus object, self. |
update_metrics
¶
update_metrics()
Updates the metrics that are taken from the psutils module.
The following metrics are never updated since they are not changed during a process execution:
- PSUTIL_NUMBER_OF_CPU
- PSUTIL_PID
SubProcess
¶
A SubProcess that is usually started by the ProcessManager.
Example:
proc = SubProcess("MyApp", [sys.executable, "-m", "egse.<module>"])
proc.execute()
Methods:
Name | Description |
---|---|
execute |
Execute the sub-process. |
exists |
Checks if the sub-process exists by checking if its process ID exists. |
is_running |
Check if this process is still running. |
quit |
Send a request to quit to the process. |
reap_children |
Tries hard to terminate and ultimately kill all the children of this process. |
returncode |
Check if the sub-process is terminated and return its return code or None when the process |
execute
¶
execute()
Execute the sub-process.
Returns:
Type | Description |
---|---|
bool
|
True if the process could be started, False on error. |
exists
¶
exists()
Checks if the sub-process exists by checking if its process ID exists.
Returns:
Type | Description |
---|---|
bool
|
True if the sub-process exists. |
is_running
¶
is_running()
Check if this process is still running.
- checks if process exists
- checks if process is not a zombie and is not dead
Returns:
Type | Description |
---|---|
bool
|
True if the process is running. |
quit
¶
quit()
Send a request to quit to the process.
This will first send a SIGTERM signal to the process, if that fails, a SIGKILL will be sent.
Returns:
Type | Description |
---|---|
int
|
0 if the process and its sub-processes are all terminated. Will return > 0 to indicate the number of processes that survived the SIGKILL. |
reap_children
¶
reap_children(timeout=3)
Tries hard to terminate and ultimately kill all the children of this process.
This will first send a SIGTERM signal to the process, if that fails, a SIGKILL will be sent.
Returns:
Type | Description |
---|---|
int
|
0 if the process and its sub-processes are all terminated. Will return > 0 to indicate the number of processes that survived the SIGKILL. |
returncode
¶
returncode()
Check if the sub-process is terminated and return its return code or None when the process is still running.
get_process_info
¶
get_process_info(
items, contains=True, case_sensitive=False
)
Loops over all running processes and tries to match each item in 'items' to the command line of the process. Any process where all 'items' can be matched will end up in the response.
Returns a list with the process info (PID, cmdline, create_time) for any processes where all 'items' match the process command line. An empty list is returned when not 'all the items' match for any of the processes.
Examples:
>>> get_process_info(items=["feesim"])
[
{
'pid': 10166,
'cmdline': [
'/Library/Frameworks/Python.framework/Versions/3.8/Resources/Python.app/Contents/MacOS/Python',
'/Users/rik/git/plato-common-egse/venv38/bin/feesim',
'start',
'--zeromq'
],
'create_time': 1664898231.915995
}
]
>>> get_process_info(items=["dpu_cs", "--zeromq"])
[
{
'pid': 11595,
'cmdline': [
'/Library/Frameworks/Python.framework/Versions/3.8/Resources/Python.app/Contents/MacOS/Python',
'/Users/rik/git/plato-common-egse/venv38/bin/dpu_cs',
'start',
'--zeromq'
],
'create_time': 1664898973.542281
}
]
Parameters:
Name | Type | Description | Default |
---|---|---|---|
items
|
List[str] | str
|
a string or a list of strings that should match command line items |
required |
contains
|
bool
|
if True, the match is done with 'in' otherwise '==' |
True
|
case_sensitive
|
bool
|
if True, the match shall be case-sensitive |
False
|
Returns:
Type | Description |
---|---|
List
|
A list of process info entries. |
is_process_running
¶
is_process_running(
items,
contains=True,
case_sensitive=False,
as_list=False,
)
Check if there is any running process that contains the given items in its commandline.
Loops over all running processes and tries to match all items in the 'items' argument to the command line of the process. If all 'items' can be matched to a process, the function returns the PID of that process.
By default, only the first matching process PID is returned. If as_list=True
then all mathing process PIDs are returned as a list.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
items
|
List[str] | str
|
a string or a list of strings that should match command line parts |
required |
contains
|
bool
|
if True, the match is done with 'in' otherwise '==' [default: True] |
True
|
case_sensitive
|
bool
|
if True, the match shall be case-sensitive [default: False] |
False
|
as_list
|
bool
|
return the PID of all matching processes as a list [default: False] |
False
|
Returns:
Type | Description |
---|---|
int | List[int]
|
The PID(s) if there exists a running process with the given items, 0 or [] otherwise. |
list_processes
¶
list_processes(
items,
contains=True,
case_sensitive=False,
verbose=False,
)
Returns and optionally prints the processes that match the given criteria in items.
Parameters:
Name | Type | Description | Default |
---|---|---|---|
items
|
List[str] | str
|
a string or a list of strings that should match command line parts |
required |
contains
|
bool
|
if True, the match is done with 'in' otherwise '==' [default: True] |
True
|
case_sensitive
|
bool
|
if True, the match shall be case-sensitive [default: False] |
False
|
verbose
|
bool
|
if True, the processes will be printed to the console |
False
|
Returns:
Type | Description |
---|---|
list[dict]
|
A list of dictionaries for the matching processes. The dict contains the 'pid', 'status' and 'cmdline' of a process. |
list_zombies
¶
list_zombies()
Returns a list of zombie processes.
A zombie process, also known as a defunct process, is a process that has completed its execution but still has an entry in the process table. This happens when a child process terminates, but the parent process hasn't yet read its exit status by using a system call like wait(). As a result, the process is "dead" (it has completed execution) but hasn't been "reaped" or removed from the system's process table.
A zombie process can not be killed with SIGKILL because it's already dead, and it's only removed when the parent process reads their exit status or when the parent process itself terminates.
A zombie process does not block ports, so it's mostly harmless and will disappear when the parent process terminates.
Returns:
Type | Description |
---|---|
list[dict]
|
A list of dictionaries with information on the zombie processes. The dict contains the 'pid', 'name', and 'cmdline' of the zombie process. |