opera.util package
Submodules
opera.util.error_codes module
error_codes.py
Error codes for use with OPERA PGEs.
- opera.util.error_codes.CODES_PER_RANGE = 1000
Number of error codes allocated to each range
- opera.util.error_codes.CRITICAL_RANGE_START = 3000
Starting value for the Critical code range
- opera.util.error_codes.DEBUG_RANGE_START = 1000
Starting value for the Debug code range
- opera.util.error_codes.ERROR_CODE_PGE_OFFSET = 10000
Base offset used to distinguish error codes by PGE type
- class opera.util.error_codes.ErrorCode(value)[source]
Bases:
IntEnum
Error codes for OPERA PGEs.
Each code is combined with the designated error code offset defined by the RunConfig to determine the final, logged error code.
- CATALOG_METADATA_CREATION_FAILED = 3003
- CLOSING_LOG_FILE = 26
- CONFIGURATION_DETAILS = 1000
- CREATED_SAS_CONFIG = 16
- CREATING_CATALOG_METADATA = 18
- CREATING_ISO_METADATA = 19
- CREATING_OUTPUT_FILE = 17
- CREATING_WORKING_DIRECTORY = 5
- DATE_RANGE_MISSING = 2000
- DIRECTORY_CREATION_FAILED = 3001
- DIRECTORY_SETUP_COMPLETE = 6
- DYNAMIC_IMPORT_FAILED = 3022
- FILENAME_VIOLATES_NAMING_CONVENTION = 3011
- FILE_MOVE_FAILED = 3010
- GRIB_TO_NETCDF_CONVERSION_FAILED = 3023
- INPUT_FILE = 13
- INPUT_NOT_FOUND = 3005
- INVALID_CATALOG_METADATA = 3009
- INVALID_INPUT = 3007
- INVALID_OUTPUT = 3008
- ISO_METADATA_CANT_RENDER_ONE_VARIABLE = 2002
- ISO_METADATA_COULD_NOT_EXTRACT_METADATA = 3018
- ISO_METADATA_DESCRIPTIONS_CONFIG_INVALID = 3025
- ISO_METADATA_DESCRIPTIONS_CONFIG_NOT_FOUND = 3026
- ISO_METADATA_GOT_SOME_RENDERING_ERRORS = 3017
- ISO_METADATA_NO_DESCRIPTIONS = 2008
- ISO_METADATA_NO_ENTRY_FOR_DESCRIPTION = 3027
- ISO_METADATA_RENDER_FAILED = 3019
- ISO_METADATA_TEMPLATE_NOT_FOUND = 3016
- ISO_METADATA_TEMPLATE_NOT_PROVIDED_WHEN_NEEDED = 3024
- LOADING_RUN_CONFIG_FILE = 2
- LOGGED_CRITICAL_LINE = 3021
- LOGGED_DEBUG_LINE = 1004
- LOGGED_INFO_LINE = 27
- LOGGED_WARNING_LINE = 2007
- LOGGING_COULD_NOT_INCREMENT_SEVERITY = 2005
- LOGGING_REQUESTED_SEVERITY_NOT_FOUND = 2003
- LOGGING_RESYNC_FAILED = 2006
- LOGGING_SOURCE_FILE_DOES_NOT_EXIST = 2004
- LOG_FILE_CREATED = 1
- LOG_FILE_CREATION_FAILED = 3004
- LOG_FILE_INIT_COMPLETE = 4
- MOVING_LOG_FILE = 7
- MOVING_OUTPUT_FILE = 8
- NO_ALGO_PARAM_SCHEMA_PATH = 29
- NO_RENAME_FUNCTION_FOR_EXTENSION = 2001
- OUTPUT_NOT_FOUND = 3006
- OVERALL_SUCCESS = 0
- PGE_NAME = 11
- PROCESSING_DETAILS = 1001
- PROCESSING_INPUT_FILE = 14
- QA_SAS_PROGRAM_COMPLETED = 23
- QA_SAS_PROGRAM_DISABLED = 24
- QA_SAS_PROGRAM_FAILED = 3015
- QA_SAS_PROGRAM_NOT_FOUND = 3014
- QA_SAS_PROGRAM_STARTING = 22
- RENDERING_ISO_METADATA = 25
- RUN_CONFIG_FILENAME = 10
- RUN_CONFIG_VALIDATION_FAILED = 3000
- SAS_CONFIG_CREATION_FAILED = 3002
- SAS_EXE_COMMAND_LINE = 1002
- SAS_OUTPUT_FILE_HAS_MISSING_DATA = 3020
- SAS_PROGRAM_COMPLETED = 21
- SAS_PROGRAM_FAILED = 3013
- SAS_PROGRAM_NOT_FOUND = 3012
- SAS_PROGRAM_STARTING = 20
- SAS_QA_COMMAND_LINE = 1003
- SCHEMA_FILE = 12
- SUMMARY_STATS_MESSAGE = 9
- UPDATING_PRODUCT_METADATA = 28
- USING_CONFIG_FILE = 15
- VALIDATING_RUN_CONFIG_FILE = 3
- opera.util.error_codes.INFO_RANGE_START = 0
Starting value for the Info code range
- opera.util.error_codes.WARNING_RANGE_START = 2000
Starting value for the Warning code range
opera.util.logger module
logger.py
Logging utilities for use with OPERA PGEs.
This module is adapted for OPERA from the NISAR PGE R2.0.0 util/logger.py Original Authors: Alice Stanboli, David White Adapted By: Scott Collins, Jim Hofman
- opera.util.logger.CRITICAL = 'Critical'
Constants for logging levels
- class opera.util.logger.PgeLogger(workflow=None, error_code_base=None, log_filename=None)[source]
Bases:
object
Class to help with the PGE logging.
Advantages over the standalone write() function: * Opens and closes the log file for you * The class’s write() function has fewer arguments that need to be provided.
- LOGGER_CODE_BASE = 900000
- QA_LOGGER_CODE_BASE = 800000
- append(source)[source]
Appends text from another file to this log file.
- Parameters:
source (str) – The source text to append. If the source refers a file name, the contents of the file will be appended. Otherwise, the provided text is appended as is.
- close_log_stream()[source]
Writes the log summary to the log stream Writes the log stream to a log file and saves the file to disk Closes the log stream
- critical(module, error_code_offset, description)[source]
Write a critical-level message to the log.
Since critical messages should be used for unrecoverable errors, any time this log level is invoked a RuntimeError is raised with the description provided to this function. The log file is closed and finalized before the exception is raised.
- Parameters:
module (str) – Name of the module where the logging took place.
error_code_offset (int) – Error code offset to add to this logger’s error code base value to determine the final error code associated with the log message.
description (str) – Description message to write to the log.
- Raises:
RuntimeError – Raised when this method is called. The contents of the description parameter is provided as the exception string.
- debug(module, error_code_offset, description)[source]
Write a debug-level message to the log.
- Parameters:
module (str) – Name of the module where the logging took place.
error_code_offset (int) – Error code offset to add to this logger’s error code base value to determine the final error code associated with the log message.
description (str) – Description message to write to the log.
- property error_code_base
Return the error code base from error_codes.py
- get_log_count_by_severity(severity)[source]
Gets the number of messages logged for the specified severity
- Parameters:
severity (str) – The severity level to get the log count of. Should be one of info, debug, warning, critical (case-insensitive).
- Returns:
log_count – The number of messages logged at the provided severity level.
- Return type:
int
- get_log_count_by_severity_dict()[source]
Returns a copy of the dictionary of log counts by severity.
- increment_log_count_by_severity(severity)[source]
Increments the logged message count of the provided severity level.
- Parameters:
severity (str) – The severity level to increment the log count of. Should be one of info, debug, warning, critical (case-insensitive).
- info(module, error_code_offset, description)[source]
Write an info-level message to the log.
- Parameters:
module (str) – Name of the module where the logging took place.
error_code_offset (int) – Error code offset to add to this logger’s error code base value to determine the final error code associated with the log message.
description (str) – Description message to write to the log.
- log(module, error_code_offset, description, additional_back_frames=0)[source]
Logs any kind of message.
Determines the log level (Critical, Warning, Info, or Debug) based on the provided error code offset.
- Parameters:
module (str) – Name of the module where the logging took place.
error_code_offset (int) – Error code offset to add to this logger’s error code base value to determine the final error code associated with the log message.
description (str) – Description message to write to the log.
additional_back_frames (int, optional) – Number of call-stack frames to “back up” to in order to determine the calling function and line number.
- log_one_metric(module, metric_name, metric_value, additional_back_frames=0)[source]
Writes one metric value to the log file.
- Parameters:
module (str) – Name of the module where the logging took place.
metric_name (str) – Name of the metric being logged.
metric_value (object) – Value to associate to the logged metric.
additional_back_frames (int) – Number of call-stack frames to “back up” to in order to determine the calling function and line number.
- move(new_filename)[source]
This function is useful when the log file has been given a default name, and needs to be assigned a name that meets the PGE file naming conventions.
- Parameters:
new_filename (str) – The new filename (including path) to assign to this log file.
- parse_line(line)[source]
Parses the provided formatted log line into its component parts according to the log formatting style for OPERA.
- Parameters:
line (str) – The log line to parse
- Returns:
parsed_line – The provided log line parsed into its component parts.
- Return type:
tuple
- Raises:
ValueError – If the line cannot be parsed according to the OPERA log formatting style.
- warning(module, error_code_offset, description)[source]
Write a warning-level message to the log.
- Parameters:
module (str) – Name of the module where the logging took place.
error_code_offset (int) – Error code offset to add to this logger’s error code base value to determine the final error code associated with the log message.
description (str) – Description message to write to the log.
- property workflow
Return specific workflow
- write(severity, module, error_code_offset, description, additional_back_frames=0)[source]
Write a message to the log.
- Parameters:
severity (str) – The severity level to log at. Should be one of info, debug, warning, critical (case-insensitive).
module (str) – Name of the module where the logging took place.
error_code_offset (int) – Error code offset to add to this logger’s error code base value to determine the final error code associated with the log message.
description (str) – Description message to write to the log.
additional_back_frames (int, optional) – Number of call-stack frames to “back up” to in order to determine the calling function and line number.
- opera.util.logger.default_log_file_name()[source]
Returns a path + filename that can be used for the log file right away.
To minimize the risk of errors opening a log file, the initial log filename does not rely on anything read from a run config file, SAS output file, etc. Therefore, this filename does not follow the file naming convention.
Later (elsewhere), after everything is known, the log file will be renamed.
- Returns:
file_path – Path to the default log file name.
- Return type:
str
- opera.util.logger.get_severity_from_error_code(error_code)[source]
Determines the log level (Critical, Warning, Info, or Debug) based on the provided error code.
- Parameters:
error_code (int or ErrorCode) – The error code to map to a severity level.
- Returns:
severity – The severity level associated to the provided error code.
- Return type:
str
- opera.util.logger.standardize_severity_string(severity)[source]
Returns the severity string in a consistent way.
- Parameters:
severity (str) – The severity string to standardize.
- Returns:
severity – The standardized severity string.
- Return type:
str
- opera.util.logger.write(log_stream, severity, workflow, module, error_code, error_location, description, time_tag=None)[source]
Low-level logging function. May be called directly in lieu of PgeLogger class.
- Parameters:
log_stream (io.StringIO) – The log stream to write to.
severity (str) – The severity level of the log message.
workflow (str) – Name of the workflow where the logging took place.
module (str) – Name of the module where the logging took place.
error_code (int or ErrorCode) – The error code associated with the logged message.
error_location (str) – File name and line number where the logging took place.
description (str) – Description of the logged event.
time_tag (str, optional) – ISO format time tag to associate to the message. If not provided, the current time is used.
opera.util.run_utils module
run_utils.py
Contains utility functions for running executable processes within the OPERA PGE subsystem.
- opera.util.run_utils.create_qa_command_line(qa_program_path, qa_program_options=None)[source]
Forms the appropriate command line for executing a SAS Quality Assurance (QA) application from parameters obtained from the RunConfig.
By default, this function assumes the QA program path corresponds to an executable file reaching within the current environment’s PATH. If this function cannot locate the executable, the QA program path is assumed to be a Python module path and treated accordingly.
- Parameters:
qa_program_path (str) – The path to the QA executable to be invoked by the returned command line.
qa_program_options (list[str], optional) – List of options to include in the returned command line.
- Returns:
command_line – The fully formed command line, returned in list format suitable for use with subprocess.run.
- Return type:
list[str]
- Raises:
OSError – If the QA executable exists within the current environment, but is not set with execute permissions for the current process.
- opera.util.run_utils.create_sas_command_line(sas_program_path, sas_runconfig_path, sas_program_options=None)[source]
Forms the appropriate command line for executing a SAS from the parameters obtained from the RunConfig.
By default, this function assumes the SAS program path corresponds to an executable file reaching within the current environment’s PATH. If this function cannot locate the executable, the SAS program path is assumed to be a Python module path and treated accordingly.
- Parameters:
sas_program_path (str) – The path to the SAS executable to be invoked by the returned command line.
sas_runconfig_path (str) – The path to the RunConfig to feed to the SAS executable in the returned command line.
sas_program_options (list[str], optional) – List of options to include in the returned command line.
- Returns:
command_line – The fully formed command line, returned in list format suitable for use with subprocess.run.
- Return type:
list[str]
- Raises:
OSError – If the SAS executable exists within the current environment, but is not set with execute permissions for the current process.
- opera.util.run_utils.get_checksum(file_name)[source]
Generate the MD5 checksum of the provided file.
This function was adapted from swot_pge.util.BasePgeWrapper.get_checksum()
- Parameters:
file_name (str) – Path the file on disk to generate the checksum for.
- Returns:
checksum – MD5 checksum of the provided file.
- Return type:
str
- opera.util.run_utils.get_extension(file_name)[source]
Returns the file extension (including the dot) of the provided file name.
- opera.util.run_utils.get_traceback_from_log(log_contents)[source]
Utilizes a regular expression to parse and return a traceback stack from provided log contents.
Notes
The regular expression used with this function was derived from the following Stack Exchange answer: https://stackoverflow.com/a/53658873
- Parameters:
log_contents (str) – The log contents to parse for a traceback stack.
- Returns:
traceback_match – The result of the regex search for a traceback. If none could be found, None will be returned.
- Return type:
re.Match
- opera.util.run_utils.time_and_execute(command_line, logger, execute_via_shell=False)[source]
Executes the provided command line via subprocess while collecting the runtime of the execution.
- Parameters:
command_line (Iterable[str]) – The command line program, including options/arguments, to execute. Each
logger (PgeLogger) – A logger object used to capture any error status returned from execution.
execute_via_shell (bool, optional) – If true, instruct subprocess.run to execute the command-line via system shell. Useful for running test commands but should generally not be used for production.
- Returns:
elapsed_time – The time elapsed during execution, in seconds.
- Return type:
float
opera.util.time module
time.py
Time-tag generation utilities for use with OPERA PGEs.
This module is adapted for OPERA from the NISAR PGE R2.0.0 util/time.py Original Author: Alice Stanboli Adapted By: Scott Collins
- opera.util.time.get_catalog_metadata_datetime_str(date_time)[source]
Converts the provided datetime object to a time-tag string suitable for use in catalog metadata.
- Parameters:
date_time (datetime.datetime) – Datetime object to convert to a catalog metadata time-tag string.
- Returns:
datetime_str – The provided time converted to ISO format, including nanosecond resolution.
- Return type:
str
- opera.util.time.get_current_iso_time()[source]
Returns current time in ISO format, including trailing “Z” to indicate Zulu (GMT) time.
- Returns:
time_in_iso – Current time in ISO format: YYYY-MM-DDTHH:MM:SS.mmmmmmZ
- Return type:
str
- opera.util.time.get_iso_time(date_time)[source]
Converts the provided datetime object to an ISO-format time-tag.
- Parameters:
date_time (datetime.datetime) – Convert to ISO format.
- Returns:
time_in_iso – Provided time in ISO format: YYYY-MM-DDTHH:MM:SS.mmmmmmZ
- Return type:
str
- opera.util.time.get_time_for_filename(date_time)[source]
Converts the provided datetime object to a time-tag string suitable for use with output filenames.
- Parameters:
date_time (datetime.datetime) – Datetime object to convert to a filename time-tag.
- Returns:
datetime_str – The provided time converted to YYYYMMDDTHHmmss format.
- Return type:
str
opera.util.usage_metrics module
usage_metrics.py
OS-level metrics gathering functions for use with OPERA PGEs.
This module is adapted for OPERA from the NISAR PGE R2.0.0 util/usage_metrics.py Original Author: David White Adapted By: Scott Collins
- opera.util.usage_metrics.get_os_metrics()[source]
Gets metrics related to machine resource usage, by the current process and by all of its children processes.
- Returns:
metrics –
- Dictionary containing metrics mapped to the following keys:
- os.cpu.seconds.sys -
System CPU time, in seconds, consumed by the current process and its children
- os.cpu.seconds.user -
User CPU time, in seconds, consumed by the current process and its children
- os.filesystem.reads -
Number of file system reads performed by the current process and its children
- os.filesystem.writes -
Number of file system writes performed by the current process and its children
- os.max_rss_kb.main_process -
Maximum resident set size (physical memory consumption), in kilobytes, of the current process
- os.max_rss_kb.largest_child_process -
Maximum resident set size (physical memory consumption), in kilobytes, of the largest child process
- os.peak_vm_kb.main_process -
Peak virtual memory usage, in kilobytes, of the current process.
- Return type:
dict
- opera.util.usage_metrics.get_self_peak_vmm_kb()[source]
Attempt to get the peak virtual memory by looking into the /proc/self/status
Note that this accounts for the peak virtual memory of just the current process, not the sum of the current process and all its children.
- Returns:
vm_peak_kb – Peak virtual memory usage, in kilobytes, of the current process. If this value cannot be obtained for any reason, -1 is returned instead.
- Return type:
int
Module contents
util
Contains utility modules for performing common operations, such as logging and metrics gathering, for use with the OPERA PGE Subsystem.