fireworks.utilities package¶
Subpackages¶
- fireworks.utilities.tests package
- Submodules
- fireworks.utilities.tests.test_dagflow module
- fireworks.utilities.tests.test_filepad module
- fireworks.utilities.tests.test_fw_serializers module
ExplicitSerializationTest
ExplicitTestSerializer
SerializationTest
SerializationTest.setUp()
SerializationTest.tearDown()
SerializationTest.test_as_dict()
SerializationTest.test_complex_json()
SerializationTest.test_complex_yaml()
SerializationTest.test_implicit_serialization()
SerializationTest.test_json()
SerializationTest.test_numpy_array()
SerializationTest.test_sanity()
SerializationTest.test_serialize_fw_decorator()
SerializationTest.test_unicode_json()
SerializationTest.test_unicode_json_file()
SerializationTest.test_unicode_yaml()
SerializationTest.test_unicode_yaml_file()
SerializationTest.test_yaml()
- fireworks.utilities.tests.test_update_collection module
- fireworks.utilities.tests.test_visualize module
- Module contents
Submodules¶
fireworks.utilities.dagflow module¶
fireworks.utilities.dict_mods module¶
This module allows you to modify a dict (a spec) using another dict (an instruction). The main method of interest is apply_dictmod().
This code is based heavily on the Ansible class of custodian <https://pypi.python.org/pypi/custodian>, but simplifies it considerably for the limited use cases required by FireWorks.
- fireworks.utilities.dict_mods.apply_mod(modification, obj) None ¶
Note that modify makes actual in-place modifications. It does not return a copy.
- Parameters:
modification – Modification must be {action_keyword : settings}, where action_keyword is a supported DictMod
obj – A dict to be modified
- fireworks.utilities.dict_mods.arrow_to_dot(input_dict)¶
Converts arrows (‘->’) in dict keys to dots ‘.’ recursively. Allows for storing MongoDB neseted document queries in MongoDB.
- Parameters:
input_dict (dict)
- Returns:
dict
- fireworks.utilities.dict_mods.get_nested_dict(input_dict, key)¶
fireworks.utilities.filepad module¶
This module defines the FilePad class. FilePad is a convenient api to mongodb/gridfs to add/delete/update any file of any size.
- class fireworks.utilities.filepad.FilePad(host='localhost', port=27017, database='fireworks', username=None, password=None, authsource=None, uri_mode=False, mongoclient_kwargs=None, filepad_coll_name='filepad', gridfs_coll_name='filepad_gfs', logdir=None, strm_lvl=None, text_mode=False)¶
Bases:
MSONable
- __init__(host='localhost', port=27017, database='fireworks', username=None, password=None, authsource=None, uri_mode=False, mongoclient_kwargs=None, filepad_coll_name='filepad', gridfs_coll_name='filepad_gfs', logdir=None, strm_lvl=None, text_mode=False) None ¶
- Parameters:
host (str) – hostname
port (int) – port number
database (str) – database name
username (str)
password (str)
authsource (str) – authSource parameter for MongoDB authentication; defaults to “name” (i.e., db name) if not set
uri_mode (bool) – if set True, all Mongo connection parameters occur through a MongoDB URI string (set as the host).
filepad_coll_name (str) – filepad collection name
gridfs_coll_name (str) – gridfs collection name
logdir (str) – path to the log directory
strm_lvl (str) – the logger stream level
text_mode (bool) – whether to use text_mode for file read/write (instead of binary). Might be useful if working only with text files between Windows and Unix systems
- add_file(path, identifier=None, compress=True, metadata=None)¶
Insert the file specified by the path into gridfs. The gridfs id and identifier are returned. Note: identifier must be unique, i.e, no insertion if the identifier already exists in the db.
- Parameters:
- Returns:
the id returned by gridfs, identifier
- Return type:
- classmethod auto_load()¶
Returns FilePad object.
- count(filter=None, **kwargs)¶
Get the number of documents in filepad.
- delete_file(identifier) None ¶
Delete the document with the matching identifier. The contents in the gridfs as well as the associated document in the filepad are deleted.
- Parameters:
identifier (str) – the file identifier
- classmethod from_db_file(db_file, admin=True)¶
- Parameters:
db_file (str) – path to the filepad cred file.
- Returns:
FilePad object
- get_file(identifier)¶
Get file by identifier.
- get_file_by_id(gfs_id)¶
- get_file_by_query(query, sort_key=None, sort_direction=-1)¶
- update_file(identifier, path, compress=True)¶
Update the file contents in the gridfs, update the gfs_id in the document and retain the rest.
- update_file_by_id(gfs_id, path, compress=True)¶
Update the file in the gridfs with the given id and retain the rest of the document.
fireworks.utilities.fw_serializers module¶
This module aids in serializing and deserializing objects.
To serialize a FW object, refer to the documentation for the FWSerializable class. To de-serialize an object, refer to the documentation for the FWSerializable class and load_object() method.
you can de-serialize explicitly, e.g. FWObject.from_dict() to enforce a FWObject instance as the result
you can de-serialize implicitly, e.g. load_object() to search for the correct Class dynamically
The implicit method is especially useful if you don’t know in advance which subclass of a base class your serialization might point to. e.g. if you require some type of Quadrilateral, a serialization from your collaborator might point to a Square, Rhombus, or Rectangle, and you might not know which one in advance…
- Some advantages:
- Robust with regard to code refactorings even in implicit loading given certain reasonable
guidelines on fw_name.
- Simple to allow a superclass to define all the serializations for its subclasses, removing
code repetition(in particular, note that from_dict is a class method rather than a static method, allowing use of self)
Decorators aid in some of the routine parts of the serialization, such as adding the _fw_name key
Both JSON and YAML file import/export are naturally and concisely supported within the framework.
Auto-detect and proper loading of JSON and YAML files
Proper JSON handling of datetime (both encoding and decoding) and UTF-8 strings
- In some cases, objects can be serialized/deserialized extremely concisely, by use of only
their fw_name (if no parameters are needed to describe the object)
- fireworks.utilities.fw_serializers.DATETIME_HANDLER(obj)¶
- class fireworks.utilities.fw_serializers.FWSerializable¶
Bases:
ABC
To create a serializable object within FireWorks, you should subclass this class and implement the to_dict() and from_dict() methods.
If you want the load_object() implicit de-serialization to work, you must also:
Use the @serialize_fw decorator on to_dict()
Override the _fw_name parameter with a unique key.
See documentation of load_object() for more details.
The use of @classmethod for from_dict allows you to define a superclass that implements the to_dict() and from_dict() for all its subclasses.
For an example of serialization, see the class QueueAdapterBase.
- as_dict()¶
- classmethod from_file(filename, f_format=None)¶
Load a serialization of this object from a file.
- classmethod from_format(f_str, f_format='json')¶
convert from a String representation to its Object.
- property fw_name¶
- to_db_dict()¶
- fireworks.utilities.fw_serializers.get_default_serialization(cls)¶
- fireworks.utilities.fw_serializers.load_object(obj_dict)¶
Creates an instantiation of a class based on a dictionary representation. We implicitly determine the Class through introspection along with information in the dictionary.
We search for a class with the _fw_name property equal to obj_dict[‘_fw_name’] If the @module key is set, that module is checked first for a matching class to improve speed of lookup. Afterwards, the modules in the USER_PACKAGES global parameter are checked.
Refactoring class names, module names, etc. will not break object loading as long as:
the _fw_name property is maintained the same AND
the refactored module is kept within USER_PACKAGES
You can get around these limitations if you really want: i) If you want to change the fw_name of an object you can set the FW_NAME_UPDATES key ii) If you want to put a refactored module in a new place add an entry to USER_PACKAGES
- Parameters:
obj_dict (dict) – the dict representation of the class
- fireworks.utilities.fw_serializers.load_object_from_file(filename, f_format=None)¶
Implicitly load an object from a file. just a friendly wrapper to load_object().
- fireworks.utilities.fw_serializers.reconstitute_dates(obj_dict)¶
- fireworks.utilities.fw_serializers.recursive_deserialize(func)¶
a decorator to add FW serializations keys see documentation of FWSerializable for more details.
- fireworks.utilities.fw_serializers.recursive_dict(obj, preserve_unicode=True)¶
- fireworks.utilities.fw_serializers.recursive_serialize(func)¶
a decorator to add FW serializations keys see documentation of FWSerializable for more details.
- fireworks.utilities.fw_serializers.serialize_fw(func)¶
a decorator to add FW serializations keys see documentation of FWSerializable for more details.
fireworks.utilities.fw_utilities module¶
- class fireworks.utilities.fw_utilities.DataServer(address=None, authkey=None, serializer='pickle', ctx=None, *, shutdown_timeout=1.0)¶
Bases:
BaseManager
Provide a server that can host shared objects between multiprocessing Processes (that normally can’t share data). For example, a common LaunchPad is shared between processes and pinging launches is coordinated to limit DB hits.
- class fireworks.utilities.fw_utilities.NestedClassGetter¶
Bases:
object
Used to help pickle inner classes, e.g. see Workflow.Links When called with the containing class as the first argument, and the name of the nested class as the second argument, returns an instance of the nested class.
- fireworks.utilities.fw_utilities.create_datestamp_dir(root_dir, l_logger, prefix='block_')¶
Internal method to create a new block or launcher directory. The dir name is based on the time and the FW_BLOCK_FORMAT.
- Parameters:
root_dir – directory to create the new dir in
l_logger – the logger to use
prefix – the prefix for the new dir, default=”block_”
- fireworks.utilities.fw_utilities.explicit_serialize(o)¶
- fireworks.utilities.fw_utilities.get_fw_logger(name: str, l_dir: None = None, file_levels: tuple[str, str] = ('DEBUG', 'ERROR'), stream_level: str = 'DEBUG', formatter: ~logging.Formatter = <logging.Formatter object>, clear_logs: bool = False) Logger ¶
Convenience method to return a logger.
- Parameters:
name – name of the logger that sets the groups, e.g. ‘group1.set2’
l_dir – the directory to put the log file
file_levels – iterable describing level(s) to log to file(s). default: (‘DEBUG’, ‘ERROR’)
stream_level – level to log to standard output. default: ‘DEBUG’
formatter – logging format. default: FW_LOGGING_FORMATTER
clear_logs – whether to clear the logger with the same name
- fireworks.utilities.fw_utilities.get_my_host()¶
- fireworks.utilities.fw_utilities.get_my_ip()¶
- fireworks.utilities.fw_utilities.get_slug(m_str)¶
- fireworks.utilities.fw_utilities.log_exception(m_logger, msgs)¶
A shortcut wrapper around log_fancy for exceptions.
- Parameters:
m_logger (logger) – The logger object
msgs ([str]) – String or iterable of Strings, will be joined by newlines
- fireworks.utilities.fw_utilities.log_fancy(m_logger, msgs, log_lvl='info', add_traceback=False) None ¶
A wrapper around the logger messages useful for multi-line logs. Helps to group log messages by adding a fancy border around it, which enhances readability of log lines meant to be read as a unit.
fireworks.utilities.update_collection module¶
- fireworks.utilities.update_collection.update_launchpad_data(lp, replacements, **kwargs) None ¶
If you want to update a text string in your entire FireWorks database with a replacement, use this method. For example, you might want to update a directory name preamble like “/scratch/user1” to “/project/user2”. The algorithm does a text replacement over the entire BSON document. The original collection is backed up within the database with extension “_xiv_{Date}”.
- Parameters:
(LaunchPad) (lp) – a FireWorks LaunchPad object
(dict) (replacements) – e.g. {“old_path1”: “new_path1”, “scratch/”:”project/”}
kwargs – Additional arguments accepted by the update_path_in_collection method
- fireworks.utilities.update_collection.update_path_in_collection(db, collection_name, replacements, query=None, dry_run=False, force_clear=False) None ¶
updates the text specified in replacements for the documents in a MongoDB collection. This can be used to mass-update an outdated value (e.g., a directory path or tag) in that collection. The algorithm does a text replacement over the entire BSON document. The original collection is backed up within the database with extension “_xiv_{Date}”.
- Parameters:
db (Database) – MongoDB db object
collection_name (str) – name of a MongoDB collection to update
replacements (dict) – e.g. {“old_path1”: “new_path1”, “scratch/”:”project/”}
query (dict) – criteria for query, default None if you want all documents to be updated
dry_run (bool) – if True, only a new collection with new paths is created and original “collection” is not replaced
force_clear (bool) – careful! If True, the collection f”{collection}_temp_refactor” is removed!
- Returns:
None, but if dry_run==False it replaces the collection with the updated one
fireworks.utilities.visualize module¶
- fireworks.utilities.visualize.plot_wf(wf, depth_factor=1.0, breadth_factor=2.0, labels_on=True, numerical_label=False, text_loc_factor=1.0, save_as=None, style='rD--', markersize=10, markerfacecolor='blue', fontsize=12) None ¶
Generate a visual representation of the workflow. Useful for checking whether the firework connections are in order before launching the workflow.
- Parameters:
wf (Workflow) – workflow object.
depth_factor (float) – adjust this to stretch the plot in y direction.
breadth_factor (float) – adjust this to stretch the plot in x direction.
labels_on (bool) – whether to label the nodes or not. The default is to label the nodes using the firework names.
numerical_label (bool) – set this to label the nodes using the firework ids.
text_loc_factor (float) – adjust the label location.
save_as (str) – save the figure to the given name.
style (str) – marker style.
markersize (int) – marker size.
markerfacecolor (str) – marker face color.
fontsize (int) – font size for the node label.
- fireworks.utilities.visualize.wf_to_graph(wf: Workflow, dag_kwargs: dict[str, Any] | None = None, wf_show_tasks: bool = True) None ¶
Renders a graph representation of a workflow or firework. Workflows are rendered as the control flow of the firework, while Fireworks are rendered as a sequence of Firetasks.
Copied from https://git.io/JO6L8.
- Parameters:
- Returns:
The Workflow or Firework directed acyclic graph.
- Return type:
Digraph