DirectorySpool

DirectorySpool(
    base_path = .,
    index_path = None,
    preferred_format = None,
    select_kwargs = None,
    merge_kwargs = None,
)

A spool for interacting with DAS files on disk.

FileSpool creates and index of all files then allows for simple querying and bulk processing of the files.

Parameters

Parameter	Description
base_path	The path to the directory to index.
index_path	The path to the index file containing the contents of the directory. By default it will be created in the top-level of the data directory.
preferred_format	A string to specify the format of the data. Specifying this parameter will save time in indexing.
select_kwargs	Dict of keyword arguments to restrict output contents.

Name	Description
chunk	Chunk the data in the spool along specified dimension.
concatenate	Concatenate the patches together.
get_contents	Get a dataframe of the spool contents.
get_patch_names	Generates the default name of patch data.
map	Map a function of all the contents of the spool.
new_from_df	Create a new instance from dataframes.
select	Sub-select parts of the spool.
sort	Sort the Spool based on a specific attribute.
split	Yield sub-patches based on specified parameters.
stack_patches	Stack (add) all patches compatible with first patch together.
update	Updates the contents of the spool, return the updated spool.