pyrocko.squirrel.selection¶
Meta-data caching for flexible file selections.
Classes
|
Database backed file selection (base class for |
- class Selection(database, persistent=None)[source]¶
Bases:
object
Database backed file selection (base class for
Squirrel
).- Parameters:
A selection in this context represents the list of files available to the application. Instead of using
Selection
directly, user applications should usually use its subclassSquirrel
which adds content indices to the selection and provides high level data querying.By default, a temporary table in the database is created to hold the names of the files in the selection. This table is only visible inside the application which created it. If a name is given to
persistent
, a named selection is created, which is visible also in other applications using the same database.Besides the filename references, desired content kind masks and file format indications are stored in the selection’s database table to make the user choice regarding these options persistent on a per-file basis. Book-keeping on whether files are unknown, known or if modification checks are forced is handled in the selection’s file-state table.
Paths of files can be added to the selection using the
add()
method and removed withremove()
.undig_grouped()
can be used to iterate over all content known to the selection.- is_new()[source]¶
Is this a new selection?
Always
True
for non-persistent selections. OnlyFalse
for a persistent selection which already existed in the database when the it was initialized.
- add(paths, kind_mask=255, format='detect', show_progress=True)[source]¶
Add files to the selection.
- Parameters:
paths (iterator yielding
str
objects) – Paths to files to be added to the selection.kind_mask (
int
(bit mask)) – Content kinds to be added to the selection.format (str) – File format identifier or
'detect'
to enable auto-detection (available:'datacube'
,'mseed'
,'pyrocko_events'
,'pyrocko_stations'
,'sac'
,'spickle'
,'stationxml'
,'tdms_idas'
,'virtual'
,'yaml'
).
- iter_paths(raw=False)[source]¶
Iterate over all file paths currently belonging to the selection.
- Parameters:
raw (bool) – By default absolute paths are yielded. Set to
True
to yield the path as it is stored in the database, which can be relative or absolute, depending on whether the file is within a Squirrel environment or outside.- Yields:
File paths.
- get_paths(raw=False)[source]¶
Get all file paths currently belonging to the selection.
- Parameters:
raw (bool) – By default absolute paths are returned. Set to
True
to return the path as it is stored in the database, which can be relative or absolute, depending on whether the file is within a Squirrel environment or outside.- Returns:
List of file paths.
- undig_grouped(skip_unchanged=False)[source]¶
Get inventory of cached content for all files in the selection.
- Parameters:
skip_unchanged (bool) – If
True
only inventory of modified files is yielded (flag_modified()
must be called beforehand).
This generator yields tuples
((format, path), nuts)
wherepath
is the path to the file,format
is the format assignation or'detect'
andnuts
is a list ofNut
objects representing the contents of the file.
- flag_modified(check=True)[source]¶
Mark files which have been modified.
- Parameters:
check (bool) – If
True
query modification times of known files on disk. IfFalse
, only flag unknown files.
Assumes file state is 0 for newly added files, 1 for files added again to the selection (forces check), or 2 for all others (no checking is done for those).
Sets file state to 0 for unknown or modified files, 2 for known and not modified files.