Dataset¶
The dataset configuration section is the core of data input and data configuration. Here we define where all observed data is stored, define response functions paths for waveforms, blacklist stations or load phase arrival picks. Finally the Dataset
object is managing all available data.
# -----------------------------------------------------------------------------
# Configuration section for dataset (input data)
#
# The placeholder '${event_name}' will be expanded to the current event. This
# enables us to use the same configuration for multiple events. The available
# events are detected by looking into possible expansions of
# dataset_config.events_path
# -----------------------------------------------------------------------------
dataset_config: !grond.DatasetConfig
# Optional prefix for all paths in this config file
path_prefix: .
# File with hypocenter information and possibly reference solution
events_path: 'data/events/${event_name}/event.txt'
# List of files with station coordinates.
stations_stationxml_paths:
- 'data/events/${event_name}/waveforms/stations.geofon.xml'
- 'data/events/${event_name}/waveforms/stations.iris.xml'
# List of files with instrument response information (can be the same as in
# stations_stationxml_paths above)
responses_stationxml_paths:
- 'data/events/${event_name}/waveforms/stations.geofon.xml'
- 'data/events/${event_name}/waveforms/stations.iris.xml'
# List of directories with raw waveform data
waveform_paths: ['data/events/${event_name}/waveforms/raw']
# List of stations/components to be excluded according to their STA, NET.STA,
# NET.STA.LOC, or NET.STA.LOC.CHA codes
blacklist: ['GE.UGM', 'GE.PLAI']
# List of files with additional exclusion lists (one entry per line, same
# format as above)
blacklist_paths:
- 'data/events/${event_name}/waveforms/blacklist.txt'
General configuration and templating¶
All folder and file paths in the dataset support templating and prefixing:
events_path
- File with hypocenter information and possibly reference solution.
path_prefix
- defines a prefix which is prepended to all paths in the configuration.
${event_name}
- will be substituted with the event name defined in your
events_path
file.
Waveform data¶
Usually raw, unrestituted waveforms are loaded into Grond, together with StationXML data describing the station location and response function - Grond will take care of proper restitution.
waveform_paths
- List of directories with raw waveform data.
stations_stationxml_paths
- List of files with station coordinates in StationXML format.
Inventory datasets have to be converted to StationXML. Use ObsPy’s
Inventory
for conversion:inventory.write('out-stationxml.xml', format='stationxml')
. stations_path
- List of files with station coordinates in Pyrocko format.
extend_incomplete
- Extend incomplete seismic traces:
true
/false
. clippings_path
- Pyrocko marker file indicating where a seismic trace is masked.
responses_stationxml_paths
- List of StationXML response files for restitution of the raw waveform data.
responses_sacpz_path
- Path to SACPZ response files for restitution of the raw waveform data.
station_corrections_path
- File containing station correction informations. See
example station corrections
. apply_correction_factors
- Apply the correction factors from station corrections:
true
/false
. apply_correction_delays
- Apply the correction delays from station corrections:
true
/false
. picks_paths
- List of phase picks in Pyrocko format.
blacklist
- List of stations/components to be excluded according to their STA, NET.STA, NET.STA.LOC, or NET.STA.LOC.CHA codes
blacklist_paths
- List of text files with blacklisted stations in NSLC pattern.
whitelist
List of stations/components to be included according to their
STA
,NET.STA
,NET.STA.LOC
, orNET.STA.LOC.CHA
codesNote: when whitelisting on channel level, both, the raw and the processed channel codes have to be listed.
whitelist_paths
- List of text files with whitelisted stations in NSLC pattern.
synthetic_test
- Run a synthetic test:
true
/false