Dataset

The dataset configuration section is the core of data input and data configuration. Here we define where all observed data is stored, define response functions paths for waveforms, blacklist stations or load phase arrival picks. Finally the Dataset object is managing all available data.

A basic dataset section of a Grond configuration file (gronf).
# -----------------------------------------------------------------------------
# Configuration section for dataset (input data)
#
# The placeholder '${event_name}' will be expanded to the current event. This
# enables us to use the same configuration for multiple events. The available 
# events are detected by looking into possible expansions of
# dataset_config.events_path
# -----------------------------------------------------------------------------

dataset_config: !grond.DatasetConfig

  # Optional prefix for all paths in this config file
  path_prefix: .

  # File with hypocenter information and possibly reference solution
  events_path: 'data/events/${event_name}/event.txt'

  # List of files with station coordinates.
  stations_stationxml_paths: 
  - 'data/events/${event_name}/waveforms/stations.geofon.xml'
  - 'data/events/${event_name}/waveforms/stations.iris.xml'

  # List of files with instrument response information (can be the same as in
  # stations_stationxml_paths above)
  responses_stationxml_paths:
  - 'data/events/${event_name}/waveforms/stations.geofon.xml'
  - 'data/events/${event_name}/waveforms/stations.iris.xml'

  # List of directories with raw waveform data
  waveform_paths: ['data/events/${event_name}/waveforms/raw']

  # List of stations/components to be excluded according to their STA, NET.STA,
  # NET.STA.LOC, or NET.STA.LOC.CHA codes
  blacklist: ['GE.UGM', 'GE.PLAI']

  # List of files with additional exclusion lists (one entry per line, same 
  # format as above)
  blacklist_paths:
  - 'data/events/${event_name}/waveforms/blacklist.txt'

General configuration and templating

All folder and file paths in the dataset support templating and prefixing:

events_path
File with hypocenter information and possibly reference solution.
path_prefix
defines a prefix which is prepended to all paths in the configuration.
${event_name}
will be substituted with the event name defined in your events_path file.

Waveform data

Usually raw, unrestituted waveforms are loaded into Grond, together with StationXML data describing the station location and response function - Grond will take care of proper restitution.

waveform_paths
List of directories with raw waveform data.
stations_stationxml_paths
List of files with station coordinates in StationXML format. Inventory datasets have to be converted to StationXML. Use ObsPy’s Inventory for conversion: inventory.write('out-stationxml.xml', format='stationxml').
stations_path
List of files with station coordinates in Pyrocko format.
extend_incomplete
Extend incomplete seismic traces: true/false.
clippings_path
Pyrocko marker file indicating where a seismic trace is masked.
responses_stationxml_paths
List of StationXML response files for restitution of the raw waveform data.
responses_sacpz_path
Path to SACPZ response files for restitution of the raw waveform data.
station_corrections_path
File containing station correction informations. See example station corrections.
apply_correction_factors
Apply the correction factors from station corrections: true/false.
apply_correction_delays
Apply the correction delays from station corrections: true/false.
picks_paths
List of phase picks in Pyrocko format.
blacklist
List of stations/components to be excluded according to their STA, NET.STA, NET.STA.LOC, or NET.STA.LOC.CHA codes
blacklist_paths
List of text files with blacklisted stations in NSLC pattern.
whitelist

List of stations/components to be included according to their STA, NET.STA, NET.STA.LOC, or NET.STA.LOC.CHA codes

Note: when whitelisting on channel level, both, the raw and the processed channel codes have to be listed.

whitelist_paths
List of text files with whitelisted stations in NSLC pattern.
synthetic_test
Run a synthetic test: true/false

Satellite data

Unwrapped static surface displacements have to be prepared in Kite format.

kite_scene_paths
List of folders where pre-processed Kite surface displacement scenes are stored.

GNSS campaign data

Single measurements of surface displacement data from GNSS campaigns can be loaded from YAML text files.

gnss_campaign_paths
List of folders where GNSS data of static surface displacements are stored.