Skip to main content
U.S. flag

An official website of the United States government

GSPy: A new toolbox and data standard for Geophysical Datasets

August 22, 2022

The diversity of geophysical methods and datatypes, as well as the isolated nature of various specialties (e.g., electromagnetic, seismic, potential fields) leads to a profusion of separate data file formats and documentation conventions. This can hinder cooperation and reduce the impact of datasets researchers have invested in heavily to collect and prepare. An open, portable, and well-supported community data standard could greatly improve the interoperability, transferability, and long-term archival of geophysical data. Airborne geophysical methods particularly need an open and accessible data standard, and they exemplify the complexity that is common in geophysical datasets where critical auxiliary information on the survey and system parameters are required to fully utilize and understand the data. Here, we propose a new Geophysical Standard, termed the GS convention, that leverages the well-established and widely used NetCDF file format and builds on the Climate and Forecasts (CF) metadata convention. We also present an accompanying open-source Python package, GSPy, to provide methods and workflows for building the GS-standardized NetCDF files, importing and exporting between common data formats, preparing input files for geophysical inversion software, and visualizing data and inverted models. By using the NetCDF format, handled through the Xarray Python package, and following the CF conventions, we standardize how metadata is recorded and directly stored with the data, from general survey and system information down to specific variable attributes. Utilizing the hierarchical nature of NetCDF, GS-formatted files are organized with a root Survey group that contains global metadata about the geophysical survey. Data are then organized into subgroups beneath Survey and are categorized as Tabular or Raster depending on the geometry and point of origin for the data. Lastly, the standard ensures consistency in constructing and tracking coordinate reference systems, which is vital for accurate portability and analysis. Development and adoption of a NetCDF-based data standard for geophysical surveys can greatly improve how these complex datasets are shared and utilized, making the data more accessible to a broader science community. The architecture of GSPy can be easily transferred to additional geophysical datatypes and methods in future releases.

Publication Year 2022
Title GSPy: A new toolbox and data standard for Geophysical Datasets
DOI 10.3389/feart.2022.907614
Authors Stephanie R. James, Nathan Leon Foks, Burke J. Minsley
Publication Type Article
Publication Subtype Journal Article
Series Title Frontiers in Earth Science - Environmental Informatics and Remote Sensing
Index ID 70255669
Record Source USGS Publications Warehouse
USGS Organization Geology, Geophysics, and Geochemistry Science Center