Data
Browse data and tools on the USGS website that have a connection to the Community for Data Integration.
National-scale dataset of strontium isotope compositions in environmental materials for the United States
This database contains published strontium isotopic ratios (87Sr/86Sr) and associated sample collection data for strontium dissolved in environmental waters for the United States (conterminous, Hawaii, Alaska, and coastal seawater). Samples included in this version were collected and analyzed between 1963 and 2022.
Urban tree cover provides consistent mitigation of extreme heat in arid but not humid cities - data release
Urban land cover types influence urban microclimates. However, recent work indicates the magnitude of land cover’s microclimate influence is affected by aridity. Moreover, this variation in cooling and warming potentials of urban land cover types can substantially alter the exposure of urban areas to extreme heat. Our goal is to understand both the relative influences of urban land...
Measurements of Water Quality Constituents in Groundwater Within 1 Mile (1.61 km) of Orphaned Wells in the United States
This is a combined dataset from the USGS Orphaned Well Dataset (Grove and Merrill, 2022) and publicly available data from the USGS National Water Information System, NWIS, obtained via the Water Quality Portal using the USGS Python dataretrieval library. This dataset is composed of water quality measurements from groundwater sites located within 1 mile of the locations of unplugged...
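The 1-mile proximity criterion described above can be illustrated with a minimal sketch. This is not the data release's actual processing code: the function names, the tuple layout for sites and wells, and the use of a great-circle (haversine) distance are all assumptions made for illustration, and the retrieval step through the Water Quality Portal is not shown.

```python
import math

MILE_KM = 1.609344  # one statute mile in kilometers (the listing rounds this to 1.61 km)
EARTH_RADIUS_KM = 6371.0088  # mean Earth radius; an assumption of the spherical model


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points given in decimal degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def sites_near_wells(sites, wells, radius_km=MILE_KM):
    """Keep sites lying within radius_km of at least one well.

    sites: list of (site_id, lat, lon) tuples (hypothetical layout)
    wells: list of (lat, lon) tuples (hypothetical layout)
    """
    return [
        site
        for site in sites
        if any(haversine_km(site[1], site[2], w[0], w[1]) <= radius_km for w in wells)
    ]


# Made-up coordinates, for illustration only.
example_wells = [(40.0, -100.0)]
example_sites = [("A", 40.01, -100.0), ("B", 40.02, -100.0)]
nearby = sites_near_wells(example_sites, example_wells)
```

The brute-force comparison of every site against every well is adequate for small inputs; a spatial index would likely be preferable at national scale.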
Questions and responses to USGS-wide poll on quality assurance practices for timeseries data, 2021
This data record contains questions and responses to a USGS-wide survey conducted to identify issues and needs associated with quality assurance and quality control (QA/QC) of USGS timeseries data streams. This research was funded by the USGS Community for Data Integration as part of a project titled “From reactive- to condition-based maintenance: Artificial intelligence for anomaly...
Machine learning with satellite imagery to document the historical transition from topographic to dense sub-surface agricultural drainage networks (tile drains)
Image library of (1) tile-drained landscapes and (2) tile-drain types for use in a machine-learning model workflow that (1) identifies tile-drained landscapes and (2) differentiates two types of tile-drained areas visible in satellite imagery. These images were sourced from WorldView and Quickbird satellite imagery (copyright DigitalGlobe) and cropped to features of interest...
USGS Geochron: A Database of Geochronological and Thermochronological Dates and Data (ver. 3.0, May 2024)
USGS Geochron is a database of geochronological and thermochronological dates and data. The data set contains published ages, dates, analytical information, sample metadata including location, and source citations. The following analytical techniques are represented in the data set: 40Ar/39Ar, K-Ar, U-Th-Pb, Sm-Nd, Rb-Sr, Lu-Hf, fission track, and luminescence. This data set incorporates...
Brook trout imagery data for individual recognition with deep learning
This Data Release provides imagery data for the development of deep-learning models to recognize individual brook trout (n=435). Images were collected at the Paint Bank State Fish Hatchery (Paint Bank, VA) on August 9, 2021 using a GoPro Hero 9 camera mounted approximately 50 cm above a fish board. The Paint Bank State Fish Hatchery is operated by the Virginia Department of Wildlife...
Landslide Inventories across the United States (ver. 2.0, June 2022)
Landslides are damaging and deadly, and they occur in every U.S. state. However, our current ability to understand landslide hazards at the national scale is limited, in part because spatial data on landslide occurrence across the U.S. varies greatly in quality, accessibility, and extent. Landslide inventories are typically collected and maintained by different agencies and institutions...
Coast Train--Labeled imagery for training and evaluation of data-driven models for image segmentation
Coast Train is a library of images of coastal environments, annotations, and corresponding thematic label masks (or 'label images') collated for the purposes of training and evaluating machine learning (ML), deep learning, and other models for image segmentation. It includes image sets from both geospatial satellite, aerial, and UAV imagery and orthomosaics, as well as non-geospatial...
Metadata standards for Magnetotelluric Time Series Data
Magnetotellurics (MT) is an electromagnetic geophysical method that is sensitive to variations in subsurface electrical resistivity. Measurements of natural electric and magnetic fields are made in the time domain, where instruments can record for a couple of hours up to multiple months, resulting in data sets on the order of gigabytes. The principles of findability, accessibility...
Annotated fish imagery data for individual and species recognition with deep learning
We provide annotated fish imagery data for use in deep learning models (e.g., convolutional neural networks) for individual and species recognition. For individual recognition models, the dataset consists of annotated .json files of individual brook trout imagery collected at the Eastern Ecological Science Center's Experimental Stream Laboratory. For species recognition models, the...
Flow-Conditioned Parameter Grids for the Contiguous United States: A Pilot, Seamless Basin Characteristic Dataset
To aid in parameterization of mechanistic, statistical, and machine learning models of hydrologic systems in the contiguous United States (CONUS), flow-conditioned parameter grids (FCPGs) have been generated describing upstream basin mean elevation, slope, land cover class, latitude, and 30-year climatologies of mean total annual precipitation, minimum daily air temperature, and...