2023 CDI Workshop: Open Data for Open Science
The 2023 CDI Workshop was held from May 2-5th, 2023 at the National Conservation Training Center in Shepherdstown, West Virginia. The theme of the workshop was "Open Data for Open Science"
The 2023 Workshop had 394 registrants, 33 breakout sessions, 3 plenaries, 39 posters or demos, and 11 lightning talks. The workshop welcomed both virtual and in-person attendees, with the physical location being the National Conservation Training Center in Shepherdstown, WV.
The online agenda for the workshop is available on Sched.
Breakout Sessions
Title | Session Lead(s) |
Open Data for the Year of Open Science | Viv Hutchison |
ArcGIS Online: Getting Started with Open Data and Initiatives in the ArcGIS Hub | Kristen Hocutt |
Introduction to Posit Team (formerly RStudio Team) | Kaycee Faunce, Jayaram Hariharan |
Develop and Advance a USGS POSSE (Participatory Open Science to Support Engagement) | Sophia B Liu, Nicole Herman-Mercer, Aparna Bamzai, Amanda Cravens |
Highlights from the New USGS Advanced Scientific Computing Strategy & Roadmap | Janice Gordon |
Effective Communication for Open Data | Amanda (Mandie) Carr |
Bringing legacy data into the modern era – with a focus on compliance and best practices | Joe Bard, Marc Hunter |
Where do my data go? To statistics and beyond! | Jacob Malcom, Nicole Athearn |
Fitting square pegs into round holes: challenges in interdisciplinary science information management design | Jess Driscoll |
Earth Science in the AI/ML Age: Leveraging Open Data and Science for Discovery | Neal Pastick, Pete Doucette, Alison Appling |
Shared Software Development Resources | Carl Schroedl |
Moving the Bar on USGS Data Management Planning | Madison Langseth |
Using the Cloud to Navigate the Science Data Lifecycle | Dionne Zoanni, Eric Larson |
Creating national products out of regional studies - processing on a larger scale | Jeanne Jones |
Using the Microsoft Power Platform to enhance information sharing across USGS and potentially the public | Rebecca Bushon, Kenna Butler |
USGS Metadata Transition: mdToolkit and Workflow Innovations | Dennis Walworth |
Imagery as data – A showcase of how imagery can be used in increasing earth process understanding through both data delivery and analysis | Frank Engel, Cian Dawson |
How do we move from data portals towards open science and decision support infrastructure? | Megan Hines, Dionne Zoanni |
Continuous Science Deployment in the Cloud – Using emerging cloud technologies to create living web applications | Kevin Henry |
Making the Leap: Adopting High Performance Computing for Your Science | Kyle Moran |
How to use vocabularies to make your data interoperable | Fran Lightsom |
Overview and updates to the Research Computing data storage infrastructure. | Jeff Falgout, Drew Ignizio |
USGS Basic ISO Metadata Content Standards | Dennis Walworth |
Reproducible Data Pipelines in R: what are they, how to use them, and a hands on example using dataRetrieval and targets | Julie Padilla |
High throughput data transformation on the cloud | Jeremy Newson |
Open Science Napkin Drawings | Madison Langseth, Leslie Hsu |
Posters and Demos
Title | Presenter(s) |
Metadata development for the USGS National Water Census (and beyond!) | Megan Hines |
PyHAT Demo | Itiya Aneece |
Informing the use of native plant materials in restoration and rehabilitation with the Native Plant Seed Mapping Toolkit | Catilin Andrews, Rob Massatti |
An open-source process for maintaining a catalog | Leslie Hsu, Brandon Serna |
PyHAT: An Open-Source Machine Learning tool for Earth and Planetary Spectral Datasets | Itiya Aneece |
Increasing data accessibility and cross-community use through the GHISA visualization app | Itiya Aneece |
The United States Geological Survey Science Data Catalog | Chirag Shah, Lisa Zolly |
State of the Data: how FAIR are USGS data? | Tamar Norkin |
ZenRiver Game Concept: Accelerating Creation of Machine Learning Imagery Training Datasets Using Citizen Science | Frank Engle, Joe Federer |
Generalization quality assessment tools for geospatial data | Ethan Shavers |
"ChesBay 24k": A Framework for Summarizing Landscape Data in the Chesapeake Bay Watershed and Beyond | Benjamin Gressler |
Evaluation and recommendation of practices for publication of reproducible data and software releases in the USGS | Julie Padilla |
Standardizing and Sharing Data for Chesapeake Studies | Stephanie Gordon |
Datablasted: Can we refine publication pathways of imagery? | Jin-Si Over |
Integrating continuous and categorical data to improve streamflow prediction | Konrad Hafen |
Automating Sensor-to-Cloud Workflows for Riparian Resources in Grand Canyon | Joseph Thomas |
Connecting with our Stakeholders - Developing a Better Understanding of Use and Usability for Science Products | Dawn Kotowicz, Kevin Henry |
USGS Alaska Science Center Data Mangement Team: Promoting Science through Data | Laura McDuffie |
A Tool for Rapid-Repeat High-Resolution Coastal Vegetation Maps to Improve Forecasting of Hurricane Impacts and Coastal Resilience | Kristin Byrd |
Extracting data from maps: Applying lessons learned from the AI for Critical Mineral Assessment Competition | Margaret Goldman |
NPSdataverse: Metadata authoring and data publication & access at NPS | Rob Baker |
Managing Data Packages in the NPS DataStore Repository | Kristin Vanderbilt |
Linking Orphaned Oil & Gas Wells with Groundwater Quality | Nick Gianoutsos |
Communicating stream fish vulnerability to climate change | Taylor Woods |
USGS Hydrography Markup Tools -- Origins and Future Directions | Marcelle Caturia |
Hosting Published USGS Datasets - A Glance at ScienceBase | Drew Ignizio |
USGS Enterprise Information Management and Technology Resources for Scientists | Jennifer Erxleben, Jessica Driscoll |
Are U.S. Geological Survey researchers citing their data? | Madison Langseth, Grace Donovan |
Evolution of the North Pacific Pelagic Seabird Database | Marla Hood |
Geonarratives & Small Application Hosting at USGS | Milan Liu |
USGS Data Strategy: An Opportunity to Engage | Viv Hutchison |
Everglades Tree Island Indicator: Lessons Learned for Increasing Inclusivity | Saira Haider |
Leveraging Existing USGS Streamgage Data to Map Flood-Prone Areas | Labeeb Ahmed |
USGS Science Data Lifecycle in the Cloud | Dionne Zoannie, Eric Larson, Courtney Neu |
Welcome to our community of Earth science data & technology enthusiasts! | Annnie Burgess, Megan Carter |
Towards open geophysics: The Geophysical Survey (GS) Data Standard and GSPy Toolbox | Burke Minsley |
Community Engagement through Virtual Workshops: Lessons Learned from the Imagery Data Collaboration Area | Cian Dawson |
Augmented Reality High Water Mark | Daniel Beckman |
Cloud-hosted Web Applications | Peter Ng |
Lightning Talks
-
Annie Burgess: Welcome to our community of Earth science data & technology enthusiasts!
-
Birgit Peterson: Seeing the Forest in the Clouds: Developing application-ready 3D products from lidar and other remotely sensed data
-
Burke Minsley: Towards open geophysics: The Geophysical Survey (GS) data standard and GSPy toolbox
-
Cassandra Ladino: The Science Behind Business at USGS
-
Judd Patterson: NPS Data Packages
-
Sarah Wright: R packages for long-term monitoring data
-
Margaret Goldman: Using AI competitions to advance automation at the USGS
-
Nick Sievert: CreelCat: A database and application for inland creel and angler survey data=
-
Sophia B Liu: Participatory Open Science at USGS: Foundational Overview Towards a Strategic Vision
-
Viv Hutchison: The USGS Data Strategy: You are invited to engage in its implementation!
-
Lindsay Powers: The Geological Materials Repository in oratory
Go back to CDI 2023 Activities.