Skip to main content
U.S. flag

An official website of the United States government

2021 CDI Workshop: Designing Data-Intensive Science

The 2021 Community for Data Integration Workshop was held May 25-29, 2021 in a virtual format. The theme of the workshop was "Designing Data-Intensive Science."

The 2021 Workshop had 483 registrants, 24 breakout sessions, 3 plenaries, 32 posters or demos, and 14 lightning talks. The completely virtual format allowed a larger number of participants than previous in-person CDI workshops.

The online agenda for the workshop is available on Sched.

Breakout Sessions 

  • A fun, fast hands-on introduction to the user-centered design process, Joe Bard, Sophie Hou, Rachel Volentine 
  • Advanced Scientific Computing in the USGS, Janice Gordon, Courtney Neu 
  • Assessing the Value and Usage of USGS Data Management Plans, Grace Donovan, Amanda Liford, Madison Langseth, Elizabeth Sellers 
  • Cleaning data with OpenRefine, Ricardo McClees-Funinan 
  • Cloud Optimized File Formats: What's new in ScienceBase and Strategies for Data Managers, Drew Ignizio, Rich Signell 
  • Data-Intensive Science in Action at the USGS, Rich Signell, Jeanne Jones, Peter Ng, Kevin Henry, Jamie Jones, Chris Skinner, Jason Kreitler, Ben Sleeter 
  • Effective Communication, Juliana Casavan, Claire Stirm 
  • Globus 101: Introduction and Basics to Moving and Sharing Data with Globus, Jeff Falgout, Janice Gordon, Vas Vasiliadis 
  • How to talk to your data manager/scientist – Breaking the ice, Madison Langseth, Jason Ferrante, Tara Bell, Matt Cannister, Kris Jaeger, Sue Kemp 
  • Integrated modeling at the USGS - what do we need?, Leslie Hsu, Christie Hegermiller, Brandon Serna, Catherine Jarnevich, Anne Wein 
  • Making Connections through Data Integration, Shayne Urbanowski, Amber Kremer, Madison Fung, Rich Signell, Becca Scully, Genevieve Barron, Tatyana DiMascio 
  • Market & Audience Assessment, Juliana Casavan and Claire Stirm 
  • Online Imagery Data Storage and Release: Current State of the Science and Future Directions, Seth Ackerman, Joe Adams, Sandy Brosnahan, Evan Dailey, Cian Dawson, Frank Engel, Dennis Walworth, Ben Letcher, Chris Gazoorian, Jon Warrick, Anthony Fischbach 
  • R Workshop: accessing the USGS National Map and making 3D maps with terrainR, Mike Mahoney, Colleen Nell, Lindsay Platt 
  • Reclamation Use of Data Management and Science for Water Resource Applications, Allison Odell, James Nagode, Kenneth Richard, Ken Nowak, Katie Holman, Lindsay Bearup, Drew Loney.  
  • Records Management: Winds of Change, Matt Arsenault, Chris Bartlett, Ed Olexa, Larry Reedy 
  • Semantic Web 101, Fran Lightsom, Brandon Whitehead, Scott Peckham, Ken Bagstad 
  • The Cloud in Action – How Centers are using Cloud Hosting Solutions for Data-intensive Workflows & Running Scientific Models, Kirsty Haynie, Mike Hearne, Eric Larson, Heather Schovanec, Dionne Zoanni, Cory Overton, Jeremy Fee, Tony Butzer and Stefanie Kagone, Rich Signell, Tarandeep Kalra, Sam Congdon, Courtney Neu 
  • The fundamentals of design for scientific data visualization, Ellen Bechtel, Sophie Hou, Ben Letcher, Colleen Nell, Amy Puls, Katherine Trickey, Dionne Zoanni 
  • Updates to the Data Storage and Transport Ecosystem for Research Computing, Jeff Falgout, Janice Gordon 
  • USGS Cloud Hosting Solutions - Advancing 21st Century Science, Jennifer Erxleben, Eric Larson, Dionne Zoanni, Courtney Neu, Robert Shepherd 
  • USGS Roadmap to enable FAIR Principles, Wade Bishop, Viv Hutchison, Fran Lightsom, Dave Govoni, Linda Debrewer 
  • USGS Shared Software Resources, Carl Schroedl 
  • Using AI/ML to advance USGS Science, Pete Doucette, Eric Larson, JC Nelson, Matt Kuckuk, Jeff Tracey, Freddie Kalaitzis 

Posters and Demos

  • Modernizing sensor data workflows to leverage Internet of Things (IoT) and cloud-based technologies, Caitlin Andrews
  • Retrieving data. Wait a few seconds and try to cut or copy again, Itiya Aneece
  • Semantics and machine reasoning enable FAIR, web-based data and model integration, Kenneth Bagstad
  • From reactive- to condition-based maintenance: artificial intelligence for anomaly prediction in time-series data and operational decision-making, Matthew Cashman
  • USGS Markup Application: Supporting User-Driven Improvements to Hydrography Data​, Marcelle Caturia
  • Colorado River Basin EarthMAP Implementation, Katharine Dahm
  • Transforming Data Representations to Improve Computational Performance, David Donato
  • Making USGS/NOAA Total Water Level and Coastal Change Forecast data accessible through user-friendly interfaces, Kara Doran
  • What's New in Cloud Hosting Solutions?, Jennifer Erxleben
  • Delivering the North American tree-ring fire history network through a web application and an R package, Chris Guiterman
  • Central Energy Resources Science Center Data Management Services Project Overview, Gregory Gunther
  • Landsat-derived fire history metrics to provide critical information for prioritizing prescribed fire across the Southeast, Todd Hawbaker
  • Let’s Chat about Usability!, Sophie Hou
  • USGS Model Catalog, Leslie Hsu
  • Efficiently Accessing Large Earth Imagery Datasets Using the Meta Raster Format (MRF) and AWS Serverless Architecture, Liz Huselid
  • Meet the SAS Science Data Management Team!, Viv Hutchison
  • USGS State of the Data, Viv Hutchison
  • The Definition of Analysis-ready Data in USGS, Viv Hutchison
  • USMIN – delivering critical mineral data for the U.S., Nick Karl
  • Diversity and Inclusion Resources at USGS, Kim Kloecker
  • A Fire-Aware Stream Application to Integrate USGS Fire and Water Databases, Katharine Kolb
  • The Standalone Data Dictionary: A More Robust Approach in Documenting the Entity and Attributes for Data, Raymond Obuch
  • Diverse data to improve Southwest fire forecasts: Joining novel remote sensing, post-fire dynamics, and intra-annual precipitation patterns, Michala Phillips
  • Advancing Post-Fire Debris Flow Hazard Science with a Field Deployable Mapping Tool, Francis Rengers
  • Development of a web-based tool for coastal water resources management, Tara Root
  • The Wildfire Trends Tool: A data visualization and analysis tool to meet land management needs and facilitate scientific inquiry, Douglas Shinneman
  • Utilization of Google Earth Engine to Examine Surface Water Inundation Patterns in California Croplands, Britt Smith
  • Solar and sensor geometry, not vegetation response, drive satellite NDVI phenology in widespread ecosystems of the western United States, Jessica Walker
  • Remote sensing strategies for invasive species management, Cynthia Wallace
  • GIS Clipping and Summarization Tool for Points, Lines, Polygons, and Rasters, Justin Welty
  • Coast Train - Massive Library of Labeled Coastal Images for Machine Learning Applications, Phillipe Wernette
  • Visualizing Science using Python Dash, Daniel Wieferich

Lightning Talks 

  • Semantics and machine reasoning enable FAIR, web-based data and model integration, Kenneth Bagstad 

  • Coast Train - Massive Library of Labeled Coastal Images for Machine Learning Applications, Phillipe Wernette 

  • From reactive- to condition-based maintenance: artificial intelligence for anomaly prediction in time-series data and operational decision-making, Matthew Cashman 

  • USGS Markup Application: Supporting User-Driven Improvements to Hydrography Data, Marcelle Caturia 

  • Making USGS/NOAA Total Water Level and Coastal Change Forecast data accessible through user-friendly interfaces, Kara Doran 

  • USGS Model Catalog, Leslie Hsu 

  • Efficiently Accessing Large Earth Imagery Datasets Using the Meta Raster Format (MRF) and AWS Serverless Architecture, Liz Huselid 

  • A Fire-Aware Stream Application to Integrate USGS Fire and Water Databases, Katharine Kolb 

  • The Standalone Data Dictionary: A More Robust Approach in Documenting the Entity and Attributes for Data, Raymond Obuch 

  • Diverse Data to Improve Southwest Fire Forecasts: Joining Novel Remote Sensing, Post-fire Dynamics, and Intra-annual Precipitation Patterns, Sasha Reed 

  • Development of a web-based tool for coastal water resources management, Tara Root 

  • Utilization of Google Earth Engine to Examine Surface Water Inundation Patterns in California Croplands, Britt Smith 

  • Remote sensing strategies for invasive species management, Cynthia Wallace 

  • Visualizing Science Using Python Dash, Daniel Wieferich 

Go back to CDI 2021 Activities.