Skip to main content
U.S. flag

An official website of the United States government

Endless forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks

July 23, 2019

Accurate planktonic foraminiferal species identification is central to many paleoceanographic studies, from selecting specific species for geochemical research to elucidating the biotic dynamics of microfossil communities relevant to physical oceanographic processes and interconnected phenomena such as climate change. However, species identification varies among taxonomic schools, few resources exist to train students in the difficult task of discerning amongst closely related species, and the number of taxonomic experts is limited. Here, we take the first steps towards removing these rate-limiting steps by generating the first extensive image library of modern planktonic foraminifera, providing taxonomic training tools and resources, and automating species-level taxonomic identification of planktonic foraminifera via machine learning using convolution neural networks. Taxonomic experts identified 34,640 images of modern planktonic foraminifera to the species level. These images are served as species exemplars through the online portal Endless Forams (endlessforams.org) and a taxonomic training portal hosted on the citizen science platform Zooniverse (zooniverse.org/projects/ahsiang/endless-forams/). A supervised machine learning classifier was then trained with more than 24,000 images of planktonic foraminifera and tested using the remaining ~10,000 images (i.e., the validation set). The best classifier provided the correct species name for an image in the validation set 87.4% of the time. Together, these resources provide a rigorous set of training tools in modern planktonic foraminiferal taxonomy and a means of rapidly generating assemblage data via machine learning in future studies.

Publication Year 2019
Title Endless forams: >34,000 modern planktonic foraminiferal images for taxonomic training and automated species recognition using convolutional neural networks
DOI 10.1029/2019PA003612
Authors Allison Y. Hsiang, Anieke Brombacher, Marina Costa Rillo, Maryline J. Mleneck-Vautravers, Stephen Connett, Sian Lordsmith, Anna Jentzen, Michael J. Henehan, Brett Metcalfe, Isabel Fenton, Bridget Wade, Lyndsey Fox, Julie Meilland, Catherine V. Davis, Ulrike Baranowski, Jeroen Groeneveld, Kirsty M. Edgar, Aurore Movellan, Tracy Aze, Harry J. Dowsett, Giles Miller, Nelson Rios, Pincelli M. Hull
Publication Type Article
Publication Subtype Journal Article
Series Title Paleoceanography and Paleoclimatology
Index ID 70204891
Record Source USGS Publications Warehouse
USGS Organization Eastern Geology and Paleoclimate Science Center