Assisting human annotation of marine images with foundation models

Orenstein, Eric C.; Woodward, Benjamin; Lundsten, Lonny; Barnard, Kevin; Schlining, Brian; Katija, Kakani

Orenstein, Eric C.; Woodward, Benjamin; Lundsten, Lonny; Barnard, Kevin; Schlining, Brian; Katija, Kakani. 2025 Assisting human annotation of marine images with foundation models. Frontiers in Marine Science, 12. 10.3389/fmars.2025.1469396


Abstract

Marine scientists have been leveraging supervised machine learning (ML) algorithms to analyze image and video data for nearly two decades. There have been many advances, but the cost of generating expert human annotations to train new models remains extremely high. There is broad recognition in both computer and domain sciences that generating training data remains the major bottleneck when developing ML models for targeted tasks. Increasingly, computer scientists are not attempting to produce highly optimized models from general annotation frameworks, instead focusing on adaptation strategies to tackle new data challenges. Taking inspiration from large language models, computer vision researchers are now thinking in terms of “foundation models” that can yield reasonable zero- and few-shot detection and segmentation performance with human prompting. Here we consider the utility of this approach for ocean imagery, leveraging Meta’s Segment Anything Model to enrich ocean image annotations based on existing labels. This workflow yields promising results, especially for modernizing existing data repositories. Moreover, it suggests that future human annotation efforts could use foundation models to speed progress toward a training set sufficient to address domain-specific problems.
