Explore open access research and scholarly works from NERC Open Research Archive

Advanced Search

Optimizing plankton image classification with metadata-enhanced representation learning

Masoudi, Mojtaba; Giering, Sarah L.C. ORCID: https://orcid.org/0000-0002-3090-1876; Eftekhari, Noushin; Massot-Campos, Miquel; Irisson, Jean-Olivier; Thornton, Blair. 2024 Optimizing plankton image classification with metadata-enhanced representation learning. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing. 1-18. 10.1109/JSTARS.2024.3424498

Abstract
Automated camera-based sensors are widely used in vessel-based research to monitor plankton and marine particles. However, current methods suffer from the costly and time-consuming requirement of annotating data for fully supervised learning, especially in plankton grouping tasks characterized by long-tailed datasets. In response, we propose a novel self-supervised learning (SSL) framework that significantly reduces reliance on expensive human annotations by leveraging crucial metadata such as water depth and location. The method comprises three major steps: self-supervised training, innovative sampling, and final classification. It identifies key sample subsets from an unlabelled dataset using hierarchical clustering approach and incorporates an innovative balancing representative subsampling strategy that addresses the challenge of dataset imbalance and enhances generalisability across diverse plankton classes. Our approach prioritises discerning representation features observed in images that exhibit correlations with the patterns found in their associated metadata. Furthermore, our method introduce a novel grouping based on visual perspective selection method, enabling the identification of balanced subset views that depart from traditional class-based categorisation. Our experimental results showcase a significant enhancement in image classification accuracy, with a 23% improvement over methods that do not utilise metadata, and attains a macro F1-score of 54% for 10 populated species from a severely long-tailed dataset. This is achieved with a mere 0.3% of the entire dataset used for annotation.
Documents
537742:255075
[thumbnail of Optimizing_Plankton_Image_Classification_With_Metadata-Enhanced_Representation_Learning.pdf]
Preview
Optimizing_Plankton_Image_Classification_With_Metadata-Enhanced_Representation_Learning.pdf - Published Version
Available under License Creative Commons Attribution 4.0.

Download (3MB) | Preview
Information
Programmes:
NOC Programmes > Ocean BioGeosciences
Library
Statistics

Downloads per month over past year

More statistics for this item...

Metrics

Altmetric Badge

Dimensions Badge

Share
Add to AnyAdd to TwitterAdd to FacebookAdd to LinkedinAdd to PinterestAdd to Email
View Item