U.S. flag

An official website of the United States government

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Https

Secure .gov websites use HTTPS
A lock () or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Breadcrumb

  1. Home

PlanktonSet 1.0: Plankton imagery data collected from F.G. Walton Smith in Straits of Florida from 2014-06-03 to 2014-06-06 and used in the 2015 National Data Science Bowl (NCEI Accession 0127422)

Data presented here are subset of a larger plankton imagery data set collected in the subtropical Straits of Florida from 2014-05-28 to 2014-06-14. Imagery data were collected using the In Situ Ichthyoplankton Imaging System (ISIIS-2) as part of a NSF-funded project to assess the biophysical drivers affecting fine-scale interactions between larval fish, their prey, and predators. This subset of images was used in the inaugural National Data Science Bowl (www.datasciencebowl.com) hosted by Kaggle and sponsored by Booz Allen Hamilton. Data were originally collected to examine the biophysical drivers affecting fine-scale (spatial) interactions between larval fish, their prey, and predators in a subtropical pelagic marine ecosystem. Image segments extracted from the raw data were sorted into 121 plankton classes, split 50:50 into train and test data sets, and provided for a machine learning competition (the National Data Science Bowl). There was no hierarchical relationships explicit in the 121 plankton classes, though the class naming convention and a tree-like diagram (see file "Plankton Relationships.pdf") indicated relationships between classes, whether it was taxonomic or structural (size and shape). We intend for this dataset to be available to the machine learning and computer vision community as a standard machine learning benchmark. This “Plankton 1.0” dataset is a medium-size dataset with a fair amount of complexity where image classification improvements can still be made.

About this Dataset

Updated: 2024-02-22
Metadata Last Updated: 2025-11-19T15:42:35.291Z
Date Created: N/A
Data Provided by:
Dataset Owner: N/A

Access this data

Contact dataset owner Access URL
Table representation of structured data
Title PlanktonSet 1.0: Plankton imagery data collected from F.G. Walton Smith in Straits of Florida from 2014-06-03 to 2014-06-06 and used in the 2015 National Data Science Bowl (NCEI Accession 0127422)
Description Data presented here are subset of a larger plankton imagery data set collected in the subtropical Straits of Florida from 2014-05-28 to 2014-06-14. Imagery data were collected using the In Situ Ichthyoplankton Imaging System (ISIIS-2) as part of a NSF-funded project to assess the biophysical drivers affecting fine-scale interactions between larval fish, their prey, and predators. This subset of images was used in the inaugural National Data Science Bowl (www.datasciencebowl.com) hosted by Kaggle and sponsored by Booz Allen Hamilton. Data were originally collected to examine the biophysical drivers affecting fine-scale (spatial) interactions between larval fish, their prey, and predators in a subtropical pelagic marine ecosystem. Image segments extracted from the raw data were sorted into 121 plankton classes, split 50:50 into train and test data sets, and provided for a machine learning competition (the National Data Science Bowl). There was no hierarchical relationships explicit in the 121 plankton classes, though the class naming convention and a tree-like diagram (see file "Plankton Relationships.pdf") indicated relationships between classes, whether it was taxonomic or structural (size and shape). We intend for this dataset to be available to the machine learning and computer vision community as a standard machine learning benchmark. This “Plankton 1.0” dataset is a medium-size dataset with a fair amount of complexity where image classification improvements can still be made.
Modified 2025-11-19T15:42:35.291Z
Publisher Name N/A
Contact N/A
Keywords 0127422 , biological data , images , PLANKTON , biological , in situ , R/V F.G. Walton Smith , Oregon State University, Hatfield Marine Science Center , Oregon State University, Hatfield Marine Science Center , Straits of Florida , oceanography , DOC/NOAA/NESDIS/NODC > National Oceanographic Data Center, NESDIS, NOAA, U.S. Department of Commerce , National Data Science Bowl (www.datasciencebowl.com) , Spatial variability of larval fish in relation to their prey and predator fields: Patterns and interactions from cm to 10s of km in a subtropical, pelagic environment - NSF Award 1419987 , EARTH SCIENCE > BIOLOGICAL CLASSIFICATION , EARTH SCIENCE > BIOLOGICAL CLASSIFICATION > PROTISTS > PLANKTON , EARTH SCIENCE > BIOSPHERE > ECOSYSTEMS > AQUATIC ECOSYSTEMS > PLANKTON , In situ Ichthyoplankton Imaging System (ISIIS) , F. G. Walton Smith (call sign: WCZ6292, ICES code: 33WA, 1999-) , OCEAN > ATLANTIC OCEAN > NORTH ATLANTIC OCEAN , MLHFP1 , environment , oceans
{
    "identifier": "gov.noaa.nodc:0127422",
    "accessLevel": "public",
    "contactPoint": {
        "@type": "vcard:Contact",
        "fn": "Your contact point",
        "hasEmail": "mailto:[email protected]"
    },
    "programCode": [
        "010:000"
    ],
    "landingPage": "",
    "title": "PlanktonSet 1.0: Plankton imagery data collected from F.G. Walton Smith in Straits of Florida from 2014-06-03 to 2014-06-06 and used in the 2015 National Data Science Bowl (NCEI Accession 0127422)",
    "description": "Data presented here are subset of a larger plankton imagery data set collected in the subtropical Straits of Florida from 2014-05-28 to 2014-06-14. Imagery data were collected using the In Situ Ichthyoplankton Imaging System (ISIIS-2) as part of a NSF-funded project to assess the biophysical drivers affecting fine-scale interactions between larval fish, their prey, and predators. This subset of images was used in the inaugural National Data Science Bowl (www.datasciencebowl.com) hosted by Kaggle and sponsored by Booz Allen Hamilton.  Data were originally collected to examine the biophysical drivers affecting fine-scale (spatial) interactions between larval fish, their prey, and predators in a subtropical pelagic marine ecosystem. Image segments extracted from the raw data were sorted into 121 plankton classes, split 50:50 into train and test data sets, and provided for a machine learning competition (the National Data Science Bowl). There was no hierarchical relationships explicit in the 121 plankton classes, though the class naming convention and a tree-like diagram (see file \"Plankton Relationships.pdf\") indicated relationships between classes, whether it was taxonomic or structural (size and shape). We intend for this dataset to be available to the machine learning and computer vision community as a standard machine learning benchmark. This \u201cPlankton 1.0\u201d dataset is a medium-size dataset with a fair amount of complexity where image classification improvements can still be made.",
    "language": "",
    "distribution": [
        {
            "@type": "dcat:Distribution",
            "mediaType": "application\/json",
            "accessURL": "https:\/\/www.ncei.noaa.gov\/metadata\/geoportal\/\/rest\/metadata\/item\/gov.noaa.nodc%3A0127422"
        },
        {
            "@type": "dcat:Distribution",
            "mediaType": "text\/html",
            "accessURL": "https:\/\/www.ncei.noaa.gov\/metadata\/geoportal\/\/rest\/metadata\/item\/gov.noaa.nodc%3A0127422\/html"
        },
        {
            "@type": "dcat:Distribution",
            "mediaType": "application\/xml",
            "accessURL": "https:\/\/www.ncei.noaa.gov\/metadata\/geoportal\/\/rest\/metadata\/item\/gov.noaa.nodc%3A0127422\/xml"
        },
        {
            "@type": "dcat:Distribution",
            "mediaType": "application\/octet-stream",
            "accessURL": "https:\/\/www.ncei.noaa.gov\/access\/metadata\/landing-page\/bin\/gfx?id=gov.noaa.nodc:0127422"
        }
    ],
    "bureauCode": [
        "010:04"
    ],
    "modified": "2025-11-19T15:42:35.291Z",
    "publisher": {
        "@type": "org:Organization",
        "name": "Your Publisher"
    },
    "theme": "",
    "keyword": [
        "0127422",
        "biological data",
        "images",
        "PLANKTON",
        "biological",
        "in situ",
        "R\/V F.G. Walton Smith",
        "Oregon State University, Hatfield Marine Science Center",
        "Oregon State University, Hatfield Marine Science Center",
        "Straits of Florida",
        "oceanography",
        "DOC\/NOAA\/NESDIS\/NODC > National Oceanographic Data Center, NESDIS, NOAA, U.S. Department of Commerce",
        "National Data Science Bowl (www.datasciencebowl.com)",
        "Spatial variability of larval fish in relation to their prey and predator fields: Patterns and interactions from cm to 10s of km in a subtropical, pelagic environment - NSF Award 1419987",
        "EARTH SCIENCE > BIOLOGICAL CLASSIFICATION",
        "EARTH SCIENCE > BIOLOGICAL CLASSIFICATION > PROTISTS > PLANKTON",
        "EARTH SCIENCE > BIOSPHERE > ECOSYSTEMS > AQUATIC ECOSYSTEMS > PLANKTON",
        "In situ Ichthyoplankton Imaging System (ISIIS)",
        "F. G. Walton Smith (call sign: WCZ6292, ICES code: 33WA, 1999-)",
        "OCEAN > ATLANTIC OCEAN > NORTH ATLANTIC OCEAN",
        "MLHFP1",
        "environment",
        "oceans"
    ]
}