U.S. flag

An official website of the United States government

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Https

Secure .gov websites use HTTPS
A lock () or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Breadcrumb

  1. Home

NIST Structured Forms Reference Set of Binary Images II (SFRS2) - NIST Special Database 6

The documents in this database are 12 different tax forms with the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real hand-printed forms prepared by individuals, but the images have been automatically derived and synthesized using a computer and contain no "real" tax data. There are 900 simulated tax submissions represented in the database averaging 6.22 form faces per submission.

About this Dataset

Updated: 2024-02-22
Metadata Last Updated: 2017-01-10 00:00:00
Date Created: N/A
Views:
Data Provided by:
ASCII references
Dataset Owner: N/A

Access this data

Contact dataset owner Access URL
Landing Page URL
Table representation of structured data
Title NIST Structured Forms Reference Set of Binary Images II (SFRS2) - NIST Special Database 6
Description The documents in this database are 12 different tax forms with the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real hand-printed forms prepared by individuals, but the images have been automatically derived and synthesized using a computer and contain no "real" tax data. There are 900 simulated tax submissions represented in the database averaging 6.22 form faces per submission.
Modified 2017-01-10 00:00:00
Publisher Name National Institute of Standards and Technology
Contact mailto:[email protected]
Keywords ASCII references , binary image databases , character recognition , characters , forms identifications , forms recognition , ground truth , hand prints , hand printed characters , hand writing recognition , handprints , images , OCR , printed characters , recognition , software recognition , tax forms
{
    "identifier": "FF429BC1786D8B3EE0431A570681E858220",
    "accessLevel": "public",
    "references": [
        "https:\/\/s3.amazonaws.com\/nist-srd\/SD6\/SD06_users_guide.pdf"
    ],
    "contactPoint": {
        "hasEmail": "mailto:[email protected]",
        "fn": "Karen Marshall"
    },
    "programCode": [
        "006:052"
    ],
    "@type": "dcat:Dataset",
    "landingPage": "https:\/\/data.nist.gov\/od\/id\/FF429BC1786D8B3EE0431A570681E858220",
    "description": "The documents in this database are 12 different tax forms with the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real hand-printed forms prepared by individuals, but the images have been automatically derived and synthesized using a computer and contain no \"real\" tax data. There are 900 simulated tax submissions represented in the database averaging 6.22 form faces per submission.",
    "language": [
        "en"
    ],
    "title": "NIST Structured Forms Reference Set of Binary Images II (SFRS2) - NIST Special Database 6",
    "distribution": [
        {
            "accessURL": "https:\/\/doi.org\/10.18434\/M3D95B"
        }
    ],
    "license": "https:\/\/www.nist.gov\/open\/license",
    "bureauCode": [
        "006:55"
    ],
    "modified": "2017-01-10 00:00:00",
    "publisher": {
        "@type": "org:Organization",
        "name": "National Institute of Standards and Technology"
    },
    "accrualPeriodicity": "irregular",
    "theme": [
        "Human language technology"
    ],
    "keyword": [
        "ASCII references",
        "binary image databases",
        "character recognition",
        "characters",
        "forms identifications",
        "forms recognition",
        "ground truth",
        "hand prints",
        "hand printed characters",
        "hand writing recognition",
        "handprints",
        "images",
        "OCR",
        "printed characters",
        "recognition",
        "software recognition",
        "tax forms"
    ]
}

Was this page helpful?