The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real forms prepared by individuals, but the images have been automatically derived and synthesized using a computer.
About this Dataset
Title | NIST Structured Forms Reference Set of Binary Images (SFRS) - NIST Special Database 2 |
---|---|
Description | The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real forms prepared by individuals, but the images have been automatically derived and synthesized using a computer. |
Modified | 2017-01-10 00:00:00 |
Publisher Name | National Institute of Standards and Technology |
Contact | mailto:[email protected] |
Keywords | ASCII references , automated character recognition , automated data capture , binary image databases , forms identifications , forms recognition , image format documentation , image software , images , OCR , printed characters , software recognition , tax forms |
{ "identifier": "FF429BC178608B3EE0431A570681E858207", "accessLevel": "public", "contactPoint": { "hasEmail": "mailto:[email protected]", "fn": "Karen Marshall" }, "programCode": [ "006:052" ], "landingPage": "https:\/\/data.nist.gov\/od\/id\/FF429BC178608B3EE0431A570681E858207", "title": "NIST Structured Forms Reference Set of Binary Images (SFRS) - NIST Special Database 2", "description": "The documents in this database are 12 different tax forms from the IRS 1040 Package X for the year 1988. These include Forms 1040, 2106, 2441, 4562, and 6251 together with Schedules A, B, C, D, E, F, and SE. Eight of these forms contain two pages or form faces; therefore, there are 20 different form faces represented in the database. The document images in this database appear to be real forms prepared by individuals, but the images have been automatically derived and synthesized using a computer.", "language": [ "en" ], "distribution": [ { "accessURL": "https:\/\/doi.org\/10.18434\/M3J08M" } ], "bureauCode": [ "006:55" ], "modified": "2017-01-10 00:00:00", "publisher": { "@type": "org:Organization", "name": "National Institute of Standards and Technology" }, "theme": [ "Human language technology" ], "keyword": [ "ASCII references", "automated character recognition", "automated data capture", "binary image databases", "forms identifications", "forms recognition", "image format documentation", "image software", "images", "OCR", "printed characters", "software recognition", "tax forms" ] }