U.S. flag

An official website of the United States government

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Https

Secure .gov websites use HTTPS
A lock () or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Breadcrumb

  1. Home

SDNist v1.3: Temporal Map Challenge Environment

SDNist (v1.3) is a set of benchmark data and metrics for the evaluation of synthetic data generators on structured tabular data. This version (1.3) reproduces the challenge environment from Sprints 2 and 3 of the Temporal Map Challenge. These benchmarks are distributed as a simple open-source python package to allow standardized and reproducible comparison of synthetic generator models on real world data and use cases. These data and metrics were developed for and vetted through the NIST PSCR Differential Privacy Temporal Map Challenge, where the evaluation tools, k-marginal and Higher Order Conjunction, proved effective in distinguishing competing models in the competition environment.SDNist is available via `pip` install: `pip install sdnist==1.2.8` for Python >=3.6 or on the [USNIST/Github](https://github.com/usnistgov/Differential-Privacy-Temporal-Map-Challeng…). The sdnist Python module will download data from NIST as necessary, and users are not required to download data manually.

About this Dataset

Updated: 2025-04-06
Metadata Last Updated: 2021-12-06 00:00:00
Date Created: N/A
Data Provided by:
Dataset Owner: N/A

Access this data

Contact dataset owner Landing Page URL
Download URL
Table representation of structured data
Title SDNist v1.3: Temporal Map Challenge Environment
Description SDNist (v1.3) is a set of benchmark data and metrics for the evaluation of synthetic data generators on structured tabular data. This version (1.3) reproduces the challenge environment from Sprints 2 and 3 of the Temporal Map Challenge. These benchmarks are distributed as a simple open-source python package to allow standardized and reproducible comparison of synthetic generator models on real world data and use cases. These data and metrics were developed for and vetted through the NIST PSCR Differential Privacy Temporal Map Challenge, where the evaluation tools, k-marginal and Higher Order Conjunction, proved effective in distinguishing competing models in the competition environment.SDNist is available via `pip` install: `pip install sdnist==1.2.8` for Python >=3.6 or on the [USNIST/Github](https://github.com/usnistgov/Differential-Privacy-Temporal-Map-Challenge-assets/). The sdnist Python module will download data from NIST as necessary, and users are not required to download data manually.
Modified 2021-12-06 00:00:00
Publisher Name National Institute of Standards and Technology
Contact mailto:[email protected]
Keywords private information sharing , differential privacy , privacy , benchmarks , synthetic data
{
    "identifier": "ark:\/88434\/mds2-2515",
    "accessLevel": "public",
    "contactPoint": {
        "hasEmail": "mailto:[email protected]",
        "fn": "Gary Howarth II"
    },
    "programCode": [
        "006:045"
    ],
    "landingPage": "https:\/\/data.nist.gov\/od\/id\/mds2-2515",
    "title": "SDNist v1.3: Temporal Map Challenge Environment",
    "description": "SDNist (v1.3) is a set of benchmark data and metrics for the evaluation of synthetic data generators on structured tabular data. This version (1.3) reproduces the challenge environment from Sprints 2 and 3 of the Temporal Map Challenge. These benchmarks are distributed as a simple open-source python package to allow standardized and reproducible comparison of synthetic generator models on real world data and use cases. These data and metrics were developed for and vetted through the NIST PSCR Differential Privacy Temporal Map Challenge, where the evaluation tools, k-marginal and Higher Order Conjunction, proved effective in distinguishing competing models in the competition environment.SDNist is available via `pip` install: `pip install sdnist==1.2.8` for Python >=3.6 or on the [USNIST\/Github](https:\/\/github.com\/usnistgov\/Differential-Privacy-Temporal-Map-Challenge-assets\/). The sdnist Python module will download data from NIST as necessary, and users are not required to download data manually.",
    "language": [
        "en"
    ],
    "distribution": [
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/GA_NC_SC_10Y_PUMS.parquet",
            "mediaType": "application\/octet-stream",
            "title": "Census GA_NC_SC data"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/GA_NC_SC_10Y_PUMS.json",
            "mediaType": "application\/json",
            "title": "Census GA_NC_SC schema"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/IL_OH_10Y_PUMS.parquet",
            "mediaType": "application\/octet-stream",
            "title": "Census IL-OH data"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/IL_OH_10Y_PUMS.json",
            "mediaType": "application\/json",
            "title": "Census IL_OH schema"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/NY_PA_10Y_PUMS.parquet",
            "mediaType": "application\/octet-stream",
            "title": "Census NY-PA data"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/NY_PA_10Y_PUMS.json",
            "mediaType": "application\/json",
            "title": "Census NY_PA schema"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi2016.parquet",
            "mediaType": "application\/octet-stream",
            "title": "Taxi 2016"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi2016.json",
            "mediaType": "application\/json",
            "title": "Taxi 2016 schema"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi2020.parquet",
            "mediaType": "application\/octet-stream",
            "title": "Taxi 2020 data"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi2020.json",
            "mediaType": "application\/json",
            "title": "Taxi 2020 schema"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi.parquet",
            "mediaType": "application\/octet-stream",
            "title": "Taxi data"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi.json",
            "mediaType": "application\/json",
            "title": "Taxi schema"
        },
        {
            "accessURL": "https:\/\/doi.org\/10.18434\/mds2-2515",
            "title": "DOI Access for SDNist: Benchmark data and evaluation tools for data synthesizers."
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/report2.jinja2",
            "description": "A jinja2 report template to help humans read the k-marginal data",
            "mediaType": "application\/octet-stream",
            "title": "K-marginal report template"
        },
        {
            "accessURL": "https:\/\/github.com\/usnistgov\/SDNist\/",
            "format": "Python 3.8 module",
            "description": "SDNist: Benchmark data and evaluation tools for synthetic data generators",
            "title": "SDNist software respository at Github"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/census-datasets-CSVs.zip",
            "format": "CSV",
            "description": "Three compressed CSV files to run the 'Census'-related functions in SDNist.",
            "mediaType": "application\/zip",
            "title": "Datasets for 'Census' evaluation in CSV format"
        },
        {
            "downloadURL": "https:\/\/data.nist.gov\/od\/ds\/mds2-2515\/taxi-datasets-CSVs.zip",
            "format": "CSV",
            "description": "Three compressed CSV files to run the 'Taxi'-related functions in SDNist.",
            "mediaType": "application\/zip",
            "title": "Taxi datasets in CSV format"
        }
    ],
    "bureauCode": [
        "006:55"
    ],
    "modified": "2021-12-06 00:00:00",
    "publisher": {
        "@type": "org:Organization",
        "name": "National Institute of Standards and Technology"
    },
    "theme": [
        "Public Safety:Public safety communications research",
        "Information Technology:Privacy",
        "Information Technology:Artificial Intelligence"
    ],
    "keyword": [
        "private information sharing",
        "differential privacy",
        "privacy",
        "benchmarks",
        "synthetic data"
    ]
}