U.S. flag

An official website of the United States government

Dot gov

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Https

Secure .gov websites use HTTPS
A lock () or https:// means you’ve safely connected to the .gov website. Share sensitive information only on official, secure websites.

Breadcrumb

  1. Home

2022 NeuCLIR Dataset

Cross-language Information Retrieval (CLIR) has been studied at TREC and subsequent evaluation forums for more than twenty years, but recent advances in the application of deep learning to information retrieval (IR) warrant a new, large-scale effort that will enable exploration of classical and modern IR techniques for this task.

About this Dataset

Updated: 2025-04-06
Metadata Last Updated: 2023-02-01 00:00:00
Date Created: N/A
Data Provided by:
Dataset Owner: N/A

Access this data

Contact dataset owner Access URL
Landing Page URL
Table representation of structured data
Title 2022 NeuCLIR Dataset
Description Cross-language Information Retrieval (CLIR) has been studied at TREC and subsequent evaluation forums for more than twenty years, but recent advances in the application of deep learning to information retrieval (IR) warrant a new, large-scale effort that will enable exploration of classical and modern IR techniques for this task.
Modified 2023-02-01 00:00:00
Publisher Name National Institute of Standards and Technology
Contact mailto:[email protected]
Keywords TREC text retrieval conference
{
    "identifier": "ark:\/88434\/mds2-3289",
    "accessLevel": "public",
    "contactPoint": {
        "hasEmail": "mailto:[email protected]",
        "fn": "Ian Soboroff"
    },
    "programCode": [
        "006:045"
    ],
    "landingPage": "https:\/\/data.nist.gov\/od\/id\/mds2-3289",
    "title": "2022 NeuCLIR Dataset",
    "description": "Cross-language Information Retrieval (CLIR) has been studied at TREC and subsequent evaluation forums for more than twenty years, but recent advances in the application of deep learning to information retrieval (IR) warrant a new, large-scale effort that will enable exploration of classical and modern IR techniques for this task.",
    "language": [
        "en"
    ],
    "distribution": [
        {
            "accessURL": "https:\/\/ir.nist.gov\/neuclir\/neuclir1.tar.gz",
            "title": "2022 NeuCLIR Corpus"
        },
        {
            "accessURL": "https:\/\/trec.nist.gov\/data\/neuclir\/topics.0720.utf8.jsonl",
            "title": "2022 NeuCLIR Topics"
        },
        {
            "accessURL": "https:\/\/ir-datasets.com\/neuclir.html",
            "title": "2022 NeuCLIR Dataset"
        },
        {
            "accessURL": "https:\/\/trec.nist.gov\/data\/neuclir2022.html",
            "title": "2022 NeuCLIR Main Page"
        }
    ],
    "bureauCode": [
        "006:55"
    ],
    "modified": "2023-02-01 00:00:00",
    "publisher": {
        "@type": "org:Organization",
        "name": "National Institute of Standards and Technology"
    },
    "theme": [
        "Information Technology:Data and informatics"
    ],
    "keyword": [
        "TREC text retrieval conference"
    ]
}