The ad hoc retrieval task investigates the performance of systems that search a static set of documents using new questions (called topics in TREC). The task resembles how a researcher might use a library: the collection is known, but the questions likely to be asked are not. NIST provides participants with approximately 2 gigabytes of documents and a set of 50 natural language topic statements. Participants produce a set of queries from the topic statements and run those queries against the documents. The output of this run is the official test result for the ad hoc task. Participants return the best 1000 documents retrieved for each topic to NIST for evaluation. The dataset comprises the documents, the topics, and the annotations of relevant documents.
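The workflow described above (run queries, return at most 1000 ranked documents per topic, score them against the relevance annotations) can be sketched as follows. This is a minimal illustration, not NIST's evaluation tooling. The function names, the run tag, and the toy document ids are hypothetical. The six-column run line (`topic Q0 docno rank score tag`) and the four-column qrels line (`topic iteration docno relevance`) follow the layout conventionally used with trec_eval; this record does not state either layout, so both are assumptions here.

```python
from collections import defaultdict

MAX_DOCS_PER_TOPIC = 1000  # limit stated in the task description


def format_run(scores, tag="example_run", limit=MAX_DOCS_PER_TOPIC):
    """Turn {topic: {docno: score}} into ranked run lines, best score first."""
    lines = []
    for topic in sorted(scores):
        # Sort by descending score; break ties by docno for a stable order.
        ranked = sorted(scores[topic].items(), key=lambda kv: (-kv[1], kv[0]))
        for rank, (docno, score) in enumerate(ranked[:limit], start=1):
            lines.append(f"{topic} Q0 {docno} {rank} {score:.4f} {tag}")
    return lines


def parse_qrels(lines):
    """Read 'topic iteration docno relevance' lines into {topic: set of relevant docnos}."""
    relevant = defaultdict(set)
    for line in lines:
        parts = line.split()
        if len(parts) != 4:
            continue  # skip blank or malformed lines
        topic, _iteration, docno, rel = parts
        if int(rel) > 0:
            relevant[topic].add(docno)
    return relevant


def average_precision(ranked_docnos, relevant_docnos):
    """Average of precision values at the ranks where relevant documents appear."""
    if not relevant_docnos:
        return 0.0
    hits = 0
    total = 0.0
    for rank, docno in enumerate(ranked_docnos, start=1):
        if docno in relevant_docnos:
            hits += 1
            total += hits / rank
    return total / len(relevant_docnos)


# Toy data (hypothetical ids and scores, not taken from the collection).
scores = {"401": {"DOC-A": 2.5, "DOC-B": 1.0, "DOC-C": 0.2}}
qrels_lines = ["401 0 DOC-A 1", "401 0 DOC-B 0", "401 0 DOC-C 1"]

run_lines = format_run(scores)
ranked = [line.split()[2] for line in run_lines]
ap = average_precision(ranked, parse_qrels(qrels_lines)["401"])
```

In the toy data the relevant documents sit at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2. A real evaluation would typically use trec_eval rather than a hand-written scorer.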
About this Dataset
Title | TREC 1999 Adhoc Dataset |
---|---|
Modified | 2024-10-31 00:00:00 |
Publisher Name | National Institute of Standards and Technology |
Contact | mailto:[email protected] |
Keywords | TREC text retrieval conference |
{ "identifier": "ark:\/88434\/mds2-3620", "accessLevel": "public", "contactPoint": { "hasEmail": "mailto:[email protected]", "fn": "Ian Soboroff" }, "programCode": [ "006:045" ], "landingPage": "https:\/\/data.nist.gov\/od\/id\/mds2-3620", "title": "TREC 1999 Adhoc Dataset", "description": "The ad hoc retrieval task investigates the performance of systems that search a static set of documents using new questions (called topics in TREC). This task is similar to how a researcher might use a library - the collection is known but the questions likely to be asked are not known. NIST provides the participants approximately 2 gigabytes worth of documents and a set of 50 natural language topic statements. The participants produce a set of queries from the topic statements and run those queries against the documents. The output from this run is the official test result for the ad hoc task. Participants return the best 1000 documents retrieved for each topic to NIST for evaluation. The dataset comprises the documents, the topics, and the annotations of relevant documents.", "language": [ "en" ], "distribution": [ { "accessURL": "https:\/\/trec.nist.gov\/data\/cd45\/index.html", "title": "TREC 1999 ADHOC CORPUS" }, { "accessURL": "https:\/\/trec.nist.gov\/data\/topics_eng\/topics.401-450.gz", "title": "TREC 1999 ADHOC TOPICS" }, { "accessURL": "https:\/\/trec.nist.gov\/data\/qrels_eng\/qrels.trec8.adhoc.parts1-5.tar.gz", "title": "TREC 1999 ADHOC QRELS" }, { "accessURL": "https:\/\/trec.nist.gov\/data\/cd45\/index.html", "title": "TREC 1999 Adhoc document corpus" }, { "accessURL": "https:\/\/trec.nist.gov\/data\/topics_eng\/topics.401-450.gz", "format": "SGML", "title": "TREC 1999 Adhoc Topics" }, { "accessURL": "https:\/\/trec.nist.gov\/data\/qrels_eng\/qrels.trec8.adhoc.parts1-5.tar.gz", "title": "TREC 1999 Adhoc relevance judgments (\"qrels\")" } ], "bureauCode": [ "006:55" ], "modified": "2024-10-31 00:00:00", "publisher": { "@type": "org:Organization", "name": "National Institute of Standards and Technology" }, "theme": [ "Information Technology:Data and informatics" ], "keyword": [ "TREC text retrieval conference" ] }
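The metadata record lists the topics file as SGML. As a rough sketch of how such topic statements might be read, the parser below assumes a layout that this record does not spell out: each topic wrapped in `<top>` ... `</top>`, with unclosed `<num>`, `<title>`, `<desc>`, and `<narr>` field tags and the labels `Number:`, `Description:`, and `Narrative:`. The function name `parse_topics` and the sample text are hypothetical placeholders, not content from the actual file.

```python
import re

# Assumed field tags; they open a field but are never closed.
FIELD_RE = re.compile(r"<(num|title|desc|narr)>")
# Assumed human-readable labels that follow some of the tags.
LABEL_RE = re.compile(r"^\s*(Number|Description|Narrative):\s*")


def parse_topics(text):
    """Return one dict per <top> block, keyed by field tag name."""
    topics = []
    for block in re.findall(r"<top>(.*?)</top>", text, flags=re.DOTALL):
        # Splitting on a capturing group yields [preamble, tag, body, tag, body, ...].
        parts = FIELD_RE.split(block)
        topic = {}
        for tag, body in zip(parts[1::2], parts[2::2]):
            body = LABEL_RE.sub("", body)      # drop the leading label, if any
            topic[tag] = " ".join(body.split())  # collapse line breaks and runs of spaces
        topics.append(topic)
    return topics


# Placeholder topic, invented purely to exercise the parser.
SAMPLE = """<top>

<num> Number: 999
<title> placeholder title

<desc> Description:
Placeholder description text
spanning two lines.

<narr> Narrative:
Placeholder narrative.

</top>
"""

topics = parse_topics(SAMPLE)
```

A downloaded copy of the gzipped topics file could be opened with `gzip.open(path, "rt")` and its contents passed to `parse_topics`. The file's character encoding is not stated in this record, so it would need to be checked first.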