Dataset Search

Search results

47 results found

Challenge Round 0 (Dry Run) Test Dataset

Data provided by  National Institute of Standards and Technology

This dataset was an initial infrastructure test of the test harness for the TrojAI program. It should not be used for research; please use the more refined datasets generated for the other rounds. The data being generated and disseminated are training, validation, and test data used to construct trojan detection software solutions. These data, generated at NIST, consist of human-level AIs trained to perform a variety of tasks (image classification, natural language processing, etc.).

Tags: Trojan Detection, Artificial Intelligence, ai, machine learning, Adversarial Machine Learning

Modified: 2024-02-22

Active Evaluation Software for Selection of Ground Truth Labels

Data provided by  National Institute of Standards and Technology

This software repository contains Aegis (Active Evaluator Germane Interactive Selector), a Python package for evaluating machine learning systems' performance (according to a metric such as accuracy) by adaptively sampling trials to label from an unlabeled test set, minimizing the number of labels needed. The repository includes sample (public) data as well as a simulation script that tests different label-selecting strategies on already labeled test sets. The software is configured so that users can add their own data and system outputs to test evaluation.

Tags: active evaluation, machine learning, ar

Modified: 2024-02-22

Simulated Radar Waveform and RF Dataset Generator for Incumbent Signals in the 3.5 GHz CBRS Band

Data provided by  National Institute of Standards and Technology

This software tool generates simulated radar signals and creates RF datasets. The datasets can be used to develop and test detection algorithms using machine learning/deep learning techniques for the 3.5 GHz Citizens Broadband Radio Service (CBRS) or similar bands. In these bands, the primary users are federal incumbent radar systems. The software tool generates radar waveforms and randomizes the radar waveform parameters.

Tags: 3.5 GHz, CBRS, LTE, ESC, radar, radio frequency signals, spectrum, machine learning, deep learning, detection

Modified: 2024-02-22

Comparison of primary laser spectroscopy and mass spectrometry methods for measuring mass concentration of gaseous elemental mercury

Data provided by  National Institute of Standards and Technology

Data for mass concentration analysis and spectral analysis for the paper titled "Comparison of primary laser spectroscopy and mass spectrometry methods for measuring mass concentration of gaseous elemental mercury."

Tags: mercury, laser absorption spectroscopy, Standard Generator, SI traceability, ID-CV-ICP-MS, primary measurement method, Environment and Climate

Modified: 2024-02-22

Optical scattering measurements and simulation data for one-dimensional (1-D) patterned periodic sub-wavelength features

Data provided by  National Institute of Standards and Technology

This data set consists of both measured and simulated optical intensities scattered off periodic line arrays, with simulations based upon an average geometric model for these lines. These data were generated in order to determine the average feature sizes based on optical scattering, which is an inverse problem for which solutions to the forward problem are calculated using electromagnetic simulations after a parameterization of the feature geometry.

Tags: electromagnetic simulations, simulations, experimental, angle-resolved scattering, scattering, gratings, patterned semiconductors, semiconductors, scatterfield microscopy, bright-field microscopy, microscopy, inverse problems, machine learning

Modified: 2024-02-22

SRM 3133 Mercury (Hg) Standard Solution Lot No. 160921

Data provided by  National Institute of Standards and Technology

This Standard Reference Material (SRM) is intended for use as a primary calibration standard for the quantitative determination of mercury. A unit of SRM 3133 consists of five 10 mL sealed borosilicate glass ampoules of an acidified aqueous solution prepared gravimetrically to contain a known mass fraction of mercury. The solution contains nitric acid at an approximate mass fraction of 10 %. This information is publicly available in the Certificate of Analysis for this material.

Tags: mercury, pure materials, cations, metals, single element solutions, spectrometry, Advanced Materials

Modified: 2024-02-22

QFlow 2.0: Quantum dot data for machine learning

Data provided by  National Institute of Standards and Technology

Using a modified Thomas-Fermi approximation, we model a reference semiconductor system comprising a quasi-1D nanowire with a series of five depletion gates whose voltages determine the number of quantum dots (QDs), the charges on each of the QDs, as well as the conductance through the wire. The original dataset, QFlow lite, consists of 1 001 idealized simulated measurements with gate configurations sampling over different realizations of the same type of device.

Tags: machine learning, quantum dots, simulated data

Modified: 2024-02-22

Nestor: a toolkit for quantifying tacit maintenance knowledge, for investigatory analysis in smart manufacturing

Data provided by  National Institute of Standards and Technology

There is often a large amount of maintenance data already available for use in Smart Manufacturing systems, but in a currently unusable form: service tickets and maintenance work orders (MWOs). Nestor is a toolkit for using Natural Language Processing (NLP) with efficient user interaction to perform structured data extraction with minimal annotation time cost.

Tags: information, communication, maintenance, tribal knowledge, event sequences, training, machine learning, data cleaning, prognostics, diagnostics, visualization, decision guidance, CMMS, scheduling, investigations, nestor, smart manufacturing, manufacturing operations, manufacturing performance

Modified: 2024-02-22

Albatross Species Chemical Database and Annotated Bibliography

Data provided by  National Institute of Standards and Technology

Database and annotated bibliography of contaminants in tissues from albatross (Family Diomedeidae) species. The database is saved as a current Microsoft Access 365 .accdb file, a backward-compatible Access 2000 .mdb file, and a Microsoft Excel .xlsx file that contains the database output information, an introduction with a description of the headers, and some derived data. A free viewer for Excel is at: https://www.microsoft.com/en-us/p/xlsx-viewer-free/9nblggh6hbf4?activet….

Tags: seabird, tissues, PAHs, PBDEs, PCBs, PCDDs, PCDF, perfluorinated acids, phenols, toxicology pesticides, mercury, trace elements, heavy metals, organic, inorganic, chemistry, stable isotopes, Environment and Climate

Modified: 2024-02-22

REMI: Resource for Materials Informatics

Data provided by  National Institute of Standards and Technology

The REsource for Materials Informatics (REMI) will host a diverse collection of scripting notebooks (Jupyter, Matlab LiveScripts, etc.) for collecting, pre-processing, analyzing, and visualizing materials data. Notebooks are curated using tags aligned to Materials Science and Data Science topics. REMI emerged from the realization that both experts and novices wanted examples of using machine learning for science. Meanwhile, lots of experts are developing digital notebooks (e.g. Jupyter) to demonstrate step-by-step data collection, pre-processing, analysis and visualization.

Tags: machine learning, data analysis, data processing, materials science, materials genome initiative

Modified: 2024-02-22
