Dataset Search
Sort By
Search results
604 results found
Complex Document Information Processing (CDIP) dataset
Data provided by National Institute of Standards and Technology
This dataset is called the "IIT CDIP collection". "CDIP" stands for "Complex Document Information Processing" and "IIT" stands for "Illinois Institute of Technology" who originally built the dataset. The dataset consists of documents from the states' lawsuit against the tobacco industry in the 1990s.
Tags: optical character recognition,information retrieval,document structure,document understanding,image to text,
Modified: 2025-04-06
IACC.3 (Internet Archive Creative Commons) Video Dataset
Data provided by National Institute of Standards and Technology
The IACC.3 dataset is approximately 4600 Internet Archive videos (144 GB, 600 h) with Creative Commons licenses in MPEG-4/H.264 format with duration ranging from 6.5 min to 9.5 min and a mean duration of almost 7.8 min. Most videos will have some metadata provided by the donor available e.g., title, keywords, and description.
Tags: video retrieval,ad-hoc video search,trecvid,internet archive,search by text,
Modified: 2025-04-06
Temperature profiles of acrylonitrile butadiene styrene (ABS) during bench-scale material extrusion additive manufacturing
Data provided by National Institute of Standards and Technology
Temperature profiles of acrylonitrile butadiene styrene (ABS) filament during material extrusion additive manufacturing. Printing temperature and velocities cover the full "printable" range of the ABS filament and are corrected for reflected infrared photons. The profile of the active printing layer and two sub-layers are included. See the associated publication for the full experimental details.
Tags: Acrylonitrile butadiene styrene,ABS,FDM,thermography,3D printing,Temperature profiles,material extrusion,additive manufacturing,
Modified: 2025-04-06
Smartphone Data for Development of Indoor Localization Apps
Data provided by National Institute of Standards and Technology
The PerfLoc Prize Competition (https://perfloc.nist.gov) was developed by NIST during 2015-2017 and was run during 2017-2018. The Competition was concluded with a single winner on May 16, 2018. However, NIST believes the data collected for the PerfLoc Competition is still of value to the R&D community, because there is still room to develop better signal processing and data fusion algorithms that would fuse various types of smartphone data collected in this project to develop indoor localization apps with higher localization accuracy.
Tags: Indoor Localization; Smartphone Apps; Smartphone Location; Smartphone Sensor Data; Smartphone RF Received Signal; Wi-Fi; GPS/GNSS; Cellular Telephony; Accelerometer; Gyroscope; Magnetometer; Light Sensor; Barometer; Data Fusion Algorithms; Digital Signal Processing,
Modified: 2025-04-06
Vimeo Creative Commons Collection (V3C1)
Data provided by National Institute of Standards and Technology
The V3C1 dataset (drawn from a larger V3C video dataset) is composed of 7475 Vimeo videos (1.3 TB, 1000 h) with Creative Commons licenses and mean duration of 8 min. All videos will have some metadata available e.g., title, keywords, and description in json files. The dataset has been segmented into 1,082,659 short video segments according to the provided master shot boundary files. In addition, Keyframes and thumbnails per video segment have been extracted and available.
Tags: TRECTRECVID video searchcontent-based video retrieval,
Modified: 2025-04-06
Vimeo Creative Commons Collection (V3C2)
Data provided by National Institute of Standards and Technology
The V3C2 dataset (drawn from a larger V3C video dataset) is composed of 9760 Vimeo videos (1.6 TB, 1300 h) with Creative Commons licenses and mean duration of 8 min. All videos will have some metadata available e.g., title, keywords, and description in json files. The dataset has been segmented into 1,425,454 short video segments according to the provided master shot boundary files. In addition, Keyframes and thumbnails per video segment have been extracted and available. For more details, please consult the TRECVID website: trecvid.nist.gov
Tags: TRECTRECVID video searchcontent-based video retrieval,
Modified: 2025-04-06
Process and robot data from a two robot workcell representative performing representative manufacturing operations.
Data provided by National Institute of Standards and Technology
This data set is captured from a robot workcell that is performing activities representative of several manufacturing operations. The workcell contains two, 6-degree-of-freedom robot manipulators where one robot is performing material handling operations (e.g., transport parts into and out of a specific work space) while the other robot is performing a simulated precision operation (e.g., the robot touching the center of a part with a tool tip that leaves a mark on the part). This precision operation is intended to represent a precise manufacturing operation (e.g., welding, machining).
Tags: Condition Monitoring,diagnostics,robotics,manufacturing,prognostics,Workcell,
Modified: 2025-04-06
1986 County Business Patterns: Business Patterns
Data provided by United States Census Bureau
County Business Patterns (CBP) is an annual series that provides economic data by industry at the U.S., State, County and Metropolitan Area levels. This series includes the number of establishments, employment during the week of March 12, first quarter payroll, and annual payroll. CBP provides statistics for businesses with paid employees for the U.S., Puerto Rico, and the Island Areas. Census Bureau staff identified a processing error that affects selected data from the 2014 County Business Patterns (CBP).
Tags: census,
Modified: 2025-06-23
1991 County Business Patterns: Business Patterns
Data provided by United States Census Bureau
County Business Patterns (CBP) is an annual series that provides economic data by industry at the U.S., State, County and Metropolitan Area levels. This series includes the number of establishments, employment during the week of March 12, first quarter payroll, and annual payroll. CBP provides statistics for businesses with paid employees for the U.S., Puerto Rico, and the Island Areas. Census Bureau staff identified a processing error that affects selected data from the 2014 County Business Patterns (CBP).
Tags: census,
Modified: 2025-06-23
NDN-DPDK: High-Speed Named Data Networking Forwarder
Data provided by National Institute of Standards and Technology
NDN-DPDK is a set of high-performance Named Data Networking (NDN) programs developed with Data Plane Development Kit (DPDK). It includes a network forwarder and a traffic generator. https://github.com/usnistgov/ndn-dpdk
Tags: Named Data Networking,Information Centric Networking,
Modified: 2025-04-06