A large-scale dataset in first-person (egocentric) vision: multi-faceted, audio-visual, non-scripted recordings in native environments (i.e. the wearers' homes), capturing all daily activities in the …
A dataset for fake news detection. It consists of two sets of data verified via PolitiFact and GossipCop.
The Fashion MNIST dataset is a large freely available database of fashion images that is commonly used for training and testing various machine learning algorithms. …
This dataset contains knowledge base relation triples and textual mentions of Freebase entity pairs, as used in the work published in (Toutanova and Chen CVSM-2015) …
The FreiHAND dataset for hand pose and shape estimation from a single colour image, which can serve as both a training and a benchmarking dataset for deep learning …
The General Language Understanding Evaluation (GLUE) benchmark is a collection of resources for training, evaluating, and analyzing natural language understanding systems. GLUE consists of:
- …
GoodWiki is a 179 million token dataset of English Wikipedia articles collected on September 4, 2023, that have been marked as Good or Featured by …
Hate Speech and Offensive Language, 2017
A dataset of tweets specifically created for hate speech detection in online media. The dataset can be used to train classifiers that differentiate between hate …
The Public Dataset of Accelerometer Data for Human Motion Primitives Detection is a public collection of labelled accelerometer data recordings to be used for the …
ImageNet is an image dataset organized according to the WordNet hierarchy. Each meaningful concept in WordNet, possibly described by multiple words or word phrases, is …