Datasets

  • 20 Newsgroups, 2008

    The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. The 20 newsgroups collection has …

  • AG's News Topic Classification, 2015

    AG is a collection of more than 1 million news articles. News articles have been gathered from more than 2000 news sources by ComeToMyHead in …

  • Amazon Product Data, 2015

    This dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014.

    This dataset includes reviews (ratings, text, …

  • BS Detector, 2017

    This dataset is collected from a browser extension called BD detector created for checking news veracity. It searches all links on a given web-page for …

  • Caltech 101, 2003

    Pictures of objects belonging to 101 categories. About 40 to 800 images per category. Most categories have about 50 images.

  • Caltech 256, 2006

    Collection of 30607 pictures of objects belonging to 256 categories. About 80 to 827 images per category. There is an average of 119 images per …

  • CIFAR-10, 2009

    The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test …

  • CIFAR-100, 2009

    The CIFAR-100 dataset consists of 32x32 colour images. This dataset has 100 classes containing 600 images each. There are 500 training images and 100 testing …

  • Citeseer, 2003

    The CiteSeer dataset consists of 3312 scientific publications classified into one of six classes. The citation network consists of 4732 links. Each publication in the …

  • Cityscapes, 2016

    Cityscapes is a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high …