Datasets for Computer Vision Research

Source: Internet
Author: User
Tags uci machine learning repository

Datasets for Computer Vision Research http://www.di.ens.fr/willow/research.php

The ponce Group

Main Page
Vision Research
Robotics research
Publications
Technical reports
Datasets
Software

Fifteen scene categories

This is a dataset of registrteen natural scene categories that expands on the thirteen category dataset released by Fei-fei Li. The two new categories
AreIndustrialAndStore. Classification results for the specified teen categories are presented in the following paper:

Svetlana Lazebnik, Cordelia Schmid, and Jean ponce. Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. Proceedings of the IEEE Conference
On computer vision and pattern recognition, 2006, accepted.

Download(84 MB zip file)

Thanks to Fei-fei Li and Aude Oliva for providing parts of this database.

3D object recognition stereo Dataset

This dataset consists of 9 objects and 80 test images. The training images areStereo viewsFor each of the 9 objects that are roughly equally spaced around the equatorial ring for each of them. The number of stereo views ranges
From 7 to 12 for the different objects. The test images areMonocular ImagesOf objects under varying amounts of clutter and occlusion and different lighting conditions.

Results On This dataset are reported in the following paper:

Akash Kushal and Jean ponce. Modeling 3D objects from stereo views and recognizing them in photographs. Proceedings of the European Conference on computer vision, 2006,
To appear.

Browse and download

3D photography Dataset

This is a collection of ten multiview data sets captured in our lab. Each set consists of 24 images of a single rigid object, together with camera parameters and extracted apparent seconds s for each image.

Reconstruction Results For This dataset appear in the following paper:

Yasutaka Furukawa and Jean ponce. Carved visual hulls for image-based modeling. Proceedings of the European Conference on computer vision, 2006, to appear.

Browse and download

Visual Hull Datasets

This is a collection of Visual Hull datasets used in the following paper:

Svetlana Lazebnik, Yasutaka Furukawa, and Jean ponce. projective visual hulls. Submitted to International Journal of computer vision, 2006.

Browse and download

Birds

This database contains 600 images (100 samples each) of six different classes of birds. The images are color JPEG, of Variable Resolution. The classes (each in its own directory) are as follows:

  • Egret
  • Mandarin Duck
  • Snowy owl
  • Puffin
  • Toucan
  • Wood duck

If you use this database in your own research, please cite the following paper:

Svetlana Lazebnik, Cordelia Schmid, and Jean ponce. A maximum entropy framework for part-Based Texture and object recognition. Proceedings of the IEEE International
Conference on computer vision, Beijing, China, October 2005, vol. 1, pp. 832-838.

Download:

Birds.zip (43 MB)
File numbers for training, validation, and test images for each category.
File numbers of training pairs

Butterflies

This database contains 619 images of seven different classes of butterflies. The images are color JPEG, of Variable Resolution. The classes (each in its own directory) are as follows:

  • Admiral: 111 Images
  • Black swallowtail: 42 Images
  • Machaon: 83 Images
  • Monarch 1 (WINGS closed): 74 Images
  • Monarch 2 (wings open): 84 Images
  • Peaco CK: 134 Images
  • Zebra: 91 Images

If you use this database in your own research, please cite the following paper:

Svetlana Lazebnik, Cordelia Schmid, and Jean ponce. Semi-local affine parts for object recognition. Proceedings of the British Machine Vision Conference, September 2004,
Vol. 2, pp. 959-968.

Download:

Butterflies.zip (43 MB)
File numbers of training pairs and validation images for each class



Comparative Evaluation
Object Recognition Database

This database features modeling shots of eight objects and 51 cluttered test shots ining multiple objects. the images are color S, the resolutions are 1.2 mpix (1280x960) and 3.7 mpix (2200x1700 ). if you use this database in your own research,
Please cite the following paper:

Fred rothganger, Svetlana Lazebnik, Cordelia Schmid, and Jean ponce. 3D Object Modeling and Recognition Using local affine-invariant image descriptors and multi-view spatial
Constraints. International Journal of computer vision, vol. 66, no. 3, March 2006, pp. 231-259.

When reporting your results on this data, please refer to the comparative evaluation of different state of the art recognition methods contained in this article.

Browse and download:

Image directory
Archive in. tgz format (108 MB)

The creation of this database was supported in part by the National Science Foundation under grants IIS-0308087 and IIS-0312438, The UIUC-CNRS research collaboration agreement, the European FET-open project vibes, The uiuc Campus Research Board, and
The Beckman Institute. For questions, contact Fred rothganger (rothgang-at-uiuc.edu ).

Texture Database

The texture database features 25 texture classes, 40 samples each. all images are in grayscale JPG format, 640x480 pixels. if you use this database in your own research, please cite the following paper:

Svetlana Lazebnik, Cordelia Schmid, and Jean ponce. A Sparse texture representation using local affine regions. IEEE Transactions on Pattern Analysis and machine intelligence,
Vol. 27, No. 8, pp. 1265-1278, August 2005.

Browse and download:

Example images (four per texture)
T01-T05.zip (45 MB)
T06-T10.zip (40 MB)
T11-T15.zip (44 MB)
T16-T20.zip (46 MB)
T21-T25.zip (39 MB)

The creation of this database was supported in part by the National Science Foundation under Grant IIS-0308087, the European project lava, The UIUC-CNRS research collaboration agreement, the uiuc Campus Research Board, and the Beckman Institute. for
Questions, contact Svetlana Lazebnik (slazebni-at-uiuc.edu ).

Video Sequences

This dataset is used for research on Euclidean upgrades based on minimal assumptions about the camera (e.g. square pixels ).

Browse and download:

Example Reconstructions
Archive in *. tgz format (23 MB)

Links to external datasetstexture

  • The KTH-TIPS Image Database
  • Brodatz textures
  • Meastex --- a bunch of useful texture stuff
  • Vision texture (vistex), MIT Media Lab
  • Curet (Columbia-Utrecht reflectance and texture database)
  • University of Oulu texture Database
  • Texture lab, Heriot Watt University, Edinburgh
Object Recognition
  • ETH-80 database --- 80 objects (8 basic categories), each object is represented by 41 views
  • Columbia Object Image Library: coil-20, coil-100
  • Dataset page maintained by the visual geometry group, Oxford University
  • Caltech Vision Group archive (the same recognition data sets as on the Oxford page)
  • Uiuc image database for car Detection
  • Henry Schneiderman's car Database
Image Libraries
  • Digital Library Project, Berkeley
Statistics of Natural Images
  • Hans van hateren's natural stimuli collection
Machine Learning
  • UCI machine learning Repository
Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.