Goose Dataset Save

Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms.

Project README

Goose Dataset

This is a dataset of 1,000 Canada goose images. They are all mugshots, with the head being the main feature of the image. Each image comes with an annotation XML in the PASCAL VOC format with a bounding box of the head and the object class goose-head.

About the Dataset

Canada geese have a distinct and high-contrast facial pattern that is highly recognizable. The images in this dataset are all somewhat visually similar. The head in the image is either the left side or the right side, not front or back. The background is usually blurry. Only one goose head is present in each image. These properties makes the dataset rather simple for various computer vision tasks, and can serve as a starting point for computer vision experiments such as object recognition, localization, detection, etc.

All images are photographed by me.

sample_images.png
Sample images from the dataset.

Usage

The images directory contains 1,000 JPG images of the size 800 x 533. The annotations directory contains the XML annotations. These can be directly used in the darkflow implementation of object detection by placing them under darkflow/test/training.

If you are only using the images and not the annotations, I also implemented a load_data() API that has compatible usage as those in the Keras built-in datasets.

(x_train, y_train), (x_test, y_test) = GooseDataset.load_data(test_ratio=0.2,
                                                              grayscale=False,
                                                              cropped=False,
                                                              resize_shape=None,
                                                              save=False)
  • Returns:

    • 2 tuples:
      • (x_train, x_test): Arrays of integers with shape (n_samples, n_rows, n_columns, 3) in the range [0, 255] for RGB, or arrays of floats with shape (n_samples, n_rows, n_columns, 1) in the range [0, 1] for grayscale.
      • (y_train, y_test): Arrays of integers with shape (n_samples, 1). All values are 1, which stands for presence of goose in the picture.
  • Arguments:

    • test_ratio: Float. Ratio of (number of testing data) / (number of all data).
    • grayscale: Boolean. Whether to load the images in grayscale.
    • cropped: Boolean. Whether to load only the cropped bounding boxes of the images. If True, a new directory cropped_images will be created in addition to loading the images.
    • resize_shape: [n_rows, n_columns], target shape of the loaded images. If None, the default shape [533, 800] will be used.
    • save: Boolean. Whether to save the images as how they are loaded into a new directory processed_images in addition to loading. This can be used to visualize the actual images being loaded after grayscaling, cropping, and/or resizing.

To see an actual example, please check the Example Jupyter notebook.

Note

Please do not use this dataset to derive any work that harms animals.

Open Source Agenda is not affiliated with "Goose Dataset" Project. README Source: steggie3/goose-dataset
Stars
35
Open Issues
0
Last Commit
5 years ago
License
MIT

Open Source Agenda Badge

Open Source Agenda Rating