Docker Dataset Save

Docker database images with pre-populated data for testing and/or practice.

Project README

Docker Dataset

Build Status

Have you ever wanted to access pre-populated databases with dummy but valid data? It can be for something as simple as practicing writing SQL queries to running tests on databases. Under such circumstances, you have to either have to create dummy data or utilize some internet-searching skills to find data to populate your database. I think this is a common enough problem/requirement that solution can be Dockerized for reuse. So here is a Docker image for PostgreSQL with databases populated with sample data.

Datasets

So far we have the following datasets which are being used in the images.

  • Postgres Sample Databases: The datasets being used from here are dellstore2 (tagged dellstore), iso3166, sportsdb, usda and world. pgFoundry has been down for a few days now. Therefore we have switched the URLs to their FTP sources here.

Databases

The only database supported so far is PostgreSQL. We use the alpine version of the official image as the base image to keep our image slim.

Tags

Available tags are dellstore, iso3166, sportsdb, usda, world, all and latest. all and latest are the same image with all the datasets in one image. Each of them has been loaded into their own database in the image. The rest of the tags belong to images single datasets.

pagila has been removed

The pagila tag has been removed due to the fact that it was broken for a while and it also broke the all and latest tags. This is because the Pagila dataset we were using had a change which was not compatible with any version of Postgres (See #1 and this issue for context.

Usage

You can start the container by running:

docker run -d --name pg-ds-<tag> aa8y/postgres-dataset:<tag>

and access it by:

docker exec -it pg-ds-<tag> psql -d <db_name>

where <tag> is one of the tags mentioned here and <db_name> is the database name which is one of the dataset names mentioned here. You can also use them with docker-compose. See this example for information on how to use them.

Custom images

If you want to build a custom image with not one or all the datasets, but some, then you can do that using:

docker build -t aa8y/postgres-dataset:some --build-arg DATASETS=dellstore,world .

and then following the same aforementioned steps for using your custom image.

Future Work

  • Images for other popular databases like MySQL.
  • Add french-towns-communes-francais datasets from Postgres Sample Databases.
  • Find and add more free data sources.
Open Source Agenda is not affiliated with "Docker Dataset" Project. README Source: aa8y/docker-dataset

Open Source Agenda Badge

Open Source Agenda Rating