Sample Databases

From PostgreSQL wiki

(Difference between revisions)
Jump to: navigation, search
(Created page)
 
(move sample databases over from the Performance QA page)
Line 17: Line 17:
 
* [http://www.commandprompt.com/ppbook/booktown.sql Book Town] - used for the examples in [http://www.commandprompt.com/ppbook/ Practical PostgreSQL]
 
* [http://www.commandprompt.com/ppbook/booktown.sql Book Town] - used for the examples in [http://www.commandprompt.com/ppbook/ Practical PostgreSQL]
 
* Benchmarking databases such as [[DBT-2]] or [[TPC-H]] can be used as samples.
 
* Benchmarking databases such as [[DBT-2]] or [[TPC-H]] can be used as samples.
 +
* [http://www.freebase.com/docs/data_dumps Freebase] - Various wiki style data on places/people/things - ~600MB compressed
 +
* [http://www.imdb.com/interfaces#plain IMDB] - the IMDB database - see also http://code.google.com/p/imbi/
 +
* [http://www.data.gov/ ] - US federal government data collection see also [http://www.sunlightlabs.com/ sunlightlabs]
 +
* [http://wiki.dbpedia.org/Downloads DBpedia] - wikipedia data export project
 +
* [http://linux.dell.com/dvdstore/ Dell DVDstore] - Dells DVD Store context data
 +
* [http://www.eoddata.com/ eoddata] - historic stock market data (requires registration - licence?)
 +
* [http://www.transtats.bts.gov/Tables.asp?DB_ID=120&DB_Name=Airline%20On-Time%20Performance%20Data&DB_Short_Name=On-Time RITA] - Airline On-Time Performance Data
 +
* [http://wiki.openstreetmap.org/wiki/Planet.osm Openstreetmap] - Openstreetmap source data
  
 
[[Category:Benchmarking]]
 
[[Category:Benchmarking]]

Revision as of 18:29, 20 May 2011

Many database systems provide sample databases with the product. A good intro to popular ones that includes discussion of samples available for other databases is Sample Databases for PostgreSQL and More

One trivial sample that PostgreSQL ships with is the Pgbench. This has the advantage of being built-in and supporting a scalable data generator--you can make databases of any size ranging from 16MB to 600GB (approximately) with the current version.

PgFoundry Samples

The latest collection of PostgreSQL compatible database samples is at PgFoundry Sample Databases. It includes three commonly used benchmark databases:

  • World: Based on the MySQL World sample. Has a list of Cities, Countries, and what language they speak.
  • dellstore2: PostgreSQL port of a database-neutral e-commerce test application developed by Dell. The original code supports three size scales in their data generator (10MB, 1GB, 100GB), currently only the normal, smallest sized data set has been ported to PostgreSQL. PostgreSQL 8.4: Windowing Functions uses this test data to show some advanced queries.
  • Pagilia: Based on MySQL's replacement for World, Sakila, which is itself inspired by the Dell DVD Store.

There are some other sample databases there as well, such as a USDA Food database and a large list of country data via ISO-3166 standards.

Other Samples

Personal tools