Election data sets

Here is a list of all the data sets in the election category, showing 20 per page.

Pierce Election Data

The 2008 Pierce Data contains the results from several elections, including county executive, held in Pierce, WA in 2008. The set contains 4 distinct elections with between 4 and 7 canddiates and 40,000 and 300,000 voters.

Note that these elections were conducted under a ranked voting system which allowed blank entries. In processing this data for PrefLib we have ignored blanks and only report the order over the candidates.

The data on this page was donated by Jeffrey O'Neill who runs the site OpenSTV.org.

Want to have more details, see this page.

Click here to download the dataset

Proto French Election

This analog dataset to the 2002 French Presidential Election Dataset was collected by Jean-Francois Laslier, Karine Van der Straeten and Michel Balinski. It consists of 398 approval ballots collected over potential candidates for the 2002 French Presidential election cast by students at Institut d’Etudes Politiques de Paris.

This dataset is interesting as its companion dataset CD-00002 has both the subjective evaluations of the candidates, along with the approvals. This dataset only preserves the approval ballots cast by the students. As the candidate set is the potential presidential candidates (and thus, not the exact set used in ED-00026), this is presented as a separate dataset.

Want to have more details, see this page.

Click here to download the dataset

San Francisco Election Data

The San Francsico data contains the results from several elections, including board of supervisors, district attorny, and mayoral elections, held in San Francisco, CA between 2008 and 2012. The set contains 14 distinct elections with between 4 and 25 canddiates and 18,000 and 195,000 voters.

Note that these elections were conducted under a ranked voting system which allowed blank entries. In processing this data for PrefLib we have ignored blanks and only report the order over the candidates.

The data on this page was donated by Jeffrey O'Neill who runs the site OpenSTV.org.

Want to have more details, see this page.

Click here to download the dataset

San Leandro Election Data

The San Leandro data contains the results from several elections, including mayor and city council elections, held in San Leandro, CA between 2010 and 2012. The set contains 3 distinct elections with between 4 and 7 canddiates and about 25,000 voters each.

Note that these elections were conducted under a ranked voting system which allowed blank entries. In processing this data for PrefLib we have ignored blanks and only report the order over the candidates.

The data on this page was donated by Jeffrey O'Neill who runs the site OpenSTV.org.

Want to have more details, see this page.

Click here to download the dataset

San Sebastian Poster Competition

Approval Ballots from the San Sebastian Poster Competition held during The Summer School on Computational Social Choice organized by COST Action IC1205 at the Miramar Palace in San Sebastian in July 2016. This set has two elections of approval ballots with 17 alternatives and about 60 voters each. The data on this page was donated by Ulle Endriss.

Two elections were held, using approval voting. In the first election the alternatives were posters A1-A17; in the second election the alternatives were posters B1-B17. There were 67 eligible voters (56 summer school participants, including the 34 poster presenters, as well as 7 lecturers and 4 organizers). Of these, 65 voters participated in the first election and 60 voters participated in the second election (1 voter did not vote in either election). The elections were conducted using the Whale3 system of Sylvain Bouveret. Most of the posters are available at the summer school website.

The original data file (ED-00033-00000001.dat) includes one column per poster. Each of the two sets of posters is ordered by the number of approvals received. Each row corresponds to a voter. The voters are ordered by the number of approvals they have given across both elections, except that the 7 voters who only participated in one of the two elections are listed last. The other files are converted into standard PrefLib format where all approved alternatives are considered a tied equivalence class.

Want to have more details, see this page.

Click here to download the dataset

Skate Data

This dataset contains figure skating rankings from various competitions during the 1998 season including the World Juniors, World Championships, and the Olympics. These data sets generally have 10-25 candidates (skaters) and 8-10 judges (voters).

The candidates (skaters) are ordered such that the first candidate skated first, and on down the list. We have maintained this order as presented in the original versions of this dataset.

Want to have more details, see this page.

Click here to download the dataset

Sushi Data

This dataset contains the results of a series of surveys conducted by Toshihiro Kamishima asking 5000 individuals for their preferences about various kinds of sushi. There are three different datasets that were elicited in different ways:

This dataset contains 14 files in total including soc, soi, toi, and toc files.

Note that the dataset was incorrectly converted, it has been fixed as of Jan 2016, please re-download.

Due to licence issues we require that you go through Toshihiro Kamishima website to obtain the datafiles and observe the following licence terms:

Want to have more details, see this page.

Click here to download the dataset

T Shirt

This dataset contains complete rank orderings of T-Shirt designs voted on by members of the Optimization Research Group at NICTA. There are 11 designs (candidates) and 30 votes about these deisgns. Voters were required to submit complete strict orders.

This data has been kindly donated by Carleton Coffrin.

Want to have more details, see this page.

Click here to download the dataset

Takoma Park Election Data

The Takoma Park Data contains the results from the 2007 Takoma Park, WA special election for city council. The set contains one elections with between 4 canddiates and about 400 voters.

Note that these elections were conducted under a ranked voting system which allowed blank entries. In processing this data for PrefLib we have ignored blanks and only report the order over the candidates.

The data on this page was donated by Jeffrey O'Neill who runs the site OpenSTV.org.

Want to have more details, see this page.

Click here to download the dataset

UK Labor Party Leadership Vote

The 2010 UK Labor Party Leadership Vote is posted at www.rangevoting.org. This set contains the votes cast by all 266 MPs over the 5 leadership candidates. The votes are incomplete strict orders which we have posted along with extensions placing all unranked candidates tied at the end and pairwise graphs.

Want to have more details, see this page.

Click here to download the dataset

Vermont District Races

This dataset contains votes for 15 different races for various public offices held in Vermont in 2014. This data was collected and donated by Jeremy A Hansen. There are 3 to 6 candidates and 532 to 1960 voters in these data files. Not all races were competitive so not every race is reported for every district.

Want to have more details, see this page.

Click here to download the dataset

Web Search

This dataset contains the results of comparing websearches across Bing, Google, Yahoo, and Ask. This data is provided by Robert Bredereck at TU:Berlin. Robert provides tools to compute Kemeny rankings on this data at his website at TU Berlin.

The data files marked big contain around 2000 candidates each while the data files marked small contain between 100 and 200 results. The search querys are shown in the names of the individual data files below. For the WebImpact files the number of search results for a particular term were used to creage a complete ranking over the search terms. These files measure the webimpact of various world cities and countries. The results are not complete and not every candidate (website) is ranked by all the voters (search engines). We have extended this data into tournament graphs, weighted majoirty graphs, and created a toc dataset where all candidates are tied, at the end of rankings.

Want to have more details, see this page.

Click here to download the dataset