{PrefLib}: A Library for Preferences

Formats

We have attempted to unify the formatting of the data as much as possible. Each format is as close to comma-separated values (CSV) as possible to improve portability. We currently have 10 types of data in 3 formats.

Data Categories

Our data is separated into four broad categories:


SOC - Strict Orders - Complete List

Each file with a soc extension contains a profile consisting of a complete, transitive, and asymmetric relation over a group of objects. These are written A,B,C, meaning that A is strictly preferred to B, which is strictly preferred to C. The strict relation is always denoted by a comma (,).

Download all data files containing SOC data.


SOI - Strict Orders - Incomplete List

Each file with a soi extension contains a profile consisting of a transitive and asymmetric relation over a group of objects. These are written A,B,C, meaning that A is strictly preferred to B, which is strictly preferred to C. The strict relation is always denoted by a comma (,), and elements that an agent leaves unranked are omitted from that agent's list.

Download all data files containing SOI data.


TOC - Orders with Ties - Complete List

Each file with a toc extension contains a profile consisting of a transitive relation where all elements appear in every list. These are written A,{B,C},D, meaning that A is strictly preferred to {B,C}, the voter is indifferent between B and C, and A, B, and C are all preferred to D. The strict relation is always denoted by a comma (,), while elements that an agent is indifferent between are grouped in curly braces ({}).
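
The notation above can be split into ranks with a few lines of Python. This is only a minimal sketch: the function name `parse_order` is hypothetical and not part of any PrefLib tool, and it assumes that tie groups are not nested and that element names contain no commas or braces.

```python
def parse_order(text):
    """Split an order such as 'A,{B,C},D' into a list of ranks.

    Each rank is a list of elements the agent is indifferent between.
    Assumes tie groups are not nested (an assumption of this sketch).
    """
    ranks = []
    group = None  # members of the tie group currently being read, if any
    for token in text.split(","):
        token = token.strip()
        if token.startswith("{"):
            group = []          # a tie group opens here
            token = token[1:]
        closes = token.endswith("}")
        if closes:
            token = token[:-1]
        if group is not None:
            group.append(token)
            if closes:          # the tie group is complete
                ranks.append(group)
                group = None
        else:
            ranks.append([token])
    return ranks


print(parse_order("A,{B,C},D"))  # [['A'], ['B', 'C'], ['D']]
```

A strict order such as A,B,C comes back as singleton ranks, so the same function covers lists with and without ties.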

Download all data files containing TOC data.


TOI - Orders with Ties - Incomplete List

Each file with a toi extension contains a profile consisting of a transitive relation where not all elements appear in every list. These are written A,{B,C},D, meaning that A is strictly preferred to {B,C}, the voter is indifferent between B and C, and A, B, and C are all preferred to D. The strict relation is always denoted by a comma (,), while elements that an agent is indifferent between are grouped in curly braces ({}).

Download all data files containing TOI data.


TOG - Tournament Graph

Files with a tog extension describe a tournament graph. A tournament graph is a complete directed graph over a set of alternatives. In our formatting we simply state the list of pairwise relations that describe the tournament (A,B; B,C). The strict relation is always denoted by a comma (,). Note that a tournament must assign the strict preference relation to every pair of alternatives; otherwise the graph is a majority graph (see below).
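
The completeness requirement can be checked mechanically. The sketch below is illustrative only: `is_tournament` is a hypothetical helper, and it assumes the pairwise relations have already been read into (winner, loser) tuples.

```python
from itertools import combinations


def is_tournament(alternatives, edges):
    """Return True if every pair of alternatives is related in exactly one direction."""
    edge_set = set(edges)
    for a, b in combinations(alternatives, 2):
        # Both directions present, or neither present, violates completeness.
        if ((a, b) in edge_set) == ((b, a) in edge_set):
            return False
    return True


print(is_tournament(["A", "B", "C"], [("A", "B"), ("B", "C"), ("A", "C")]))  # True
print(is_tournament(["A", "B", "C"], [("A", "B"), ("B", "C")]))              # False
```

The second call fails because no relation is given between A and C, so that edge list would describe a majority graph rather than a tournament.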

Download all data files containing TOG data.


MJG - Majority Graph

Files with a mjg extension describe a majority graph (relation). A majority graph is a directed graph over a set of alternatives. In our formatting we simply state the list of pairwise relations that describe the majority graph (A,B; B,C). The strict relation is always denoted by a comma (,). Note that majority graphs need not be complete. Additionally, there is no concept of a "tie" between two elements.

Download all data files containing MJG data.


WMG - Weighted Majority Graph

Files with a wmg extension describe a majority graph (relation) with weights. A majority graph is a directed graph over a set of alternatives. In our formatting we simply state the list of pairwise relations that describe the majority graph (A,B; B,C), each with a weight equal to the number of times the relation (A,B) was listed minus the number of times the reverse relation (B,A) was listed. The strict relation is always denoted by a comma (,). Note that majority graphs need not be complete. Additionally, there is no concept of a "tie" between two elements.
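
As a rough illustration of the weight computation, the sketch below turns raw pairwise counts into margins. The function name `weighted_majority` is hypothetical, and dropping pairs with a zero or negative margin is an assumption of this sketch (motivated by the absence of ties), not something the format description states.

```python
def weighted_majority(pair_counts):
    """Map each pair (a, b) to count(a, b) - count(b, a), keeping positive margins only."""
    wmg = {}
    for (a, b), n in pair_counts.items():
        margin = n - pair_counts.get((b, a), 0)
        if margin > 0:  # assumption: zero margins are omitted, since ties are not represented
            wmg[(a, b)] = margin
    return wmg


print(weighted_majority({("A", "B"): 61, ("B", "A"): 21}))  # {('A', 'B'): 40}
```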

Download all data files containing WMG data.


PWG - Pairwise Graph

Files with a pwg extension describe a set of pairwise relations. Every pairwise comparison that appears in the data set is listed along with the number of times it appears. For example: 61 A,B; 21 B,A.
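
One way such counts could be induced from a set of strict orders is sketched below. The helper `pairwise_counts` and the three-alternative profile are made up for illustration; the profile is given as (count, order) pairs with the most preferred element first.

```python
from collections import Counter
from itertools import combinations


def pairwise_counts(profile):
    """Count how often each ordered pair (a, b) appears with a ranked above b."""
    counts = Counter()
    for count, order in profile:
        # combinations() keeps list order, so a always precedes b in the ranking.
        for a, b in combinations(order, 2):
            counts[(a, b)] += count
    return counts


# Made-up profile: 3 voters rank A,B,C and 2 voters rank B,A,C.
profile = [(3, ["A", "B", "C"]), (2, ["B", "A", "C"])]
counts = pairwise_counts(profile)
print(counts[("A", "B")], counts[("B", "A")])  # 3 2
```

With 5 voters and 3 alternatives, the counts sum to 5 * 3 = 15, one entry per voter per pair of alternatives.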

Download all data files containing PWG data.


WMD - Weighted Matching Data

Files with a wmd extension describe a set of weighted matching data. These are directed graphs in which each source is linked to a sink with a specified weight.

Download all data files containing WMD data.


DAT - Extra Data File

Files with a dat extension contain miscellaneous data for a given problem. This is generally extra information that does not fit the other formats but may be useful to some users.


Election Data Format

The format for all election data is (each element on a new line):

  • Number of Candidates
  • 1, Candidate Name
  • 2, Candidate Name
  • ...
  • Number of Voters, Sum of Vote Count, Number of Unique Orders
  • count, preference list, for example 12,1,2,{3,4}. A strict ordering is indicated by a comma (,) and elements that the voter is indifferent between are grouped in curly braces ({ }).
  • count, preference list, for example 12,1,2,3,4.
  • ...
Votes are sorted by count in the individual data files. The "Number of Voters" field is the number of actual ballots that were cast. The "Sum of Vote Count" field is the sum of the weights. In most cases Number of Voters = Sum of Vote Count. The exception is when we have induced a relation, for example by generating a pairwise graph from a set of linear orders. In that case, with n voters over m alternatives, the sum of weights is n * (m choose 2) = n * m(m-1)/2, since each voter expresses a relation between each pair of elements.
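
The layout above could be read along the following lines. This is a sketch under stated assumptions, not an official PrefLib parser: `parse_election` is a hypothetical name, the sample data is invented, blank lines are ignored, and each preference list is kept as a raw string rather than being split further.

```python
def parse_election(text):
    """Parse election data laid out as described above (a sketch, not an official parser)."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    m = int(lines[0])  # number of candidates
    candidates = {}
    for ln in lines[1:1 + m]:
        ident, name = ln.split(",", 1)
        candidates[int(ident)] = name.strip()
    # Header row: number of voters, sum of vote count, number of unique orders.
    n_voters, sum_counts, n_unique = (int(x) for x in lines[1 + m].split(","))
    orders = []
    for ln in lines[2 + m:]:
        count, order = ln.split(",", 1)  # split off the count only
        orders.append((int(count), order.strip()))
    return candidates, (n_voters, sum_counts, n_unique), orders


# Invented sample data, for illustration only.
sample = """3
1, Alice
2, Bob
3, Carol
5, 5, 2
3,1,2,3
2,2,{1,3}
"""
candidates, header, orders = parse_election(sample)
print(candidates)  # {1: 'Alice', 2: 'Bob', 3: 'Carol'}
print(orders)      # [(3, '1,2,3'), (2, '2,{1,3}')]
```

A simple consistency check after parsing is that the counts add up to the "Sum of Vote Count" field and that the number of order lines matches "Number of Unique Orders".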

See all the Election Data (ED) data sets.


Weighted Matching Data

The format for all weighted matching data is (each element on a new line):

  • Number of Nodes, Number of Edges
  • 1, Name of Node
  • 2, Name of Node
  • ...
  • Source, Destination, Weight
  • Source, Destination, Weight
  • ...
The edges are sorted by source so that all edges starting from the same source are grouped together.
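
A reader for this layout might look like the sketch below. The name `parse_matching` and the sample data are invented, and the sketch assumes that sources and destinations are the integer node numbers from the node list and that weights are numeric; neither assumption is stated in the format description.

```python
def parse_matching(text):
    """Parse weighted matching data laid out as described above (illustrative sketch)."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    n_nodes, n_edges = (int(x) for x in lines[0].split(","))
    names = {}
    for ln in lines[1:1 + n_nodes]:
        ident, name = ln.split(",", 1)
        names[int(ident)] = name.strip()
    edges = []
    for ln in lines[1 + n_nodes:1 + n_nodes + n_edges]:
        src, dst, weight = (field.strip() for field in ln.split(","))
        # Assumption: node references are integers and weights are numeric.
        edges.append((int(src), int(dst), float(weight)))
    return names, edges


# Invented sample data, for illustration only.
sample = """3, 2
1, Node One
2, Node Two
3, Node Three
1, 3, 0.5
2, 3, 1.0
"""
names, edges = parse_matching(sample)
print(edges)  # [(1, 3, 0.5), (2, 3, 1.0)]
```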

See all the Matching Data (MD) data sets.


Extra Data File

The format for miscellaneous data is always a simple CSV file.

  • Item 1 Name, Item 2 Name, Item 3 Name, ..., Item N Name
  • Item 1 Value, Item 2 Value, Item 3 Value, ..., Item N Value
Files with a dat extension are generally paired with another file, providing more information than is expressible in the basic data formats.
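
Because the layout is plain CSV with a header row, Python's standard `csv` module is enough to read it. The sketch below assumes the first row holds the item names and each later row holds values; `read_dat` and the sample columns are invented for illustration.

```python
import csv
import io


def read_dat(text):
    """Read header-plus-values CSV text into a list of dictionaries."""
    # skipinitialspace drops the blank that follows each comma in "a, b, c".
    reader = csv.DictReader(io.StringIO(text), skipinitialspace=True)
    return list(reader)


# Invented sample data, for illustration only.
rows = read_dat("District, Turnout\nNorth, 0.61\nSouth, 0.54\n")
print(rows[0]["Turnout"])  # 0.61
```

All values come back as strings, so numeric columns need an explicit conversion such as `float(rows[0]["Turnout"])`.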

