Combinatorial data sets

Here is a list of all the data sets in the combinatorial category, showing 20 per page.

Social Recommendation

This dataset contains the Facebook Social Graph and full ratings of 16 restaurants and 23 pubs by 93 users.

The .zip file contains anonymized versions of the social network and the item ratings. It includes three files: links.csv, pubs.csv and rest.csv.

Each line in the rating files (pubs.csv and rest.csv) represents one participant and has the structure: userid,X1,...,Xn. The userid in these files corresponds to the IDs in the links.csv file.
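The line structure above can be read with a few lines of Python. This is a minimal sketch, not a loader supplied with the dataset: it assumes the rating files have no header row, keeps the X1,...,Xn values as strings because their type is not stated here, and uses a made-up sample instead of real data. The function name read_ratings is hypothetical.

```python
import csv
import io

def read_ratings(text):
    """Map each userid to its list of rating values (X1..Xn), kept as strings."""
    ratings = {}
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip blank lines
            continue
        userid, *values = row
        ratings[userid] = values
    return ratings

# Made-up sample in the userid,X1,...,Xn layout; not real data from the set.
sample = "u1,5,3,4\nu2,2,4,1\n"
ratings = read_ratings(sample)
```

For the real files, the text of pubs.csv or rest.csv would be passed in place of the sample, and the resulting keys could then be matched against the IDs in links.csv.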

The data on this page has been donated by Lihi Dery.

For more details, see this page.

Click here to download the dataset

Trip Advisor Data

This dataset contains 675,069 reviews of 1,851 hotels across the world scraped from Trip Advisor. The data was scraped and donated by Hongning Wang.

One file contains the numerical aspect ratings provided by the users, along with other information about the hotel. The second file contains the text of the users' reviews. These reviews have been slightly modified: all excess spaces and tabs have been removed, and all commas have been changed to semicolons.

Both files are zipped due to their size. Both are in the dat format, and the first line of each file explains the fields within the file. Some of the usernames contain Unicode characters, so please be careful when parsing the files!
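A careful parse might look like the sketch below. It rests on assumptions that are not confirmed here: that fields are separated by commas (suggested by the commas in the review text having been replaced), that the first line can be used as a list of field names, and that the files are UTF-8. The function name read_dat, the file name reviews.dat and the sample field names are hypothetical.

```python
import csv
import io

def read_dat(stream, delimiter=","):
    """Yield one dict per record, keyed by the names on the first line."""
    reader = csv.reader(stream, delimiter=delimiter)
    header = next(reader)  # the first line explains the fields
    for row in reader:
        if row:  # skip blank lines
            yield dict(zip(header, row))

# Hypothetical two-field sample; real field names come from the file itself.
sample = io.StringIO("username,review\nJos\u00e9,Great stay; would return\n")
records = list(read_dat(sample))

# For a file on disk, open it with an explicit encoding (assumed UTF-8):
# with open("reviews.dat", encoding="utf-8", newline="") as f:
#     records = list(read_dat(f))
```

Opening the file with an explicit encoding avoids depending on the platform default, which is where non-ASCII usernames usually cause trouble.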

For more details, see this page.

Click here to download the dataset