Description of Dataset used in Fetchback® Competition

 

File Name: DataForClass.txt

File Size: ~786MB

Number of Tuples: 13,584,386

Number of Attributes in the Table: 4

 

 

The data was gathered by Fetchback® (www.fetchback.com), a retargeting company, for research use. It consists of the transaction records for one week (08/08/2010 – 08/14/2010).

 

In this dataset, each row represents one transaction with 4 attributes: pid, siteid, uid and date.

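· pid is the unique identifier for each product, typically 7 to 8 digits (the sample tuples below also include a 6-digit pid).

· siteid (labeled siteId in the file) is the unique identifier for each online shopping website, 4 digits.

· uid is the unique identifier for each user, 3 to 8 digits long; as the sample tuples show, it may carry leading zeros.

· date is the transaction date, in YYYY-MM-DD format.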

 

 

Sample tuples of the data:

 

pid:13505646 - siteId:9093 - uid:08097540 - date:2010-08-08

pid:16062417 - siteId:9102 - uid:95429188 - date:2010-08-08

pid:12546546 - siteId:7167 - uid:71516943 - date:2010-08-08

pid:691224 - siteId:4266 - uid:07079557 - date:2010-08-08

pid:4577421 - siteId:4266 - uid:07079557 - date:2010-08-08
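
The line format shown in the sample tuples can be parsed with a short script. The following is a minimal sketch in Python, assuming every row of the file follows the same "label:value - label:value" layout as the samples. The names Transaction, LINE_RE, parse_line and iter_transactions are hypothetical and not part of the dataset or the competition materials. Identifiers are kept as strings so that leading zeros in uid (e.g. 08097540) are not lost.

```python
import re
from typing import Iterable, Iterator, NamedTuple, Optional


class Transaction(NamedTuple):
    """One row of the file; all fields are kept as strings."""
    pid: str
    siteid: str
    uid: str
    date: str  # as written in the file, e.g. "2010-08-08"


# Assumed layout, taken from the sample tuples. Extra spaces around
# the "-" separators are tolerated.
LINE_RE = re.compile(
    r"pid:(\d+)\s*-\s*siteId:(\d+)\s*-\s*uid:(\d+)\s*-\s*date:(\d{4}-\d{2}-\d{2})"
)


def parse_line(line: str) -> Optional[Transaction]:
    """Return a Transaction, or None if the line does not match the layout."""
    match = LINE_RE.fullmatch(line.strip())
    return Transaction(*match.groups()) if match else None


def iter_transactions(lines: Iterable[str]) -> Iterator[Transaction]:
    """Lazily parse an iterable of lines, skipping blank or malformed ones."""
    for line in lines:
        record = parse_line(line)
        if record is not None:
            yield record
```

Because iter_transactions is a generator, an open file object can be passed to it directly and the rows are processed one at a time, which avoids loading the whole ~786MB file into memory.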