Throughout this chapter, we will use the dataset from https://sites.google.com/site/limkwanhui/datacode, which consists of data from Flickr, an image-hosting service, on users who take photos at different locations. These photos are mapped to known theme park attractions. Let's assume Flickr wants to create a plugin for its mobile app that, as users take photos at different attractions, identifies their preferences and recommends other attractions that might interest them.
Let's also suppose that the number of photos a user takes at a particular attraction is an indicator of their interest in that attraction. Our goal is to analyze a dataset of triples of the form (user ID, attraction, number of photos taken) so that given an arbitrary set of attractions...