CoRe: Exploiting the Personalized Influence of Two-dimensional Geographic Coordinates for Location Recommendations

advertisement
CoRe: Exploiting the Personalized Influence of
Two-dimensional Geographic Coordinates for Location
Recommendations
Jia-Dong Zhanga , Chi-Yin Chowa,∗
a
Department of Computer Science, City University of Hong Kong, Hong Kong
Abstract
With the rapid growth of location-based social networks (LBSNs), location
recommendations play an important role in shaping the life of individuals.
Fortunately, a variety of community-contributed data, such as geographical information, social friendships and residence information, enable us to
mine users’ reality and infer their preferences on locations. In this paper, we
propose an effective and efficient location recommendation framework called
CoRe. CoRe achieves three key goals in this work. (1) We model a personalized check-in probability density over the two-dimensional geographic
coordinates for each user. (2) We propose an efficient approximation approach to predict the probability of a user visiting a new location using her
personalized check-in probability density. (3) We develop a new method to
measure the similarity between users based on their social friendship and residence information, and then devise a fusion rule to integrate the geographical
influence with the social influence so as to improve the user preference mod∗
Corresponding author. Tel.: +852 3442-8679; Fax:+852 3442-0503.
Email addresses: jzhang26-c@my.cityu.edu.hk (Jia-Dong Zhang),
chiychow@cityu.edu.hk (Chi-Yin Chow)
Preprint submitted to Information Sciences
September 6, 2014
el on location recommendations. Finally, we conduct extensive experiments
to evaluate the recommendation accuracy, recommendation efficiency and
approximation error of CoRe using two large-scale real data sets collected
from two popular LBSNs: Foursquare and Gowalla. Experimental results show that CoRe achieves significantly superior performance compared to
other state-of-the-art geo-social recommendation techniques.
Keywords: Location-based social networks, Location recommendations,
Personalization, Geographical influence, Social influence
1. Introduction
In a location-based social network (LBSN) depicted in Figure 1, such as
Foursquare and Gowalla, users can establish social links with others, indicate
their residence, check in some points-of-interest (POIs), e.g., restaurants,
stores, and museums, and share their experiences of visiting specific locations
by leaving some tips or comments [12]. These community-contributed data
enable us to mine users’ reality [40, 42, 46]. For example, in LBSNs it is
prevalent to recommend locations for users based on their preferences inferred
from the community-contributed data [3, 44], which not only helps users
explore new places and enrich their life but also enables companies to launch
advertisements to potential customers and improve their profits.
The geographical feature of POIs distinguishes them from other nonspatial items, such as books, music and movies in conventional recommender
systems [5], because physical interactions are required for users to visit POIs.
Thus, the geographical information (geographic coordinates) of locations plays a significant influence on users’ check-in behaviors,
2
Social link
User
Check-in or Visit
POI or Location
<latitude, longitude>
Residence
Figure 1: A location-based social network [62]
known as geographical influence for short, which gives new opportunities
and challenges to infer users’ preferences for making location recommendations for them.
Limitations in existing methods. There are a few studies that learn
users’ preferences through exploiting the geographical influence for location
recommendations [9, 29, 36, 56]. Unfortunately, these studies use the onedimensional geographical distance between locations for location recommendations. For example, the literatures [29, 36] simply assume that the propensity of a user for a POI is inversely proportional to the distance between
the POI and her visited locations; more sophistically, two techniques [9, 56]
model the distance between locations visited by the same user as a power-law
distribution or multi-center Gaussian distribution. In general, these studies
suffer from two major limitations. (1) One-dimensional geographic distance influence. Existing work simplifies the geographical influence as a
one-dimensional and monotonous distance distribution, but it is more reasonable and intuitive to model the geographical influence as a two-dimensional
and multimodal check-in probability density. The reason is twofold. (a) The
probability of a user visiting a location is not simply monotonous respect-
3
(a) Foursquare
(b) Gowalla
Figure 2: Personal check-in probability densities
ing their distance, because the visiting probability is not only affected by
the distance but also the location’s intrinsic characteristics. For example,
in reality the check-in locations of a user are usually distributed in several
areas. (b) It is hard to compute a visiting probability for a location based
on a distance distribution, since it needs to find a reference location to derive a reasonable distance for the location in the first place. Conversely, it
is considerably intuitive to employ a two-dimensional check-in probability
density to compute a visiting probability for any location with latitude and
4
longitude. In addition, it is rational to use the two-dimensional density on
the spherical dimensions because the altitude is a dependent variable (i.e.,
the altitude is usually determined based on a pair of latitude and longitude)
and the earth is almost a sphere (i.e., the altitude of a location is negligible).
(2) Non-personalized geographical influence. They apply the same inversely proportional relation between the propensity and the distance for all
users [29, 36] or universally estimate a common distance distribution for all
users [9, 56], but in practice the geographical influence of locations should
be personalized since the user’s check-in behavior is unique [62].
Real-world motivating examples. To observe users’ unique checkin behaviors and multimodal check-in probability densities over the twodimensional geographic coordinates, a spatial analysis is conducted on two
publicly available real data sets collected from Foursquare [15] and Gowalla [11], which are two popular LBSNs. Specifically, we focus on three users
who are randomly chosen from each data set. Figure 2 depicts their individual check-in probability density over the two-dimensional geographic
coordinates. The probability density is estimated based on the kernel density estimation (KDE) technique [49]. We have the following two findings.
(1) The geographical influence of locations on these three users’ check-in behaviors is unique since their check-in probability densities are distinct from
each other. (2) These check-in probability densities are usually multimodal
rather than unimodal or monotonous.
Our approach. In this paper, we propose a new location recommendation framework, called CoRe, that achieves three key goals. (1) We infer
users’ preferences by exploiting the personalized two-dimensional geographi-
5
cal influence. Specifically, we estimate a personalized two-dimensional checkin probability density over the latitude and longitude coordinates for each
user rather than using a common one-dimensional distance distribution for all
users. In addition, since the check-in probability densities of users are diverse,
we cannot assume their forms. We thus use a nonparametric density estimation method, i.e., the popular kernel density estimation [49]. (2) We can use a
personalized check-in probability density to predict the probabilities of each
user visiting each of all new locations (that have not been visited by the user). In particular, we contrive an efficient approximation approach to derive
these visiting probabilities based on the fast Gauss transform [20], in order
to improve system efficiency and scalability. (3) We integrate the geographical influence with the social influence in the conventional social networks
to enhance the user preference model and obtain better quality of location
recommendations. First of all, to consider the social influence, we develop
a new method to measure the similarity between users based on their social
friendships and residence information, because nearby friends share more
commonly visited locations than others [11]; the similarity measure between
users is employed to estimate the rating of a user to a new location. We then
design a new fusion rule, called the union rule, to integrate the geographical
probability and social rating of a user to a new location into a unified score,
instead of applying the simple product or sum rule. Eventually, we can make
a location recommendation for each user by top-k locations with the highest
scores to her.
This study is significant different from our previous work [62]. The main
differences and contributions of this paper can be summarized as follows:
6
• Although both studies utilize the same kernel density estimation technique to model a geographical check-in probability density for each
user, this study exploits the two-dimensional geographic coordinates
of users’ check-in locations whereas the previous work [62] only considers the distance between users’ check-in locations. Actually, this is
the first study to learn users’ preferences for location recommendations
by utilizing the personalized two-dimensional geographical influence.
Moreover, to improve the computational complexity of location recommendations, this study proposes an efficient approximation approach
to predict the probability of a user visiting any new location rather
than using the exact computation method as in the previous work [62].
(Section 2)
• Both studies employ a fusion rule to integrate the geographical influence
with the social influence in order to improve the quality of location
recommendations. However, this study develops a more sophisticated
fusion rule whereas the previous work [62] applies the simple product or
sum rule. In addition, this work devises a new method to measure the
similarity between users based on their social friendships and residence
information. (Section 3)
• We conduct extensive experiments to evaluate not only the recommendation accuracy of CoRe as in the previous work [62], but also its recommendation efficiency and approximation error using two large-scale real
data sets collected from Foursquare and Gowalla. Experimental results
show two key facts. (1) CoRe outperforms the state-of-the-art geo-
7
social recommendation techniques including the multi-center Gaussian
model (MGM) [9], power-law distribution (PD) [56] and iGSLR [62] in
terms of recommendation accuracy and efficiency. (2) CoRe is much
more efficient than the exact method, and it only leads to small approximation errors. (Sections 4 and 5)
The remainder of this paper is organized as follows. We firstly model the
personalized two-dimensional geographical influence for each user and propose an efficient approach to predict the probabilities of the user visiting new
locations in Section 2. We then integrate geographical influence with social
influence through a new user similarity measuring method and a new fusion
rule in Section 3. In Sections 4 and 5, we present our experiment settings
and analyze the performance of CoRe, respectively. Section 6 highlights related work on location recommendations. Finally, we conclude this paper in
Section 7.
2. Modeling Personalized Two-dimensional Geographical Influence
In this section, we propose a kernel density estimation (KDE) approach to
model the personalized two-dimensional geographical influence in Section 2.1
and develop an efficient approximation approach to derive users’ preferences
in Section 2.2. Table 1 summarizes the key notations used in this paper.
2.1. A KDE-Based Approach to Model the Personalized Two-dimensional
Geographical Influence
The experimental results in Section 1 inspired us to learn the users’ preferences by utilizing the personalized two-dimensional geographical influence.
8
Table 1: Key notations in this paper
Notation
Meaning
U
Set of all users in an LBSN
u
Some user and u ∈ U
L
Set of all locations (or POIs) in an LBSN
l
Some location with the latitude and longitude coordinates and
l = (lat, lon)T ∈ L
Lu
Set of locations visited by user u (i.e., u’s check-in locations) and
Lu = {l1 , l2 , . . . , ln } ⊂ L
p(l|Lu )
Predicted probability of u visiting l given Lu
Ω
A set of cells that constitute a partition of the geographical space
B
Some cell and B ∈ Ω
lB
Center of B
Fu
Set of users having social links with u and Fu ⊂ U
ru′ ,l
Actual rating of user u′ to visited location l
r̂u,l
Estimated rating of user u to unvisited location l
ŝu,l
Unified score by fusing p(l|Lu ) and r̂u,l
To this end, we estimate a personalized two-dimensional check-in probability density for each user. Specifically, we apply a general nonparametric
probability density estimation technique, known as the kernel density estimation [49] (KDE). Opposed to parametric estimation methods, a nonpara-
9
metric estimation technique does not assume a fixed distribution form in
advance, but instead learn the distribution form from data. Thus, KDE can
be used with arbitrary distributions because it does not have any assumption
on the form of the underlying distribution.
Personalized two-dimensional check-in probability density estimation. Let Lu = {l1 , l2 , . . . , ln } be the set of locations visited by the user
u that are drawn from some distribution with an unknown density f . Its
kernel density estimator fˆ using Lu is given by:
)
(
n
∑
1
x
−
l
i
fˆ(x) =
K
,
nσ 2 i=1
σ
(1)
where each location li = (lati , loni )T is a two-dimensional column vector with
the latitude (lati ) and longitude (loni ), K(·) is the kernel function and σ is
a smoothing parameter, called the bandwidth. In this paper we apply the
widely used standard two-dimensional normal kernel [49]:
K(x) =
1
1
exp(− xT x),
2π
2
and the optimal bandwidth [49]:
σ=n
√
− 16
1 T
σ̂ σ̂,
2
(2)
(3)
where σ̂ is the marginal standard deviation vector of Lu .
Thus, given u’s set of visited locations Lu = {l1 , l2 , . . . , ln }, the probability of user u visiting a new location l can be computed by:
n
1
1 ∑
exp(− 2 (l − li )T (l − li )).
p(l|Lu ) =
2
2πnσ i=1
2σ
(4)
Computational complexity. Algorithm 1 outlines the process for computing p(l|Lu ) through Equation (4). Algorithm 1 calculates a probability
10
Algorithm 1 The exact computation of p(l|Lu )
Input: u’s set of visited locations Lu = {l1 , l2 , . . . , ln }.
Output: p(l|Lu ) for each location l ∈ L − Lu .
1: Compute the bandwidth σ using Equation (3)
2: for each unvisited location l ∈ L − Lu do
3:
z ← 0 // Initializing auxiliary variable z
4:
for each li ∈ Lu do
5:
z ← z + exp(−(l − li )T (l − li )/(2σ 2 ))
6:
end for
7:
p(l|Lu ) ← z/(2πnσ 2 )
8: end for
for each unvisited location l ∈ L − Lu in order to make location recommendations for u by returning the top-k locations with the highest probability.
It is required to evaluate the sum of n Gaussian exp(−(l − li )T (l − li )/(2σ 2 ))
at each target point l ∈ L − Lu , the computational complexity of which is
O(|L − Lu |n) = O(|L|n) in which |L| ≫ |Lu | = n since users only checked in
a small fraction of locations.
2.2. An Efficient Approximation Approach to Infer Users’ Preferences
In Algorithm 1 with the computational complexity O(|L|n), when users
have checked in many POIs (i.e., with a large value of n), the calculation on
recommending top-k POIs to these users is prohibitively expensive. In this
section, we approximately compute p(l|Lu ) through the fast Gauss transform [20] to reduce its complexity to be independent of n, i.e., O(|L|). The
fast Gauss transform is an optimized variant of the fast multipole method
11
for Gaussian distributions. As for another kernel functions, we can employ
the more general fast multipole method [19].
Fast Gauss transform. The fast Gauss transform shifts a Gaussian
exp(−(l − li )T (l − li )/(2σ 2 )) centered at li = (lati , loni )T to a sum of Hermite
polynomials centered at l0 = (lat0 , lon0 )T by the Hermite expansion, given
by:
(
)q (
)
c−1
∑
l − l0
1
1
li − l0
T
√
exp(− 2 (l − li ) (l − li )) =
hq √
+ ε(c), (5)
2σ
(q!)2
2σ
2σ
q=0
where
(
)q (
)q (
)q
li − l0
lati − lat0
loni − lon0
√
√
√
=
,
2σ
2σ
2σ
)
) (
)
(
(
l − l0
lati − lat0
loni − lon0
√
√
hq √
= hq
hq
,
2σ
2σ
2σ
(6)
(7)
the Hermite functions hq (x) are defined by:
hq (x) = (−1)q
dq −x2
e ,
dxq
(8)
and ε is the error due to truncating the infinite series after c terms. When
li = (lati , loni )T closes to l0 = (lat0 , lon0 )T , a small value of c is enough
to guarantee that the error ε is negligible as these terms converge to zero
quickly.
Efficient approximation algorithm. At first, to ensure li being close
to l0 , the geographical space is subdivided into a set of cells Ω using a uniform
grid, and each cell B ∈ Ω with center lB and size σ, i.e., the optimal
bandwidth in Equation (3), as depicted in Figure 3. Accordingly, each
location li ∈ Lu is shifted into the center lB of the cell B in which li lies.
12
6σ
5σ
4σ
3σ
2σ
σ
0
σ
2σ
3σ
4σ
5σ
6σ
7σ
8σ
9σ
10σ
11σ 12σ
Figure 3: An example of the uniform grid (cell size equals bandwidth in Equation (3))
Thus, the approximation of p(l|Lu ) in Equation (4) is given by:
n
1 ∑
1
exp(−
(l − li )T (l − li ))
2πnσ 2 i=1
2σ 2
(
)q (
)
c−1
li − lB
1 ∑∑∑ 1
l − lB
√
≈
hq √
2πnσ 2 B∈Ω l ∈B q=0 (q!)2
2σ
2σ
i
(
)q (
)
c−1
l − lB
1 ∑ ∑ 1 ∑ li − lB
√
hq √
=
2πnσ 2 B∈Ω q=0 (q!)2 l ∈B
2σ
2σ
i
)
(
c−1
1 ∑∑
l − lB
=
Aq (B)hq √
2πnσ 2 B∈Ω q=0
2σ
p(l|Lu ) =
together with
(
)q
1 ∑ li − lB
√
Aq (B) =
.
(q!)2 l ∈B
2σ
(9)
(10)
i
Computational complexity. Algorithm 2 summarizes the overall process for approximating p(l|Lu ) through Equation (9). Aq (B) is independent
of any target point l ∈ L − Lu and only required to be computed once for
13
Algorithm 2 The efficient approximation of p(l|Lu )
Input: u’s set of visited locations Lu = {l1 , l2 , . . . , ln } and a constant c.
Output: p(l|Lu ) for each location l ∈ L − Lu .
1: Compute the bandwidth σ using Equation (3)
2: Subdivide the geographical space into a uniform grid
3: Compute common items Aq (B) for each combination of q and cell B ∈ Ω
in terms of Equation (10)
4: for each unvisited location l ∈ L − Lu do
5:
z ← 0 // Initializing auxiliary variable z
6:
for each cell B ∈ Ω that contains l or is immediately adjacent to l do
7:
8:
9:
for q = 0 to c − 1 do
√
z ← z + Aq (B)hq ((l − lB )/( 2σ))
end for
10:
end for
11:
p(l|Lu ) ← z/(2πnσ 2 )
12: end for
each li ∈ Lu in advance, which avoids to separately calculate the Gaussian
exp(−(l − li )T (l − li )/(2σ 2 )) for each combination of l ∈ L − Lu and li ∈ Lu
in Algorithm 1. The pre-computation step requires O(n) work. The approximation step computes p(l|Lu ) using Aq (B) (Lines 4 to 12) and it needs
O(c|L − Lu ||Ω|) = O(|L − Lu |) work, in which c and |Ω| are a small constant
relative to |L − Lu |. Therefore, the computational complexity of Algorithm 2
is O(n) + O(|L − Lu |) = O(|L|). In addition, to accelerate the evaluation
of Equation (9), we also can cut off the sum over the set of all cells Ω by
only considering the nearest cells of l (Line 6), since the cells far away from
14
l contribute a little to p(l|Lu ). For example, consider Figure 3 again. Given
l lying in the red cell, we may just consider the red cell and the nearest blue
cells around it when estimating p(l|Lu ) based on Equation (9).
3. Integrating Geographical and Social Influences
In this section, we integrate the geographical influence with the social
influence in the conventional social networks in order to gain better user
preference model and quality of location recommendations.
3.1. Using Social Influence
To use the social influence, we design a new method to measure the similarity between users based on their social friendships and residence distance,
because nearby friends share more commonly visited locations than others [11]. Specifically, we transform the residence distance of users with social
friendships into a normalized similarity through a sigmoid function that is
widely used to map an output into a probability. Formally, let F (u) be a
set of users having social friendships with u and distance(u, u′ ) be the geographical distance between the residences of users u and u′ . If u′ ∈ F (u),
the similarity between users u and u′ is defined by the sigmoid function as
follows:
sim(u, u′ ) =
2
.
1 + exp(distance(u, u′ ))
(11)
Otherwise, sim(u, u′ ) = 0. Obviously, we have sim(u, u′ ) ∈ [0, 1].
Based on the similarity, we can predict the rating r̂u,l of a user u to a new
location l via the conventional user-based collaborative filtering technique:
∑
sim(u, u′ ) · ru′ ,l
′
r̂u,l = u∑∈U
,
(12)
′
u′ ∈U sim(u, u )
15
where ru′ ,l denotes the frequency of user u′ visiting location l.
3.2. The Union Fusion Rule
Here we consider the integration of the geographical influence with the
social influence to enhance user preference model and improve the quality of
location recommendations. An easy way is to apply the simple product or
sum rule to combine the visiting probability in Equation (9) and rating in
Equation (12) of a user regarding a location into a unified score. However,
the unified score from the product rule is dominated by a smaller value of
either the visiting probability p(l|Lu ) or the estimated rating r̂u,l . As an
example, supposing p(l|Lu ) = 0.9 and r̂u,l = 0.1, the unified score given by
the product rule is 0.09 that is nearly independent of p(l|Lu ) with a large
value but strongly dominated by r̂u,l with a small value. On the other hand,
it is considerably hard to determine the relative weights of p(l|Lu ) and r̂u,l
for the sum rule to achieve the best performance. Although the weights can
be learned from data, they cost much effort to find out their optimal settings
and usually suffer from over-fitting the data.
To this end, we design a new fusion rule, called the union rule, instead of
applying the simple product and sum rules. At first, the union rule values
the geographical influence and social influence equally important under the
condition of the lack of the prior knowledge about their effect. That is,
the predicted probability in Equation (9) and rating in Equation (12) are
transformed into a normalized probability with the same scale using
p̂(l|Lu ) =
p(l|Lu )
,
maxl∈(L−Lu ) p(l|Lu )
16
(13)
and
p̂u,l =
r̂u,l
maxl∈(L−Lu ) r̂u,l
,
(14)
where maxl∈(L−Lu ) p(l|Lu ) and maxl∈(L−Lu ) r̂u,l are the normalization constants for a given user u. Further, the union rule views the difference between
the probability and rating given by Equations (13) and (14) as an uncertain
indicator and utilizes it to penalize the average of them in order to get a unified score. The underlying reason is that the difference between them usually
implies the confidence on the average of them; the larger the difference is,
the lower the confidence will be. Concretely, the union rule estimates the
mean ŝµu,l and standard deviation ŝςu,l by:
ŝµu,l =
p̂(l|Lu ) + p̂u,l
2
(15)
together with
ŝςu,l =
√
(p̂(l|Lu ) − ŝµu,l )2 + (p̂u,l − ŝµu,l )2
2
.
(16)
Then it penalizes the mean ŝµu,l with the standard deviation ŝςu,l to obtain the
final score ŝu,l of user u visiting location l, given by
ŝu,l = ŝµu,l − ŝςu,l .
(17)
It is important to note that: (1) The union rule in Equation (17) addresses
the limitations of the simple product and sum rules, since in the union rule
neither the probability nor the rating could dominate the final score and there
is no need to derive their relative weights. (2) If the difference between the
probability and rating determined by Equations (13) and (14), respectively,
is large, the final score given by the union rule will be a little far from their
17
Algorithm 3 The fusion of the geographical and social influences
Input: u’s set of visited locations Lu = {l1 , l2 , . . . , ln }.
Output: ŝu,l for each location l ∈ L − Lu .
1: for each unvisited location l ∈ L − Lu do
2:
Compute the visiting probability of u to l by Algorithm 2
3:
Compute the similarity of u with social friends by Equation (11)
4:
Compute the rating of u to l by Equation (12)
5:
Normalize the probability and rating by Equations (13) and (14)
6:
Estimate the mean and standard deviation of the normalized probability and rating by Equations (15) and (16)
7:
Compute the final score of u to l by Equation (17)
8: end for
average due to the penalty. (3) It is easy to show that the final score ŝu,l lies
in [0, 1]. We summarize the fusion process of the geographical influence with
the social influence in Algorithm 3.
4. Experimental Evaluation
In this section, we describe our experiment settings for evaluating the
performance of CoRe against the state-of-the-art geo-social recommendation
techniques. Specifically, our evaluation focuses on three aspects of CoRe:
(1) its recommendation accuracy (i.e., precision and recall); (2) its approximation error in comparison to the exact method; and (3) its recommendation
efficiency.
18
Table 2: Statistics of the two data sets
Foursquare Gowalla
Number of users
11,326
196,591
Number of locations (POIs)
182,968
1,280,969
Number of check-ins
1,385,223
6,442,890
Number of social links
47,164
950,327
User-location matrix density
2.3 × 10−4
2.9 × 10−5
Avg. No. of visited POIs per user
42.44
37.18
Avg. No. of check-ins per location 2.63
3.11
4.1. Data Sets
We use two publicly available large-scale real check-in data sets1 that were
crawled from Foursquare [15] and Gowalla [11]. The statistics of the data
sets are shown in Table 2. Figure 4 depicts the distribution of the locations
in the data sets on a world map.
4.2. Evaluated Recommendation Techniques
The recommendation techniques implemented in our experiments are classified into three categories based on the information they use in the location
recommendations.
1
The check-in data sets used for our experiments can be downloaded from http://www.
public.asu.edu/~hgao16/Publications.html and http://snap.stanford.edu/data/
loc-gowalla.html.
19
90
60
Latitude: °
30
0
−30
−60
−90
−180 −150 −120 −90
−60
−30
0
30
Longitude: °
60
90
120
150
180
(a) Foursquare
90
Latitude: °
60
30
0
−30
−60
−90
−180 −150 −120 −90
−60
−30
0
30
Longitude: °
60
90
120
150
180
(b) Gowalla
Figure 4: Distribution of locations on a world map
(1) Geographical category:
• MGM: This technique models the distance between visited locations
and centers (i.e., the most popular locations) as a multi-center Gaussian
20
distribution [9].
• PD: This technique models the distance between every pair of locations
visited by the same user as a power-law distribution [56].
• iGSLR: This technique models the distance between every pair of locations visited by the same user as a nonparametric distribution [62].
• CoRe: Our technique models a personalized two-dimensional check-in
probability density over the geographic coordinates for each user using
Algorithm 2.
• Exact: The only difference between Exact and CoRe is that Exact computes the exact probability of a user visiting a new location using Algorithm 1, while CoRe uses the approximation method to compute the
visiting probability.
(2) Social category:
• SCF: This technique measures the similarity between users by using
their social friendships and then employs the social collaborative filtering technique [5].
• CoRe: Our technique measures the similarity between users based on
their social friendships and residence distance in Equation (11) and also
utilizes the collaborative filtering technique in Equation (12).
(3) Fusion category:
• Prod: This fusion technique applies the product rule to integrate the
geographical and social influences.
21
• Sum: This technique uses the sum rule to integrate the geographical
and social influences. It is worth mentioning that before applying the
sum rule, the probability and rating have been normalized using Equations (13) and (14), respectively, which is different from our previous
work [62] and improves the performance of the sum rule.
• CoRe: Our technique applies the union rule to integrate the geographical and social influences based on Algorithm 3.
Note that we implement a different version of CoRe for each
category using the corresponding influence to achieve fair comparisons in Section 5.
4.3. Performance Metrics
Recommendation accuracy. In general, recommendation techniques
compute a score for each candidate item (i.e., a location or POI in this paper)
regarding a target user and return locations with the top-k highest scores
as a recommendation result to the target user. To evaluate the quality of
location recommendations, it is important to find out how many recommended locations are actually visited by the target user in the testing data set.
Also, it is important to know how many of the locations actually visited were
recommended by the evaluated technique. The former aspect is captured by
precision and the latter by recall. Precision and recall are standard metrics
used to evaluate the recommendation accuracy [9, 56]. We define a discovered
location as a location that is both recommended and actually visited by the
target user. Formally,
22
• Precision defines the ratio of the number of discovered locations to the
total number k of recommended locations, i.e.,
precision =
the number of discovered locations
.
k
(18)
• Recall defines the ratio of the number of discovered locations to the
total number of actually visited locations by the target user in the
testing set, i.e.,
recall =
the number of discovered locations
.
the number of visited locations
(19)
Approximation error. We evaluate the approximation error of CoRe
by comparing its recommendation accuracy with that of Exact (i.e., Algorithm 1). Note that we are more concerned about the effect of approximation
error on the recommendation accuracy than the error itself.
Recommendation efficiency. We compare the running time of CoRe
and Exact with respect to various numbers of check-in locations of users. All
algorithms were implemented in Matlab and run on a machine with 3.4GHz
Intel Core i7 Processor and 16GB RAM.
4.4. Experiment Settings
We split each data set into the training set and the testing set in terms
of the check-in time rather than using a random partition method, because
in practice we can only utilize the past check-in data to predict the future
check-in events. A half of check-in data with earlier timestamps are used as
the training set and the other half of check-in data are used as the testing
set. In CoRe, unless otherwise specified, in Algorithm 2 the cell size is set to
23
the optimal bandwidth in Equation (3) and the number of truncated terms
in Equation (5) is set to a small value: c = 6.
5. Experimental Results
This section analyzes our extensive experimental results. We first compare our CoRe against the state-of-the-art geo-social recommendation techniques in terms of the recommendation accuracy (Section 5.1). We then
study the approximation error of CoRe (Section 5.2). Finally, we evaluate
the recommendation efficiency of CoRe (Section 5.3).
5.1. Recommendation Accuracy
Here we separately investigate the performance of three new techniques
proposed in this paper, including the efficient approximation approach to
exploit the personalized two-dimensional geographical influence in Algorithm 2 (Section 5.1.1), the method to measure the similarity of users in Equation (11) (Section 5.1.2), and the union rule to integrate the geographical
and social influences in Algorithm 3 (Section 5.1.3). Moreover, we discuss
the effect of the number k of recommended locations for users (Section 5.1.4),
the number n of check-in locations of users (Section 5.1.5), and the cell size
on the recommendation accuracy (Section 5.1.6).
5.1.1. Geographical Recommendation Techniques
Figures 5 and 6 compare the recommendation accuracy of CoRe (i.e., Algorithm 2), iGSLR [62], PD [56] and MGM [9] with the effect of the number
of recommended locations for users (top-k) and the number of check-in locations of users (given-n) using two large-scale real data sets collected from
24
0.3
0.3
MGM
PD
iGSLR
CoRe
0.25
0.25
0.2
Recall
Precision
0.2
0.15
0.15
0.1
0.1
0.05
0.05
0
0
5 10 15 20 25 30 35 40 45 50
top−k
(a) Foursquare (Precision)
5 10 15 20 25 30 35 40 45 50
top−k
(b) Foursquare (Recall)
0.3
0.3
MGM
PD
iGSLR
CoRe
0.25
0.25
0.15
0.15
0.1
0.1
0.05
0.05
0
MGM
PD
iGSLR
CoRe
0.2
Recall
0.2
Precision
MGM
PD
iGSLR
CoRe
5 10 15 20 25 30 35 40 45 50
top−k
(c) Gowalla (Precision)
0
5 10 15 20 25 30 35 40 45 50
top−k
(d) Gowalla (Recall)
Figure 5: Comparison of geo-techniques on the number of recommended locations k (CoRe
is Algorithm 2)
25
0.3
0.3
MGM
PD
iGSLR
CoRe
0.25
0.25
0.2
Recall
Precision
0.2
0.15
0.15
0.1
0.1
0.05
0.05
0
10 20 30 40 50 60 70 80 90 100
given−n
0
10 20 30 40 50 60 70 80 90 100
given−n
(a) Foursquare (Precision)
(b) Foursquare (Recall)
0.3
0.3
MGM
PD
iGSLR
CoRe
0.25
0.25
MGM
PD
iGSLR
CoRe
0.2
Recall
0.2
Precision
MGM
PD
iGSLR
CoRe
0.15
0.15
0.1
0.1
0.05
0.05
0
10 20 30 40 50 60 70 80 90 100
given−n
(c) Gowalla (Precision)
0
10 20 30 40 50 60 70 80 90 100
given−n
(d) Gowalla (Recall)
Figure 6: Comparison of geo-techniques on the number of check-in locations n (CoRe is
Algorithm 2)
26
Foursquare and Gowalla, respectively. Note that k is set to a smaller number
than n because a large number of recommended locations may not be helpful
for users. As a whole, our proposed method CoRe always exhibits the
best precision and recall for all the values of k and n. The details are
demonstrated as follows.
MGM models the distance between a location and a center as a universal multi-center Gaussian distribution for all users. Actually, its estimated
probability of p(l|Lu ) is independent of user u or Lu , that is, it recommends
the same set of locations for all users. As a result, it performs the worst.
PD models the distance between every pair of locations visited by the
same user as a power-law distribution for all users. Although it enhances
the performance of recommending locations in comparison to MGM, it still
inherits the limitation of the universal distance distribution for all users.
iGSLR estimates an individual distance distribution for each user and
then improves the recommendation accuracy compared to PD, but the improvement is very limited since it still simplifies the geographical influence
as one-dimensional distance distributions just as in MGM and PD.
CoRe estimates a personalized two-dimensional check-in probability density over the geographic coordinates for each user and always shows the best
recommendation quality in terms of precision and recall. These results verify
the superiority of exploiting the personalized two-dimensional geographical
influence for location recommendations.
5.1.2. Social Recommendation Techniques
Figures 7 and 8 contrast the recommendation accuracy of CoRe (i.e.,
Equation (12)) with that of SCF [5] on the Foursquare and Gowalla data
27
0.3
0.3
SCF
CoRe
0.25
0.2
0.2
Recall
Precision
0.25
0.15
0.15
0.1
0.1
0.05
0.05
0
0
5 10 15 20 25 30 35 40 45 50
top−k
(a) Foursquare (Precision)
5 10 15 20 25 30 35 40 45 50
top−k
(b) Foursquare (Recall)
0.3
0.3
SCF
CoRe
0.25
0.2
0.2
0.15
0.15
0.1
0.1
0.05
0.05
0
SCF
CoRe
0.25
Recall
Precision
SCF
CoRe
5 10 15 20 25 30 35 40 45 50
top−k
(c) Gowalla (Precision)
0
5 10 15 20 25 30 35 40 45 50
top−k
(d) Gowalla (Recall)
Figure 7: Comparison of social techniques on the number of recommended locations k
(CoRe is Equation (12))
sets. By transforming the residence distance and social friendships of users
into a normalized similarity through a sigmoid function in Equation (11),
CoRe is able to reflect the fact that nearby friends share more commonly
visited locations than others. As a result, CoRe peforms better than SCF in
terms of both precision and recall.
On the other hand, both CoRe and SCF suffer from the effect of data
28
0.3
0.3
SCF
CoRe
0.25
0.2
0.2
Recall
Precision
0.25
0.15
0.15
0.1
0.1
0.05
0.05
0
10 20 30 40 50 60 70 80 90 100
given−n
0
10 20 30 40 50 60 70 80 90 100
given−n
(a) Foursquare (Precision)
(b) Foursquare (Recall)
0.3
0.3
SCF
CoRe
0.25
SCF
CoRe
0.25
0.2
0.2
Recall
Precision
SCF
CoRe
0.15
0.15
0.1
0.1
0.05
0.05
0
10 20 30 40 50 60 70 80 90 100
given−n
(c) Gowalla (Precision)
0
10 20 30 40 50 60 70 80 90 100
given−n
(d) Gowalla (Recall)
Figure 8: Comparison of social techniques on the number of check-in locations n (CoRe is
Equation (12))
sparsity. For instance, their precision and recall in the Gowalla data set are
obviously lower than those in the Foursquare data set, because the density
of the Gowalla data set is one order-of-magnitude lower than that of the
Foursquare data set as shown in Table 2.
29
0.5
0.5
Prod
Sum
CoRe
0.4
Recall
Precision
0.4
0.3
0.2
0.1
Prod
Sum
CoRe
0.3
0.2
5
0.1
10 15 20 25 30 35 40 45 50
top−k
5
(a) Foursquare (Precision)
(b) Foursquare (Recall)
0.5
0.5
Prod
Sum
CoRe
Prod
Sum
CoRe
0.4
Recall
Precision
0.4
0.3
0.2
0.1
10 15 20 25 30 35 40 45 50
top−k
0.3
0.2
5
10 15 20 25 30 35 40 45 50
top−k
(c) Gowalla (Precision)
0.1
5
10 15 20 25 30 35 40 45 50
top−k
(d) Gowalla (Recall)
Figure 9: Comparison of fusion rules on the number of recommended locations k (CoRe is
Algorithm 3)
5.1.3. Fusion Rules
Figures 9 and 10 depict the performance of three fusion rules for integrating geographical and social influences, including the product rule (Prod), the
sum rule (Sum), and the union rule (CoRe, i.e., Algorithm 3).
Prod and Sum are comparable to each other on both Foursquare and
Gowalla data sets and they improve the performance of the geo-social rec-
30
0.5
0.5
Prod
Sum
CoRe
0.4
Recall
Precision
0.4
0.3
0.3
0.2
0.1
10 20 30 40 50 60 70 80 90 100
given−n
0.1
10 20 30 40 50 60 70 80 90 100
given−n
(a) Foursquare (Precision)
(b) Foursquare (Recall)
0.5
0.5
Prod
Sum
CoRe
0.4
Recall
0.4
Precision
Prod
Sum
CoRe
0.2
0.3
0.3
Prod
Sum
CoRe
0.2
0.2
0.1
10 20 30 40 50 60 70 80 90 100
given−n
0.1
10 20 30 40 50 60 70 80 90 100
given−n
(c) Gowalla (Precision)
(d) Gowalla (Recall)
Figure 10: Comparison of fusion rules on the number of check-in locations n (CoRe is
Algorithm 3)
ommendation techniques only using either geographical information or social
information in Sections 5.1.1 or 5.1.2, respectively. Promisingly, using the
union rule that considers the difference of two scores in Equations (13) and
(14) given by the geographical and social recommendation techniques respectively as an uncertain indicator to penalize the average of them, CoRe can
further raise the precision and recall in comparison to Prod and Sum.
31
5.1.4. Effect of Number of Recommended Locations
Figures 5, 7 and 9 describe the recommendation accuracy of a variety
of recommendation techniques with respect to the change of the number of
recommended locations, i.e., k from 5 to 50. As k increases, the precision
gradually gets lower but the recall steadily becomes higher on the two data
sets. Our explanation is that, in general, by returning more locations for
users, it is able to discover more locations that users would like to visit.
However, the extra recommended locations are less possible to be liked by
users due to the lower visiting probabilities of these locations.
5.1.5. Effect of Number of Check-in Locations
Figures 6, 8 and 10 depict the recommendation accuracy of a variety
of recommendation techniques with regard to various numbers of check-in
locations of users, i.e., n from 10 to 100, on the two data sets. As users visit
more locations, more check-in data are available for these recommendation
techniques; hence, they can more accurately estimate the scores of these users
for new locations. Accordingly, the precision inclines. Nonetheless, users who
have checked in many locations in the training data set usually have visited
many locations in the testing data set, so the recall fluctuates.
5.1.6. Effect of Cell Sizes
Heretofore, the cell size in CoRe is set to the optimal bandwidth σ in
Equation (3). Figure 11 depicts the effect of cell sizes on the recommendation
accuracy of CoRe in the two data sets. (1) In general, CoRe achieves the
best precision and recall when the cell size equals the optimal bandwidth
σ. (2) With the decrease of the cell size, the recommendation accuracy
32
0.25
Foursquare
Gowalla
Precision
0.2
0.15
0.1
0.05
0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2
Cell size (unit: σ)
(a) Precision
0.25
Foursquare
Gowalla
Recall
0.2
0.15
0.1
0.05
0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2
Cell size (unit: σ)
(b) Recall
Figure 11: Effect of cell sizes on recommendation accuracy of CoRe (Algorithm 2)
of CoRe gradually declines. The reason is that: For efficiency CoRe only
takes into account the nearest cells of a location when computing its visiting
probability according to Algorithm 2 and a smaller cell size leads to an end
that more visited locations out of the nearest cells are ignored. (3) As the cell
size increases, the recommendation accuracy of CoRe decreases as well. Our
explanation is that: A larger cell size accordingly requires a larger number of
truncated terms in Equation (5) in order to complete the same approximation
33
0.22
0.2
0.2
0.18
0.18
Recall
Precision
0.22
0.16
0.14
Exact
CoRe
0.12
0.16
0.14
0.1
0.08
0.1
1
0.08
2 3 4 5 6 7 8 9 10
No. of the truncated terms: c
(a) Foursquare (Precision)
1
0.22
0.22
0.2
0.2
0.18
0.18
0.16
0.14
Exact
CoRe
0.12
0.16
0.14
Exact
CoRe
0.12
0.1
0.08
2 3 4 5 6 7 8 9 10
No. of the truncated terms: c
(b) Foursquare (Recall)
Recall
Precision
Exact
CoRe
0.12
0.1
1
2 3 4 5 6 7 8 9 10
No. of the truncated terms: c
(c) Gowalla (Precision)
0.08
1
2 3 4 5 6 7 8 9 10
No. of the truncated terms: c
(d) Gowalla (Recall)
Figure 12: Approximation error of CoRe (Algorithm 2) in comparison to Exact (Algorithm 1)
accuracy.
5.2. Approximation Error
Figure 12 compares the approximation error of CoRe (Algorithm 2) with
that of Exact (Algorithm 1) regarding the change of the number of truncated
terms in Equation (5) in terms of the recommendation accuracy rather than
the error itself, since the error itself is not meaningful for location recom-
34
mendations and the effect of approximation error on the recommendation
accuracy is more significant.
As shown in Figure 12, with the increase of the number of truncated
terms, the precision and recall of CoRe quickly rise and approach to the performance of Exact in both the Foursquare and Gowalla data sets. The reason
is that the approximation error caused by truncating the infinite series with
the first c terms descends exponentially. For instance, in previous experiments, the default value of c is set to 6, in which the caused approximation
error is negligible according to the experimental results depicted in Figure 12.
5.3. Recommendation Efficiency
Figure 13 depicts the running time of CoRe (Algorithm 2), Exact (Algorithm 1), MGM, PD and iGSLR with respect to various numbers of check-in
locations of users. (1) With the increase of the number of check-in locations
of users, CoRe maintains a low consistent running time while Exact takes
a linearly increasing running time, since CoRe has the computational complexity of O(|L|) and Exact with O(|L|n). (2) Although the running time
of MGM also remains constant because of its computational complexity of
O(M |L|) in which M is the number of centers (i.e., the most popular POIs),
MGM requires more running time than CoRe. (3) As the number of check-in
locations of users gets larger, the running time of both PD and iGSLR with
O(|L|n) also increases linearly, but increasing faster than that of Exact, not
to mention CoRe.
Therefore, CoRe is more scalable to the web-scale calculation in the process of location recommendations.
In addition, all evaluated techniques
take more time to recommend locations in the Gowalla data set than the
35
3.5
MGM
PD
iGSLR
Exact
CoRe
Running time (seconds)
3
2.5
2
1.5
1
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100
No. of check−in locations of users: n
(a) Foursquare
20
Running time (seconds)
18
16
MGM
PD
iGSLR
Exact
CoRe
14
12
10
8
6
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100
No. of check−in locations of users: n
(b) Gowalla
Figure 13: Recommendation efficiency of CoRe (Algorithm 2) in comparison to Exact
(Algorithm 1), MGM, PD and iGSLR
36
Foursquare data set, because the former contains more location candidates
for recommendations than the latter as shown in Table 2.
6. Related Work
In this section, we highlight related work about location recommendations
in location-based social networks [12].
Mobile context-aware recommender systems. Location recommendations have been widely studied in mobile context-aware recommender systems for tourism [18]. In addition to user locations, potentially useful context
information contains mobility trajectory, distance from POIs, budget, time,
weak day, season, time available for sightseeing, transportation mode, weather condition, etc. For example, to make location recommendations, Barranco
et al. [4] employed users’ mobility trajectories and speeds, Savage et al. [47]
utilized users’ transportation modes and moods, and the works [1, 17] focus
on using time of a day and weather conditions. However, most these studies
do not take into account the social interactions between users.
Social recommendations. With the rapid growth of social networks,
like Facebook and Twitter, social network information, e.g., personal profile
content, tags and friendships, has been utilized to improve the quality of
recommender systems. Such information can be considered as implicit user
feedbacks to alleviate the data sparsity problem of explicit user feedbacks (i.e.,
the rating matrix) in traditional recommender systems [35, 50]. In particular,
recommendation techniques integrating social friendships have been widely
studied [5], including model-based methods [13, 25, 26, 27, 33, 35, 38, 41, 43,
48, 53, 54] and memory-based methods [24, 28]. The rationale behind these
37
methods is that friends are more likely to share common interests and thus
the social influence should be considered when making recommendations.
Location recommendations in LBSNs. Some recent studies provide POI recommendations by using the conventional collaborative filtering
techniques on users’ check-in data [14, 21, 45, 64, 65, 70], GPS trajectory
data [8, 23, 30, 40, 58, 66, 67, 68, 69], or text data [22, 37, 51, 52]. However,
these studies have not leveraged any geographical influence when generating
recommendations. In reality, the geographical information of users and locations plays a significant influence on users’ check-in behaviors [9, 56], since
physical interactions are required for users to visit locations that are totally
different from other non-spatial items, e.g., books, music and movies [56].
Location recommendations using geographical influence. To exploit geographical influence for improving the quality of location recommendations, some techniques [15, 16, 31, 55, 59, 61] employ the geographical
influence of users to derive their similarity weights as an input of the conventional collaborative filtering techniques [6, 7]. However, the performance
is considerably limited due to no consideration for the geographical influence
of locations. In contrast, other techniques explore the geographical influence
of locations. For example, the studies [2, 10, 44, 57] view locations as ordinary non-spatial items and consider the geographical influence of locations
by predefining a range; locations only within this range will be possibly recommended to users. The literatures [29, 32, 34] present a geo-topic model
respectively by assuming that if a location is closer to the locations visited
by a user or the current location of a user, it is more likely to be visited by
the same user. More sophistically, some works model the distance between
38
two locations visited by the same user as a common distribution for all users, e.g., a power-law distribution [36, 39, 56, 60] or a multi-center Gaussian
distribution [9]. In particular, our previous papers [62, 63] personalize the
geographical influence by modeling the distance between locations visited by
the same user as a personalized distance distribution for each user.
7. Conclusion
In this paper, we have proposed an effective and efficient location recommendation framework called CoRe. In CoRe, we have modeled the personalized two-dimensional geographical influence and employed the kernel density estimation technique to derive the personalized two-dimensional check-in
probability density for a user visiting a new location. To improve the system
efficiency and scalability, we have used the fast Gauss transform to develop
the approximation method to compute visiting probabilities. In addition, we
have also designed a new method to measure the similarity between users
based on their social friendships and residence information and a new fusion
rule to combine the geographical and social influences into a unified score.
Finally, we have conducted extensive experiments to evaluate the recommendation accuracy, recommendation efficiency, and approximation errors
of CoRe using two data sets from Foursquare and Gowalla. Experimental results show that CoRe outperforms all other evaluated recommendation
techniques.
39
References
[1] L. Baltrunas, B. Ludwig, S. Peer, F. Ricci, Context relevance assessment and exploitation in mobile recommender systems, Personal and
Ubiquitous Computing 16 (5) (2012) 507–526.
[2] J. Bao, Y. Zheng, M. F. Mokbel, Location-based and preference-aware
recommendation using sparse geo-social networking data, in: Proceedings of the 20th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Redondo Beach, CA, 2012,
pp. 199–208.
[3] J. Bao, Y. Zheng, D. Wilkie, M. F. Mokbel, A survey on recommendations in location-based social networks, submitted to GeoInformatica.
[4] M. Barranco, J. Noguera, J. Castro, L. Martı́nez, A context-aware mobile recommender system based on location and trajectory, in: J. Casillas, F. J. Martı́nez-López, J. M. Corchado Rodrı́guez (eds.), Management Intelligent Systems, Springer, Berlin, 2012, pp. 153–162.
[5] A. Bellogı́n, I. Cantador, P. Castells, A comparative study of heterogeneous item recommendations in social systems, Information Sciences
221 (2013) 142–169.
[6] J. Bobadilla, A. Hernando, F. Ortega, A. Gutiérrez, Collaborative filtering based on significances, Information Sciences 185 (2012) 1–17.
[7] F. Cacheda, V. Carneiro, D. Fernández, V. Formoso, Comparison of
collaborative filtering algorithms: Limitations of current techniques and
40
proposals for scalable, high-performance recommender systems, ACM
Transactions on the Web 5 (1) (2011) 2:1–24:33.
[8] L. Chen, M. Lv, Q. Ye, G. Chen, J. Woodward, A personal route prediction system based on trajectory data mining, Information Sciences
181 (7) (2011) 1264–1284.
[9] C. Cheng, H. Yang, I. King, M. R. Lyu, Fused matrix factorization with
geographical and social influence in location-based social networks, in:
Proceedings of the 26th National Conference on Artificial Intelligence,
Toronto, Canada, 2012, pp. 17–23.
[10] C. Cheng, H. Yang, M. R. Lyu, I. King, Where you like to go next: Successive point-of-interest recommendation, in: Proceedings of the 23rd
International Joint Conference on Artificial Intelligence, Beijing, China,
2013, pp. 2605–2611.
[11] E. Cho, S. A. Myers, J. Leskovec, Friendship and mobility: User movement in location-based social networks, in: Proceedings of the 17th ACM
SIGKDD International Conference on Knowledge Discovery and Data
Mining, San Diego, CA, 2011, pp. 1082–1090.
[12] H. Costa, L. H. C. Merschmann, F. Barth, F. Benevenuto, Pollution,
bad-mouthing, and local marketing: The underground of location-based
social networks, Information Sciences 279 (2014) 123–137.
[13] P. Cui, F. Wang, S. Liu, M. Ou, S. Yang, L. Sun, Who should share
what?: Item-level social influence prediction for users and posts ranking,
in: Proceeding of the 34th International ACM SIGIR Conference on
41
Research and Development in Information Retrieval, Beijing, China,
2011, pp. 185–194.
[14] H. Gao, J. Tang, X. Hu, H. Liu, Exploring temporal effects for location
recommendation on location-based social networks, in: Proceedings of
the 7th ACM Conference on Recommender Systems, Hong Kong, China,
2013, pp. 93–100.
[15] H. Gao, J. Tang, H. Liu, gSCorr: Modeling geo-social correlations for
new check-ins on location-based social networks, in: Proceedings of
the 21st ACM International Conference on Information and Knowledge
Management, Maui, HI, 2012, pp. 1582–1586.
[16] H. Gao, J. Tang, H. Liu, Addressing the cold-start problem in location
recommendation using geo-social correlations, Data Mining and Knowledge Discovery (2014) accepted to appear.
[17] D. Gavalas, M. Kenteris, A web-based pervasive recommendation system
for mobile tourist guides, Personal and Ubiquitous Computing 15 (7)
(2011) 759–770.
[18] D. Gavalas, C. Konstantopoulos, K. Mastakas, G. Pantziou, Mobile recommender systems in tourism, Journal of Network and Computer Applications 39 (2014) 319–333.
[19] L. Greengard, V. Rokhlin, A fast algorithm for particle simulations,
Journal of Computational Physics 73 (2) (1987) 325–348.
[20] L. Greengard, J. Strain, The fast Gauss transform, SIAM Journal on
Scientific Computing 12 (1) (1991) 79–94.
42
[21] J. Han, H. Lee, Adaptive landmark recommendations for travel planning: Personalizing and clustering landmarks using geo-tagged social
media, Pervasive and Mobile Computing (2014) accepted to appear.
[22] B. Hu, M. Ester, Spatial topic modeling in online social media for location recommendation, in: Proceedings of the 7th ACM Conference on
Recommender Systems, Hong Kong, China, 2013, pp. 25–32.
[23] C.-C. Hung, W.-C. Peng, W.-C. Lee, Clustering and aggregating clues
of trajectories for mining trajectory patterns and routes, The VLDB
Journal (2011) accepted to appear.
[24] M. Jamali, M. Ester, TrustWalker: A random walk model for combining trust-based and item-based recommendation, in: Proceedings of the
15th ACM SIGKDD International Conference on Knowledge Discovery
and Data Mining, Paris, France, 2009, pp. 397–406.
[25] M. Jiang, P. Cui, R. Liu, Q. Yang, F. Wang, W. Zhu, S. Yang, Social
contextual recommendation, in: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, Maui,
HI, 2012, pp. 45–54.
[26] M. Jiang, P. Cui, F. Wang, Q. Yang, W. Zhu, S. Yang, Social recommendation across multiple relational domains, in: Proceedings of the 21st
ACM International Conference on Information and Knowledge Management, Maui, HI, 2012, pp. 1422–1431.
[27] M. Jiang, P. Cui, F. Wang, W. Zhu, S. Yang, Recommendation with
43
social contextual information, IEEE Transactions on Knowledge and
Data Engineering (2014) accepted to appear.
[28] I. Konstas, V. Stathopoulos, J. M. Jose, On social networks and collaborative recommendation, in: Proceedings of the 32nd International
ACM SIGIR Conference on Research and Development in Information
Retrieval, Boston, MA, 2009, pp. 195–202.
[29] T. Kurashima, T. Iwata, T. Hoshide, N. Takaya, K. Fujimura, Geo
topic model: Joint modeling of user’s activity area and interests for
location recommendation, in: Proceedings of the 6th ACM International
Conference on Web Search and Data Mining, Rome, Italy, 2013, pp.
375–384.
[30] K. W.-T. Leung, D. L. Lee, W.-C. Lee, CLR: A collaborative location
recommendation framework based on co-clustering, in: Proceeding of
the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Beijing, China, 2011, pp. 305–314.
[31] J. J. Levandoski, M. Sarwat, A. Eldawy, M. F. Mokbel, LARS: A
location-aware recommender system, in: Proceedings of the 28th IEEE
International Conference on Data Engineering, Washington, DC, 2012,
pp. 450–461.
[32] Y.-M. Li, C.-L. Chou, L.-F. Lin, A social recommender mechanism for
location-based group commerce, Information Sciences 274 (2014) 125–
142.
44
[33] Y.-M. Li, H.-W. Hsiao, Y.-L. Lee, Recommending social network applications via social filtering mechanisms, Information Sciences 239 (2013)
18–30.
[34] D. Lian, C. Zhao, X. Xie, G. Sun, E. Chen, Y. Rui, GeoMF: Joint geographical modeling and matrix factorization for point-of-interest
recommendation, in: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, accepted
to appear, New York City, NY, 2014.
[35] C. Lin, R. Xie, X. Guan, L. Li, T. Li, Personalized news recommendation
via implicit social experts, Information Sciences 254 (2014) 1–18.
[36] B. Liu, Y. Fu, Z. Yao, H. Xiong, Learning geographical preferences for
point-of-interest recommendation, in: Proceedings of the 19th ACM
SIGKDD International Conference on Knowledge Discovery and Data
Mining, Chicago, IL, 2013, pp. 1043–1051.
[37] B. Liu, H. Xiong, Point-of-interest recommendation in location based
social networks with topic and location awareness, in: Proceedings of
the SIAM International Conference on Data Mining, Austin, TX, 2013,
pp. 396–404.
[38] C. Liu, H.-C. Yang, J. Fan, L.-W. He, Y.-M. Wang, Distributed nonnegative matrix factorization for web-scale dyadic data analysis on mapreduce, in: Proceedings of the 19th International Conference on World
Wide Web, Raleigh, NC, 2010, pp. 681–690.
45
[39] X. Liu, Y. Liu, K. Aberer, C. Miao, Personalized point-of-interest recommendation by mining users’ preference transition, in: Proceedings of
the 22nd ACM International Conference on Information and Knowledge
Management, San Francisco, CA, 2013, pp. 733–738.
[40] M. Lv, L. Chen, G. Chen, Mining user similarity based on routine activities, Information Sciences 236 (2013) 17–32.
[41] H. Ma, T. C. Zhou, M. R. Lyu, I. King, Improving recommender systems
by incorporating social contextual information, ACM Transactions on
Information Systems 29 (2) (2011) 9:1–9:23.
[42] T. M. Mitchell, Mining our reality, Science 326 (5960) (2009) 1644–1645.
[43] J. Noel, S. Sanner, K.-N. Tran, P. Christen, L. Xie, E. V. Bonilla, E. Abbasnejad, N. Della Penna, New objective functions for social collaborative filtering, in: Proceedings of the 21st International Conference on
World Wide Web, Lyon, France, 2012, pp. 859–868.
[44] J. M. Noguera, M. J. Barranco, R. J. Segura, L. Martı́nez, A mobile
3D-GIS hybrid recommender system for tourism, Information Sciences
215 (2012) 37–52.
[45] A. Noulas, S. Scellato, N. Lathia, C. Mascolo, A random walk around the
city: New venue recommendation in location-based social networks, in:
Proceedings of the IEEE International Conference on Social Computing,
Amsterdam, The Netherlands, 2012, pp. 144–153.
46
[46] Y. Qu, J. Zhang, Trade area analysis using user generated mobile location data, in: Proceedings of the 22nd International Conference on
World Wide Web, Rio de Janeiro, Brazil, 2013, pp. 1053–1064.
[47] N. S. Savage, M. Baranski, N. E. Chavez, T. Höllerer, I’m feeling LoCo:
A location based context aware recommendation system, in: G. Gartner,
F. Ortag (eds.), Advances in Location-Based Services, Springer, Berlin,
2012, pp. 37–54.
[48] Y. Shen, R. Jin, Learning personal + social latent factor model for social
recommendation, in: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Beijing,
China, 2012, pp. 1303–1311.
[49] B. W. Silverman, Density estimation for statistics and data analysis,
Chapman and Hall, London, 1986.
[50] B. Yang, S. Lee, S. Park, S. goo Lee, Exploiting various implicit feedback for collaborative filtering, in: Proceedings of the 21st International
Conference on World Wide Web, Lyon, France, 2012, pp. 639–640.
[51] D. Yang, D. Zhang, Z. Yu, Z. Wang, A sentiment-enhanced personalized location recommendation system, in: Proceedings of the 24th
ACM Conference on Hypertext and Social Media, Paris, France, 2013,
pp. 119–128.
[52] D. Yang, D. Zhang, Z. Yu, Z. Yu, Fine-grained preference-aware location search leveraging crowdsourced digital footprints from LBSNs, in:
47
Proceedings of the ACM International Joint Conference on Pervasive
and Ubiquitous Computing, Zurich, Switzerland, 2013, pp. 479–488.
[53] S.-H. Yang, B. Long, A. Smola, N. Sadagopan, Z. Zheng, H. Zha, Like
like alike: Joint friendship and interest propagation in social networks,
in: Proceedings of the 20th International Conference on World Wide
Web, Hyderabad, India, 2011, pp. 537–546.
[54] X. Yang, H. Steck, Y. Liu, Circle-based recommendation in online social networks, in: Proceedings of the 18th ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, Beijing, China,
2012, pp. 1267–1275.
[55] M. Ye, P. Yin, W.-C. Lee, Location recommendation for location-based
social networks, in: Proceedings of the 18th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems,
San Jose, CA, 2010, pp. 458–461.
[56] M. Ye, P. Yin, W.-C. Lee, D.-L. Lee, Exploiting geographical influence
for collaborative point-of-interest recommendation, in: Proceeding of
the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, Beijing, China, 2011, pp. 325–334.
[57] H. Yin, Y. Sun, B. Cui, Z. Hu, L. Chen, LCARS: A location-contentaware recommender system, in: Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining,
Chicago, IL, 2013, pp. 221–229.
48
[58] J. J.-C. Ying, W.-C. Lee, V. S. Tseng, Mining geographic-temporalsemantic patterns in trajectories for location prediction, ACM Transactions on Intelligent Systems and Technology 5 (1) (2014) 2:1–2:33.
[59] J. J.-C. Ying, E. H.-C. Lu, W.-N. Kuo, V. S. Tseng, Urban point-ofinterest recommendation by mining user check-in behaviors, in: Proceedings of the ACM SIGKDD International Workshop on Urban Computing, Beijing, China, 2012, pp. 63–70.
[60] Q. Yuan, G. Cong, Z. Ma, A. Sun, N. Magnenat-Thalmann, Time-aware
point-of-interest recommendation, in: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, 2013, pp. 363–372.
[61] C. Zhang, L. Shou, K. Chen, G. Chen, Y. Bei, Evaluating geo-social
influence in location-based social networks, in: Proceedings of the 21st
ACM International Conference on Information and Knowledge Management, Maui, HI, 2012, pp. 1442–1451.
[62] J.-D. Zhang, C.-Y. Chow, iGSLR: Personalized geo-social location recommendation - a kernel density estimation approach, in: Proceedings of
the 21st ACM SIGSPATIAL International Conference on Advances in
Geographic Information Systems, Orlando, FL, 2013, pp. 344–353.
[63] J.-D. Zhang, C.-Y. Chow, Y. Li, iGeoRec: A personalized and efficient
geographical location recommendation framework, IEEE Transactions
on Services Computing (2014) accepted to appear.
49
[64] J.-D. Zhang, C.-Y. Chow, Y. Li, LORE: Exploiting sequential influence for location recommendations, in: Proceedings of the 22nd ACM
SIGSPATIAL International Conference on Advances in Geographic Information Systems, accepted to appear, Dallas, TX, 2014.
[65] J.-D. Zhang, G. Ghinita, C.-Y. Chow, Differentially private location
recommendations in geosocial networks, in: Proceedings of the 15th
IEEE International Conference on Mobile Data Management, accepted
to appear, Brisbane, Australia, 2014.
[66] V. W. Zheng, B. Cao, Y. Zheng, X. Xie, Q. Yang, Collaborative filtering
meets mobile recommendation: A user-centered approach, in: Proceedings of the 24th National Conference on Artificial Intelligence, Atlantaa,
GA, 2010, pp. 236–241.
[67] V. W. Zheng, Y. Zheng, X. Xie, Q. Yang, Collaborative location and
activity recommendations with GPS history data, in: Proceedings of the
19th International Conference on World Wide Web, Raleigh, NC, 2010,
pp. 1029–1038.
[68] V. W. Zheng, Y. Zheng, X. Xie, Q. Yang, Towards mobile intelligence:
Learning from GPS history data for collaborative recommendation, Artificial Intelligence 184-185 (2012) 17–37.
[69] Y. Zheng, L. Zhang, Z. Ma, X. Xie, W.-Y. Ma, Recommending friends
and locations based on individual location history, ACM Transactions
on the Web 5 (1) (2011) 5:1–5:44.
50
[70] D. Zhou, B. Wang, S. M. Rahimi, X. Wang, A study of recommending locations on location-based social network by collaborative filtering,
in: L. Kosseim, D. Inkpen (eds.), Advances in Artificial Intelligence,
Springer, Berlin, 2012, pp. 255–266.
51
Download