point-of-interest recommendations: learning …hli38/slide/kdd_16.pdfpoint-of-interest...

Point-of-Interest Recommendations: Learning Potential Check-ins from Friends

Huayu Li∗, Yong Ge+, Richang Hong−, Hengshu Zhu×

∗University of North Carolina at Charlotte

+University of Arizona

−Hefei University of Technology×

Baidu Research-Big Data Lab

Outline

Introduction

Research Problem

Research Challenges

Related Work

Methodologies

Experiments

Introduction

Users Mobile DevicesLocation-based Social

Network (LBSN) Services

Introduction

Information Overload• Foursquare: 65 million venues

• Facebook: 16 million local business

• Yelp: 2.1 million claimed business

New Region

Which One?

Introduction

Information Overload• Foursquare: 65 million venues

• Facebook: 16 million local business

• Yelp: 2.1 million claimed business

New Region

Which One? A location recommender system is very important!

Research Problem

Given a set of users and a set of locations they have visited

before, the objective is to recommend the locations to an

individual who might have interest to visit.

visited recommended

Research Challenges

Complex Decision Making Process• Social Network Influence

• Geographical Influence

Research Challenges

Data Sparsity Issue• Each user only visits a limited number of locations.

• For new user/location, we do not have their check-in information.

Research Challenges

Data Sparsity Issue• Each user only visits a limited number of locations.

• For new user/location, we do not have their check-in information.

Implicit Feedback Issue• Only check-in frequency without explicit rating.

• We do not know user’s explicit preference for locations.

Related Work

Modeling Social Network Influence• Social regularization constraint (WSDM’11)

• Social correlations (CIKM’12, IJCAI’13, ICDM’15)

• User-based collaborative filtering (SIGIR’11)

Modeling geographical influence• Incorporating geographical distance (KDD’11, SIGIR’11, AAAI’12,

SIGSPATIAL’ 13, KDD’14, ICDM’15)

• Incorporating activity area (KDD’ 14)

• Incorporation nearest neighbors (CIKM’14)

Methods: Framework

Learn potential locations from

friends

Learn user’s preference for

locations

Methods: Framework

friends

locations

Definition of Friends

Social Friends ℱ𝑖s

• The users who socially connect with the target user 𝑖 in LBSNs.

Location Friends ℱ𝑖𝑙

• The users who check-in the same locations as the target user 𝑖.

Neighboring Friends ℱ𝑖𝑛

• The users who live physically closest to the target user 𝑖.

𝑙1𝑙2

𝑙3𝑙4 𝑙5

𝑢𝑖

Definition of Friends

Social Friends ℱ𝑖s

• The users who socially connect with the target user 𝑖 in LBSNs.

Location Friends ℱ𝑖𝑙

• The users who check-in the same locations as the target user 𝑖.

Neighboring Friends ℱ𝑖𝑛

• The users who live physically closest to the target user 𝑖.

𝑙1𝑙2

𝑙3𝑙4 𝑙5

𝑢𝑖

ℱi = ℱ𝑖s ∪ 𝑆(ℱ𝑖

𝑙) ∪ 𝑆(ℱ𝑖𝑛)

Methods: Learning Potential Locations

𝑢𝑖PROBLEM DEFINITION:For the target user 𝑖, given

a set of locations that her

friends have checked-in

before but she never visits,

the problem is to find top

most potential locations that

she might be interested in.

𝑢𝑖

𝑃𝑖𝑗𝑝𝑜𝑡

𝑙𝑗

Location Candidate

Linear Aggregation

Random Walk

Methods: Linear Aggregation

𝑢𝑖

𝑙𝑗

Probability 𝑃𝑖𝑗𝑝𝑜𝑡

user 𝑖 visits a location 𝑗:

∝ max𝑓∈ℱ

𝑖𝑗{𝑆𝑖𝑚(𝑖, 𝑓; 𝑗)}

𝜁𝑆𝑖𝑚𝑢 𝑖, 𝑓 + (1 − 𝜁)𝑃𝑖𝑗𝐺

Similarity of User Interest Similarity of Geo-location

Methods: Random Walk

𝑢𝑖 Nodes: users and locations

Links: user-user, user-location, location-location

𝐲 = 1 − 𝛽 𝐀𝐲 +𝛽

|ℳ𝑖𝑜∩ℳ𝑖

𝑓|+|ℱ𝑖|+1

is the steady probability

corresponding to location j

Transition Matrix Restart Nodes

Observed Locations Potential Locations Other Unobserved Locations

Methods: Framework

friends

locations

Recommendation Models

The preference Ƹ𝑝𝑖𝑗 of user 𝑖 for location 𝑗:

≈×d

⊙Users’ preference for locations

Category FeatureMatrix

Location Latent Matrix

User Latent Matrix

Ƹ𝑝𝑖𝑗 = (𝑞𝑖𝑐𝑗 + 𝜀) 𝐮𝑖𝑇𝐯𝑗

User’s Preference for Category

Tuning Parameter

User’s Typical Preference for Location

𝐏 ෩𝐐 = 𝐐 + 𝛆

Loss function of general form

argmin𝐔,𝐕,𝐐

𝐸𝑖 𝑝𝑖𝑗 , 𝑝𝑖𝑘 , 𝑝𝑖ℎ , ො𝑝𝑖𝑗 , ො𝑝𝑖𝑘 , ො𝑝𝑖ℎ + Θ(𝐔, 𝐕,𝐐)

∀ 𝑗 ∈ ℳ𝑖𝑜, 𝑘 ∈ ℳ𝑖

𝑝, ℎ ∈ ℳ𝑖

Estimated Value

Observed Locations

Potential Locations

Other Unobserved Locations

Estimated Value

Observed Locations

Potential Locations

𝜆𝑢

2||𝐔||2

2 +𝜆𝑣

2||𝐕||2

2+𝜆𝑞

2||𝐐||2

Regularization Term

Observed Locations

Potential Locations

𝜆𝑢

2||𝐔||2

2 +𝜆𝑣

2||𝐕||2

2+𝜆𝑞

2||𝐐||2

Regularization Term

Square Error based Model Ranking Error based Model

Square Error based Model

The user’s preference for a location is defined as:

𝑝𝑖𝑗 = ൞

1 𝑖𝑓 𝑗 ∈ ℳio

𝛼 𝑖𝑓 𝑗 ∈ ℳi𝑝

0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Observed Locations

Potential Locations

Other unobserved Locations

The user’s preference for a location is defined as:

𝑝𝑖𝑗 = ൞

1 𝑖𝑓 𝑗 ∈ ℳio

𝛼 𝑖𝑓 𝑗 ∈ ℳi𝑝

0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Squared error loss function

𝐸𝑖 ∙ =

𝑗=1

𝑤𝑖𝑗(𝑝𝑖𝑗 − Ƹ𝑝𝑖𝑗 )2

𝑤𝑖𝑗 = ቊ1 + 𝛾 × 𝑟𝑖𝑗 , 𝑖𝑓 𝑗 ∈ ℳi

1, 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Weight Matrix

Squared error based objective function

= min𝐔,𝐕,𝐐

𝑖=1

𝑗=1

𝑤𝑖𝑗(𝑝𝑖𝑗 − Ƹ𝑝𝑖𝑗 )2

+ Θ(𝐔, 𝐕, 𝐐)

Initialization

Alternating Update

Alternating Least Square

Ranking Error based Model

Model the ranking order among user’s preference for three types of locations

ቊƸ𝑝𝑖𝑗 > Ƹ𝑝𝑖𝑘Ƹ𝑝𝑖𝑘 > Ƹ𝑝𝑖ℎ

, ∀ 𝑗 ∈ ℳ𝑖𝑜,𝑘 ∈ ℳ𝑖

Observed Location

Potential Location

Other Unobserved Location

Model the ranking order among user’s preference for three types of locations

ቊƸ𝑝𝑖𝑗 > Ƹ𝑝𝑖𝑘Ƹ𝑝𝑖𝑘 > Ƹ𝑝𝑖ℎ

, ∀ 𝑗 ∈ ℳ𝑖𝑜,𝑘 ∈ ℳ𝑖

Ranking error loss function

𝐸𝑖 ∙ = −

𝑗∈ℳ𝑖𝑜

𝑘∈ℳ𝑖𝑝

ln 𝜎( Ƹ𝑝𝑖𝑗 − Ƹ𝑝𝑖𝑘) −

𝑘∈ℳ𝑖𝑝

ℎ∈ℳ𝑖𝑢

ln 𝜎( Ƹ𝑝𝑖𝑘 − Ƹ𝑝𝑖ℎ)

Using Logistic Function to Model Ranking Order

Ranking error based objective function

Initialization

Update

Stochastic Gradient Descent with Boostrap Sampling

Sampling

Incorporating Geographical Influence

Check-in probability is refined by a power-law function associated with the distance between user home position and a location.

Ƹ𝑝𝑖𝑗 ∝ 𝑝𝑖𝑗𝐺 × 𝜎( Ƹ𝑝𝑖𝑗)

𝑝𝑜𝑤𝑒𝑟𝑙𝑎𝑤(𝑑(𝑖, 𝑗))

Recommendation Strategies

Target User 𝑖

New Location

Standard Recommendation

New User RecommendationƸ𝑝𝑖𝑗 = (𝑞𝑖𝑐𝑗 + 𝜀) 𝐮𝑖

𝑇𝐯𝑗

New Location Recommendation

Ƹ𝑝𝑖𝑗 ∝ 𝑝𝑖𝑗𝐺× 𝜎

σ𝑙∈𝜓𝑗

𝑆𝑖𝑚𝐺(𝑗, 𝑙) Ƹ𝑝𝑖𝑙

σ𝑙∈𝜓𝑗

𝑆𝑖𝑚𝐺(𝑗, 𝑙)

Datasets: Gowalla

Test Methodology• Selecting 80% as training and using the rest 20% as testing according to

timestamp

Evaluation Metrics: • Top-K Recommendation Accuracy

(Precision@K and Recall@K)

Experiments

Statistics of Data Set

New Location Rec New User Rec

#User #Location #Check-in Sparsity #New Location #Test #New User #Test

52,216 98,351 2,577,336 0.0399% 78,881 568,937 9,326 79,153

Exp. : Standard Recommendation

Precision@K Recall@K

Modeling unobserved check-ins can improve recommendation accuracy !

Exp. : Standard Recommendation

Modeling potential check-ins can benefit recommendation!

Exp. : New User Recommendation

Modeling potential check-ins can solve user cold-start issue!

Exp. : New Location Recommendation

Modeling potential check-ins can solve location cold-start issue!

Performance comparison for new location recommendation in terms of Precision@K and Recall@K.

Conclusion

Empirically analyze the correlations between users and their three type of friends using real-world data

Learn a set of locations for each user that her friends have checked-in before and she is most interested in

Develop matrix factorization based models via different error loss functions with the learned potential check-ins, and propose two scalable optimization methods

Design three different recommendation strategies

Thank You

point-of-interest recommendations: learning …hli38/slide/kdd_16.pdfpoint-of-interest...

Documents

affordable and efficient, or overpriced and underwhelming...

(escorts): stakeholder(escorts): stakeholder engagement &...

uwpd external review findings and recommendations · uwpd...

ms. sun yongge general secretary internet society of china...

constrained interest-base tour recommendations in large...

sun yongge

tuesday 26 - world nuclear exhibition...• dominique...

recommendations on asset and interest declarations …

real time recommendations using spark streaming ·...

reforming major interest rate benchmarks: progress · pdf...

annual report 2009-2010 · recommendations from the plan,...

electronic cash and blockchain security - cyber...

nestlé, dalston - carlisle natural history · pdf...

closing the omni-channel loop via dynamic, data...

biased recommendations from biased and unbiased experts ·...

matrix rank and inertia formulas in the analysis of ... ·...

- a status report with policy recommendations...an expanding...

the role of fundamental analysis in information arbitrage...

from interest to lead to conversion & then recommendations ...

regulatory insights for asset managers, mutual …...best...