Collaborative Topic Regression (CTR)
- Collaborative Topic Regression is a framework that combines latent topic modeling (via LDA) with probabilistic matrix factorization to jointly capture item content and user interactions.
- It addresses cold-start and data sparsity by coupling content signals with collaborative user feedback, thereby enhancing recommendation precision.
- Extensions incorporating social network influences and cognitive attention constraints further boost its performance in large-scale, dynamic environments.
Collaborative Topic Regression (CTR) is a probabilistic, machine-learning framework that integrates latent topic modeling of item content with collaborative filtering based on user–item interactions. CTR addresses the cold-start and data sparsity problems in recommender systems by modeling the generative process of user adoption events and item content simultaneously. It extends traditional matrix factorization with LDA-style topic representations for items, and supports further extension to social and cognitive factors such as social network influence and limited human attention.
1. Model Foundations
CTR builds upon two foundational components: probabilistic matrix factorization (PMF) and Latent Dirichlet Allocation (LDA). In the canonical CTR model, each item (e.g., document, media file) is represented by a topic-proportion vector , drawn from a Dirichlet prior; user latent interest vectors are sampled from a zero-mean Gaussian prior. The item latent vector is defined as , where accounts for collaborative signal not captured by content, with . The user–item interactions (ratings/adoptions) are modeled as:
with unknown binary or continuous ratings and confidence weights . Item content generation follows standard LDA:
- For each word in item , topic assignment .
- Word , where is the topic-word distribution.
This joint formulation couples collaborative and content-based signals in the shared latent space.
2. Extended Frameworks: Social and Cognitive Signals
CTR has been generalized to incorporate social network effects and psycho-cognitive constraints:
- Social Matrix Factorization (CTR+SMF): Incorporates observed user–user links via a social latent vector for each friend , with an additional likelihood matching observed friendships as , and a logistic mapping. This coupling allows the model to infer the degree of 'peer influence' on each user's preferences (Purushotham et al., 2012).
- Limited Attention CTR (LA-CTR): Models cognitive constraints by introducing an attention vector for user 's allocation of attention to friend over topics. Each , encoding both baseline influence and topical focus of attention. The user’s adoption of item via friend is modeled by . This captures both social diffusion and cognitive processing constraints (Kang et al., 2013).
3. Objective Functions and Inference Methods
CTR and its extensions optimize a joint (log-)posterior, regularizing the solution through Gaussian and Dirichlet priors:
LA-CTR extends this with additional regularization and social/attention likelihoods, introducing terms penalizing deviations of attention from the product of influence and interest, and matching observed adoption paths via friends.
Model estimation is performed using MAP–EM style algorithms, alternating between:
- E-step: Variational approximations for topic assignments, e.g., for LDA components.
- M-step: Closed-form updates for user/item/attention/social vectors, e.g.,
with analogous updates for , , and .
Hyperparameters ( terms, confidence weights) are selected via held-out cross-validation.
4. Online and Jointly Coupled Learning
The original CTR is a batch method, computing topic proportions and rating factors separately. Online Bayesian CTR (OBCTR) proposes a streaming variational inference framework, updating posteriors incrementally with each incoming tuple (Liu et al., 2016). Key properties include:
- Mean-field posteriors for , (Gaussian), topic distributions , (Dirichlet), and word-topic assignments .
- Per-sample updates using BayesPA-style minimization: each new sample nudges the latent factors, and rating signals directly influence topic assignments via the collaborative–content coupling term.
- Constant per-sample memory and computation, suitable for streaming and large-scale contexts.
- Empirically, OBCTR achieves improved RMSE and held-out likelihood compared to both batch CTR and less tightly coupled online variants.
5. Empirical Evaluation and Performance
CTR and its variants have been extensively benchmarked on public datasets:
| Model | Lastfm (recall@250) | Delicious (recall@250) | Digg 2009 (recall@100) |
|---|---|---|---|
| PMF (CF) | 0.42 | 0.36 | — |
| CTR | 0.45 | 0.39 | ~0.11 |
| CTR+SMF | 0.48 | 0.43 | ~0.17 |
| LA-CTR | — | — | ~0.19 |
| LA-CTR | — | — | ~0.22 |
- On social bookmarking (Delicious), incorporating social structure yields 3–5% higher recall than relying on content alone (Purushotham et al., 2012).
- On music (Lastfm), content (tags) is more predictive, with social signals providing a complementary boost.
- On social news voting (Digg), LA-CTR demonstrates relative improvements of 20–30% over CTR+SMF, confirming that modeling limited, non-uniform attention sharpens prediction of social adoptions (Kang et al., 2013).
- In all cases, baselines using only collaborative or content signals underperform joint models.
6. Limitations and Prospective Directions
- Model Scalability: Extensions such as LA-CTR introduce latent variables in the worst case; in practice, sparsity of social graphs and interactions makes these models computationally