The Bayesian Low-Rank Determinantal Point Process Mixture Model
Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item catalog. They are useful for a number of machine learning tasks, including product recommendation. DPPs are parametrized by a positive semi-de...
Saved in:
Main Authors: | , , |
---|---|
Format: | Journal Article |
Language: | English |
Published: |
15-08-2016
|
Subjects: | |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Abstract | Determinantal point processes (DPPs) are an elegant model for encoding
probabilities over subsets, such as shopping baskets, of a ground set, such as
an item catalog. They are useful for a number of machine learning tasks,
including product recommendation. DPPs are parametrized by a positive
semi-definite kernel matrix. Recent work has shown that using a low-rank
factorization of this kernel provides remarkable scalability improvements that
open the door to training on large-scale datasets and computing online
recommendations, both of which are infeasible with standard DPP models that use
a full-rank kernel. In this paper we present a low-rank DPP mixture model that
allows us to represent the latent structure present in observed subsets as a
mixture of a number of component low-rank DPPs, where each component DPP is
responsible for representing a portion of the observed data. The mixture model
allows us to effectively address the capacity constraints of the low-rank DPP
model. We present an efficient and scalable Markov Chain Monte Carlo (MCMC)
learning algorithm for our model that uses Gibbs sampling and stochastic
gradient Hamiltonian Monte Carlo (SGHMC). Using an evaluation on several
real-world product recommendation datasets, we show that our low-rank DPP
mixture model provides substantially better predictive performance than is
possible with a single low-rank or full-rank DPP, and significantly better
performance than several other competing recommendation methods in many cases. |
---|---|
AbstractList | Determinantal point processes (DPPs) are an elegant model for encoding
probabilities over subsets, such as shopping baskets, of a ground set, such as
an item catalog. They are useful for a number of machine learning tasks,
including product recommendation. DPPs are parametrized by a positive
semi-definite kernel matrix. Recent work has shown that using a low-rank
factorization of this kernel provides remarkable scalability improvements that
open the door to training on large-scale datasets and computing online
recommendations, both of which are infeasible with standard DPP models that use
a full-rank kernel. In this paper we present a low-rank DPP mixture model that
allows us to represent the latent structure present in observed subsets as a
mixture of a number of component low-rank DPPs, where each component DPP is
responsible for representing a portion of the observed data. The mixture model
allows us to effectively address the capacity constraints of the low-rank DPP
model. We present an efficient and scalable Markov Chain Monte Carlo (MCMC)
learning algorithm for our model that uses Gibbs sampling and stochastic
gradient Hamiltonian Monte Carlo (SGHMC). Using an evaluation on several
real-world product recommendation datasets, we show that our low-rank DPP
mixture model provides substantially better predictive performance than is
possible with a single low-rank or full-rank DPP, and significantly better
performance than several other competing recommendation methods in many cases. |
Author | Gartrell, Mike Paquet, Ulrich Koenigstein, Noam |
Author_xml | – sequence: 1 givenname: Mike surname: Gartrell fullname: Gartrell, Mike – sequence: 2 givenname: Ulrich surname: Paquet fullname: Paquet, Ulrich – sequence: 3 givenname: Noam surname: Koenigstein fullname: Koenigstein, Noam |
BackLink | https://doi.org/10.48550/arXiv.1608.04245$$DView paper in arXiv |
BookMark | eNotz71OwzAUQGEPMEDhAZjwCyTYiW9iTwjKr5SqVZU9urFvhEVqIydA-_aIwnS2I33n7CTEQIxdSZErDSBuMO39Vy4roXOhCgVn7LZ9I36PB5o8Bt7E72yL4Z0_0Exp5wOGGUe-iT7MfJOipWniK7-fPxPxVXQ0XrDTAceJLv-7YO3TY7t8yZr18-vyrsmwqiEzVgnsC-eUdqavwch6ENpoAYUFR6IEUoC2lLKXqh-skW6wulaWrNHSVeWCXf9tj4LuI_kdpkP3K-mOkvIHDCVE5Q |
ContentType | Journal Article |
Copyright | http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
Copyright_xml | – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
DBID | AKY EPD GOX |
DOI | 10.48550/arxiv.1608.04245 |
DatabaseName | arXiv Computer Science arXiv Statistics arXiv.org |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
ExternalDocumentID | 1608_04245 |
GroupedDBID | AKY EPD GOX |
ID | FETCH-LOGICAL-a675-9c40ab2dd48d9b75917f0898052c5de035e45ac311b14bfc91dfc874cec981d63 |
IEDL.DBID | GOX |
IngestDate | Mon Jan 08 05:49:10 EST 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a675-9c40ab2dd48d9b75917f0898052c5de035e45ac311b14bfc91dfc874cec981d63 |
OpenAccessLink | https://arxiv.org/abs/1608.04245 |
ParticipantIDs | arxiv_primary_1608_04245 |
PublicationCentury | 2000 |
PublicationDate | 2016-08-15 |
PublicationDateYYYYMMDD | 2016-08-15 |
PublicationDate_xml | – month: 08 year: 2016 text: 2016-08-15 day: 15 |
PublicationDecade | 2010 |
PublicationYear | 2016 |
Score | 1.6423308 |
SecondaryResourceType | preprint |
Snippet | Determinantal point processes (DPPs) are an elegant model for encoding
probabilities over subsets, such as shopping baskets, of a ground set, such as
an item... |
SourceID | arxiv |
SourceType | Open Access Repository |
SubjectTerms | Computer Science - Learning Statistics - Machine Learning |
Title | The Bayesian Low-Rank Determinantal Point Process Mixture Model |
URI | https://arxiv.org/abs/1608.04245 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwED6RTiwIBKg85YHVInbsJJ4Q0JYOvAQdukV-RYpAKeoDyr_nnASVhcWDH4PvLN935893ABdoAh1LM02x5RRvSUa15JKK3KjMao5HJoSyx6_Z4zQfDEOaHPL7F0bP19Vnmx_YLC5ZGqiO4XEugojzQNm6e5q2j5NNKq5u_mYeYsym64-RGO3CTofuyHWrjj3Y8vU-XKEqyI3-9uG_IrmffdEXXb-RwYaIgkueZ1W9JB1tnzxU6xDZJ6FS2fsBTEbDye2YdnULqEb4TZUVsTbcOZE7ZTKJDlEZ5yrUDrDS-TiRXkhtE8YME6a0irnS5pmw3ipEj2lyCD10_X0fSKJz3BNzUiOyUkKib8RtmSaB65Fpw46g3-y2-GhTUxRBEEUjiOP_h05gG81-GiKjTJ5Cbzlf-TOIFm513sj3B8ZFd4w |
link.rule.ids | 228,230,782,887 |
linkProvider | Cornell University |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+Bayesian+Low-Rank+Determinantal+Point+Process+Mixture+Model&rft.au=Gartrell%2C+Mike&rft.au=Paquet%2C+Ulrich&rft.au=Koenigstein%2C+Noam&rft.date=2016-08-15&rft_id=info:doi/10.48550%2Farxiv.1608.04245&rft.externalDocID=1608_04245 |