The Bayesian Low-Rank Determinantal Point Process Mixture Model

Bibliographic Details
Main Authors:	Gartrell, Mike; Paquet, Ulrich; Koenigstein, Noam
Format:	Journal Article
Language:	English
Published:	15-08-2016
Subjects:	Computer Science - Learning; Statistics - Machine Learning

Abstract	Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item catalog. They are useful for a number of machine learning tasks, including product recommendation. DPPs are parametrized by a positive semi-definite kernel matrix. Recent work has shown that using a low-rank factorization of this kernel provides remarkable scalability improvements that open the door to training on large-scale datasets and computing online recommendations, both of which are infeasible with standard DPP models that use a full-rank kernel. In this paper we present a low-rank DPP mixture model that allows us to represent the latent structure present in observed subsets as a mixture of a number of component low-rank DPPs, where each component DPP is responsible for representing a portion of the observed data. The mixture model allows us to effectively address the capacity constraints of the low-rank DPP model. We present an efficient and scalable Markov Chain Monte Carlo (MCMC) learning algorithm for our model that uses Gibbs sampling and stochastic gradient Hamiltonian Monte Carlo (SGHMC). Using an evaluation on several real-world product recommendation datasets, we show that our low-rank DPP mixture model provides substantially better predictive performance than is possible with a single low-rank or full-rank DPP, and significantly better performance than several other competing recommendation methods in many cases.
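As a rough illustration of the model the abstract describes, the sketch below evaluates subset probabilities under a mixture of low-rank DPPs. It assumes the standard L-ensemble definition P(Y) = det(L_Y) / det(L + I) with a low-rank kernel L = V Vᵀ. That formula is general DPP background and is not quoted from this record. The names `lowrank_dpp_prob`, `mixture_prob`, `V_a`, `V_b`, and `weights`, and all numeric values, are hypothetical. The sketch does not implement the paper's Gibbs/SGHMC learning algorithm; it only scores subsets for fixed parameters.

```python
from itertools import combinations


def det(m):
    """Determinant via Gaussian elimination with partial pivoting."""
    a = [row[:] for row in m]
    n = len(a)
    d = 1.0  # determinant of the empty matrix is 1
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(a[r][c]))
        if abs(a[p][c]) < 1e-12:
            return 0.0  # numerically singular
        if p != c:
            a[c], a[p] = a[p], a[c]
            d = -d
        d *= a[c][c]
        for r in range(c + 1, n):
            f = a[r][c] / a[c][c]
            for k in range(c, n):
                a[r][k] -= f * a[c][k]
    return d


def gram(vectors):
    """Matrix of pairwise inner products of the given vectors."""
    return [[sum(x * y for x, y in zip(u, v)) for v in vectors] for u in vectors]


def lowrank_dpp_prob(V, subset):
    """P(Y) = det(L_Y) / det(L + I) for L = V V^T (V is M x K, one row per item)."""
    L_Y = gram([V[i] for i in subset])
    K = len(V[0])
    # Sylvester's identity: det(I_M + V V^T) = det(I_K + V^T V),
    # so the normalizer needs only a K x K determinant.
    columns = [[row[k] for row in V] for k in range(K)]
    C = gram(columns)  # V^T V
    for k in range(K):
        C[k][k] += 1.0
    return det(L_Y) / det(C)


def mixture_prob(weights, Vs, subset):
    """Weighted sum of component low-rank DPP probabilities."""
    return sum(w * lowrank_dpp_prob(V, subset) for w, V in zip(weights, Vs))


# Hypothetical 4-item catalog with two rank-2 components.
V_a = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0], [0.3, 0.8]]
V_b = [[0.2, 0.9], [1.0, 0.1], [0.4, 0.4], [0.7, 0.0]]
weights = [0.6, 0.4]

subsets = [s for r in range(5) for s in combinations(range(4), r)]
total = sum(mixture_prob(weights, [V_a, V_b], s) for s in subsets)
```

Each rank-K component assigns zero probability to subsets larger than K, and the mixture inherits this limit. The extra capacity of the mixture comes from combining several kernels, not from raising the rank.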
BackLink	https://doi.org/10.48550/arXiv.1608.04245 (View paper in arXiv)
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
DOI	10.48550/arxiv.1608.04245
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
OpenAccessLink	https://arxiv.org/abs/1608.04245
SecondaryResourceType	preprint