The Bayesian Low-Rank Determinantal Point Process Mixture Model

Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item catalog. They are useful for a number of machine learning tasks, including product recommendation. DPPs are parametrized by a positive semi-de...

Full description

Saved in:
Bibliographic Details
Main Authors: Gartrell, Mike, Paquet, Ulrich, Koenigstein, Noam
Format: Journal Article
Language:English
Published: 15-08-2016
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item catalog. They are useful for a number of machine learning tasks, including product recommendation. DPPs are parametrized by a positive semi-definite kernel matrix. Recent work has shown that using a low-rank factorization of this kernel provides remarkable scalability improvements that open the door to training on large-scale datasets and computing online recommendations, both of which are infeasible with standard DPP models that use a full-rank kernel. In this paper we present a low-rank DPP mixture model that allows us to represent the latent structure present in observed subsets as a mixture of a number of component low-rank DPPs, where each component DPP is responsible for representing a portion of the observed data. The mixture model allows us to effectively address the capacity constraints of the low-rank DPP model. We present an efficient and scalable Markov Chain Monte Carlo (MCMC) learning algorithm for our model that uses Gibbs sampling and stochastic gradient Hamiltonian Monte Carlo (SGHMC). Using an evaluation on several real-world product recommendation datasets, we show that our low-rank DPP mixture model provides substantially better predictive performance than is possible with a single low-rank or full-rank DPP, and significantly better performance than several other competing recommendation methods in many cases.
AbstractList Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item catalog. They are useful for a number of machine learning tasks, including product recommendation. DPPs are parametrized by a positive semi-definite kernel matrix. Recent work has shown that using a low-rank factorization of this kernel provides remarkable scalability improvements that open the door to training on large-scale datasets and computing online recommendations, both of which are infeasible with standard DPP models that use a full-rank kernel. In this paper we present a low-rank DPP mixture model that allows us to represent the latent structure present in observed subsets as a mixture of a number of component low-rank DPPs, where each component DPP is responsible for representing a portion of the observed data. The mixture model allows us to effectively address the capacity constraints of the low-rank DPP model. We present an efficient and scalable Markov Chain Monte Carlo (MCMC) learning algorithm for our model that uses Gibbs sampling and stochastic gradient Hamiltonian Monte Carlo (SGHMC). Using an evaluation on several real-world product recommendation datasets, we show that our low-rank DPP mixture model provides substantially better predictive performance than is possible with a single low-rank or full-rank DPP, and significantly better performance than several other competing recommendation methods in many cases.
Author Gartrell, Mike
Paquet, Ulrich
Koenigstein, Noam
Author_xml – sequence: 1
  givenname: Mike
  surname: Gartrell
  fullname: Gartrell, Mike
– sequence: 2
  givenname: Ulrich
  surname: Paquet
  fullname: Paquet, Ulrich
– sequence: 3
  givenname: Noam
  surname: Koenigstein
  fullname: Koenigstein, Noam
BackLink https://doi.org/10.48550/arXiv.1608.04245$$DView paper in arXiv
BookMark eNotz71OwzAUQGEPMEDhAZjwCyTYiW9iTwjKr5SqVZU9urFvhEVqIydA-_aIwnS2I33n7CTEQIxdSZErDSBuMO39Vy4roXOhCgVn7LZ9I36PB5o8Bt7E72yL4Z0_0Exp5wOGGUe-iT7MfJOipWniK7-fPxPxVXQ0XrDTAceJLv-7YO3TY7t8yZr18-vyrsmwqiEzVgnsC-eUdqavwch6ENpoAYUFR6IEUoC2lLKXqh-skW6wulaWrNHSVeWCXf9tj4LuI_kdpkP3K-mOkvIHDCVE5Q
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
EPD
GOX
DOI 10.48550/arxiv.1608.04245
DatabaseName arXiv Computer Science
arXiv Statistics
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 1608_04245
GroupedDBID AKY
EPD
GOX
ID FETCH-LOGICAL-a675-9c40ab2dd48d9b75917f0898052c5de035e45ac311b14bfc91dfc874cec981d63
IEDL.DBID GOX
IngestDate Mon Jan 08 05:49:10 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a675-9c40ab2dd48d9b75917f0898052c5de035e45ac311b14bfc91dfc874cec981d63
OpenAccessLink https://arxiv.org/abs/1608.04245
ParticipantIDs arxiv_primary_1608_04245
PublicationCentury 2000
PublicationDate 2016-08-15
PublicationDateYYYYMMDD 2016-08-15
PublicationDate_xml – month: 08
  year: 2016
  text: 2016-08-15
  day: 15
PublicationDecade 2010
PublicationYear 2016
Score 1.6423308
SecondaryResourceType preprint
Snippet Determinantal point processes (DPPs) are an elegant model for encoding probabilities over subsets, such as shopping baskets, of a ground set, such as an item...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Learning
Statistics - Machine Learning
Title The Bayesian Low-Rank Determinantal Point Process Mixture Model
URI https://arxiv.org/abs/1608.04245
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://sdu.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwED6RTiwIBKg85YHVInbsJJ4Q0JYOvAQdukV-RYpAKeoDyr_nnASVhcWDH4PvLN935893ABdoAh1LM02x5RRvSUa15JKK3KjMao5HJoSyx6_Z4zQfDEOaHPL7F0bP19Vnmx_YLC5ZGqiO4XEugojzQNm6e5q2j5NNKq5u_mYeYsym64-RGO3CTofuyHWrjj3Y8vU-XKEqyI3-9uG_IrmffdEXXb-RwYaIgkueZ1W9JB1tnzxU6xDZJ6FS2fsBTEbDye2YdnULqEb4TZUVsTbcOZE7ZTKJDlEZ5yrUDrDS-TiRXkhtE8YME6a0irnS5pmw3ipEj2lyCD10_X0fSKJz3BNzUiOyUkKib8RtmSaB65Fpw46g3-y2-GhTUxRBEEUjiOP_h05gG81-GiKjTJ5Cbzlf-TOIFm513sj3B8ZFd4w
link.rule.ids 228,230,782,887
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+Bayesian+Low-Rank+Determinantal+Point+Process+Mixture+Model&rft.au=Gartrell%2C+Mike&rft.au=Paquet%2C+Ulrich&rft.au=Koenigstein%2C+Noam&rft.date=2016-08-15&rft_id=info:doi/10.48550%2Farxiv.1608.04245&rft.externalDocID=1608_04245