We describe latent Dirichlet allocation (LDA), a generative probabilistic model for collections of discrete data such as text corpora. LDA is a three-level hierarchical Bayesian model, in which each item of a collection is modeled as a finite mixture over an underlying set of topics. Each topic is, in turn, modeled as an infinite mixture over an underlying set of topic probabilities. In the context of text modeling, the topic probabilities provide an explicit representation of a document. We present efficient approximate inference techniques based on variational methods and an EM algorithm for empirical Bayes parameter estimation. We report results in document modeling, text classification, and collaborative filtering, comparing to a mixture of unigrams model and the probabilistic LSI model.
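The three levels named in the abstract (corpus-level parameters, per-document topic proportions, per-word topic assignments) can be illustrated with a minimal sketch of the generative process. It assumes a fixed document length and a toy two-topic vocabulary. The names `sample_dirichlet` and `generate_document`, and all parameter values, are hypothetical and chosen only for illustration. It does not cover the variational inference or EM estimation the paper presents.

```python
import random

def sample_dirichlet(alpha, rng):
    # Independent Gamma(alpha_i, 1) draws, normalised, follow Dirichlet(alpha).
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def generate_document(alpha, beta, vocab, n_words, rng):
    """Draw one document under the LDA generative story.

    alpha: Dirichlet parameter, one entry per topic (corpus level).
    beta:  per-topic word probabilities, beta[k][v] (corpus level).
    n_words is fixed here for simplicity (an assumption of this sketch).
    """
    # Document level: topic proportions for this document.
    theta = sample_dirichlet(alpha, rng)
    topics = range(len(alpha))
    words = []
    for _ in range(n_words):
        # Word level: pick a topic, then pick a word from that topic.
        z = rng.choices(topics, weights=theta)[0]
        w = rng.choices(vocab, weights=beta[z])[0]
        words.append(w)
    return theta, words

# Toy setup (illustrative values only).
rng = random.Random(0)
vocab = ["gene", "dna", "cell", "vote", "law", "court"]
beta = [
    [0.4, 0.3, 0.3, 0.0, 0.0, 0.0],  # topic 0: biology-like words
    [0.0, 0.0, 0.0, 0.4, 0.3, 0.3],  # topic 1: politics-like words
]
alpha = [0.5, 0.5]
theta, doc = generate_document(alpha, beta, vocab, 8, rng)
print(theta, doc)
```

Here `theta` plays the role of the topic probabilities that the abstract describes as an explicit representation of a document.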
Blei et al. studied this question.
www.synapsesocial.com/papers/69da7fd3387cf70698686d27 — DOI: https://doi.org/10.5555/944919.944937
David M. Blei
Andrew Y. Ng
Michael I. Jordan
Journal of Machine Learning Research
Stanford University
University of California, Berkeley