Bayesian analysis in natural language processing

Bibliographic Details
Main Author: Cohen, Shay (Author)
Format: Electronic eBook
Language: English
Published: Cham, Switzerland : Springer, 2019.
Edition: Second edition.
Series: Synthesis lectures on human language technologies ; #41.
Subjects: Natural language processing (Computer science); Bayesian statistical decision theory
Online Access: Connect to this title online

MARC

LEADER 00000cam a2200000Mi 4500
001 b3939706
003 CStclU
005 20240627104538.0
006 m o d
007 cr |n|||||||||
008 190420s2019 sz ob 001 0 eng d
020 |a 9781681735276  |q (electronic bk.) 
020 |a 168173527X  |q (electronic bk.) 
020 |z 9781681735283  |q (hardcover) 
020 |z 1681735288  |q (hardcover) 
020 |z 9781681735269  |q (paperback) 
020 |z 1681735261  |q (paperback) 
020 |a 9783031021701  |q (electronic bk.) 
020 |a 3031021703  |q (electronic bk.) 
035 |a (OCoLC)1097979255  |z (OCoLC)1126121508  |z (OCoLC)1138974800  |z (OCoLC)1167493024 
040 |a EBLCP  |b eng  |e rda  |e pn  |c EBLCP  |d UIU  |d BNG  |d YOU  |d MGCLP  |d OCLCQ  |d N$T  |d OCLCQ  |d LUN  |d LDP  |d OCLCO  |d UKAHL  |d GW5XE  |d OCLCQ  |d UtOrBLW 
049 |a STAW 
050 4 |a QA76.9.N38  |b C643 2019eb 
100 1 |a Cohen, Shay,  |e author.  |0 http://id.loc.gov/authorities/names/nb2016015244 
245 1 0 |a Bayesian analysis in natural language processing /  |c Shay Cohen. 
250 |a Second edition. 
264 1 |a Cham, Switzerland :  |b Springer,  |c 2019. 
300 |a 1 online resource (345 pages). 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
490 1 |a Synthesis lectures on human language technologies,  |x 1947-4059 ;  |v #41 
500 |a Part of: Synthesis digital library of engineering and computer science. 
504 |a Includes bibliographical references and index. 
505 0 |a Intro; List of Figures; Preface (First Edition); Acknowledgments (First Edition); Preface (Second Edition); Preliminaries; Probability Measures; Random Variables; Continuous and Discrete Random Variables; Joint Distribution over Multiple Random Variables; Conditional Distributions; Bayes' Rule; Independent and Conditionally Independent Random Variables; Exchangeable Random Variables; Expectations of Random Variables; Models; Parametric vs. Nonparametric Models; Inference with Models; Generative Models; Independence Assumptions in Models 
505 8 |a Directed Graphical Models; Learning from Data Scenarios; Bayesian and Frequentist Philosophy (Tip of the Iceberg); Summary; Exercises; Introduction; Overview: Where Bayesian Statistics and NLP Meet; First Example: The Latent Dirichlet Allocation Model; The Dirichlet Distribution; Inference; Summary; Second Example: Bayesian Text Regression; Conclusion and Summary; Exercises; Priors; Conjugate Priors; Conjugate Priors and Normalization Constants; The Use of Conjugate Priors with Latent Variable Models; Mixture of Conjugate Priors; Renormalized Conjugate Distributions 
505 8 |a Discussion: To Be or not to Be Conjugate?; Summary; Priors Over Multinomial and Categorical Distributions; The Dirichlet Distribution Re-Visited; The Logistic Normal Distribution; Discussion; Summary; Non-Informative Priors; Uniform and Improper Priors; Jeffreys Prior; Discussion; Conjugacy and Exponential Models; Multiple Parameter Draws in Models; Structural Priors; Conclusion and Summary; Exercises; Bayesian Estimation; Learning with Latent Variables: Two Views; Bayesian Point Estimation; Maximum a Posteriori Estimation; Posterior Approximations Based on the MAP Solution 
505 8 |a Decision-Theoretic Point Estimation; Discussion and Summary; Empirical Bayes; Asymptotic Behavior of the Posterior; Summary; Exercises; Sampling Methods; MCMC Algorithms: Overview; NLP Model Structure for MCMC Inference; Partitioning the Latent Variables; Gibbs Sampling; Collapsed Gibbs Sampling; Operator View; Parallelizing the Gibbs Sampler; Summary; The Metropolis-Hastings Algorithm; Variants of Metropolis-Hastings; Slice Sampling; Auxiliary Variable Sampling; The Use of Slice Sampling and Auxiliary Variable Sampling in NLP; Simulated Annealing; Convergence of MCMC Algorithms 
505 8 |a Markov Chain: Basic Theory; Sampling Algorithms Not in the MCMC Realm; Monte Carlo Integration; Discussion; Computability of Distribution vs. Sampling; Nested MCMC Sampling; Runtime of MCMC Samplers; Particle Filtering; Conclusion and Summary; Exercises; Variational Inference; Variational Bound on Marginal Log-Likelihood; Mean-Field Approximation; Mean-Field Variational Inference Algorithm; Dirichlet-Multinomial Variational Inference; Connection to the Expectation-Maximization Algorithm; Empirical Bayes with Variational Inference; Discussion; Initialization of the Inference Algorithms 
505 8 |a Convergence Diagnosis 
520 |a Natural language processing (NLP) went through a profound transformation in the mid-1980s when it shifted to make heavy use of corpora and data-driven techniques to analyze language. Since then, the use of statistical techniques in NLP has evolved in several ways. One such example of evolution took place in the late 1990s or early 2000s, when full-fledged Bayesian machinery was introduced to NLP. This Bayesian approach to NLP has come to address various shortcomings of the frequentist approach and to enrich it, especially in the unsupervised setting, where statistical learning is done without target prediction examples. In this book, we cover the methods and algorithms that are needed to fluently read Bayesian learning papers in NLP and to do research in the area. These methods and algorithms are partially borrowed from both machine learning and statistics and are partially developed "in-house" in NLP. We cover inference techniques such as Markov chain Monte Carlo sampling and variational inference, Bayesian estimation, and nonparametric modeling. In response to rapid changes in the field, this second edition of the book includes a new chapter on representation learning and neural networks in the Bayesian context. We also cover fundamental concepts in Bayesian statistics such as prior distributions, conjugacy, and generative modeling. Finally, we review some of the fundamental modeling techniques in NLP, such as grammar modeling, neural networks and representation learning, and their use with Bayesian analysis. 
588 0 |a Online resource; title from digital title page (viewed on May 7, 2019). 
650 0 |a Natural language processing (Computer science)  |0 http://id.loc.gov/authorities/subjects/sh88002425 
650 0 |a Bayesian statistical decision theory.  |0 http://id.loc.gov/authorities/subjects/sh85012506 
650 7 |a Bayesian statistical decision theory.  |2 fast  |0 (OCoLC)fst00829019  |0 http://id.worldcat.org/fast/829019 
650 7 |a Natural language processing (Computer science)  |2 fast  |0 (OCoLC)fst01034365  |0 http://id.worldcat.org/fast/1034365 
740 0 |a Springer Nature Synthesis Collection of Technology Collection 7. 
776 0 8 |i Print version:  |a Cohen, Shay.  |t Bayesian analysis in natural language processing  |z 9781681735283  |z 9781681735269 
830 0 |a Synthesis lectures on human language technologies ;  |v #41.  |0 http://id.loc.gov/authorities/names/no2009100468 
856 4 0 |u https://login.libproxy.scu.edu/login?url=https://link.springer.com/10.1007/978-3-031-02170-1  |z Connect to this title online  |t 0 
907 |a .b39397063  |b 240629  |c 230418 
998 |a uww  |b 230418  |c m  |d z   |e l  |f eng  |g sz   |h 0 
918 |a .bckstg  |b 2016-12-01 
919 |a .ulebk  |b 2022-07-07 
999 f f |i db933ef6-d6f7-5e99-b9b3-1293ffb4ea95  |s e124b5cb-bdeb-55f6-bce6-a4ea0951a348  |t 0