Welcome to the Stat 295
Home Page


This is from Mike West's website at Duke University


NEW: Ted Hart pointed me to the following blog, by Jim Albert. It's directed towards learning Bayes, and is connected with his book, Bayesian Computation with R, and is apparently keyed to a course he is teaching this semester. Lots of good advice here.

I've started experimenting with my own blog for the course. Not much there, but we'll see how it develops.

 

Charts for Lectures

(PDF Files, you will need Adobe® Acrobat Reader® to read these)

0. Preliminaries
1. Introduction
2. Simple Examples
3. Bayes' Theorem NOTE: THIS VERSION CONTAINS THE HIDDEN CHARTS
4. Interpretation
5. MCMC
6. Poisson Inference
7. Normal Linear Models
8. Priors
9. Hierarchical Bayes
10. Hypothesis Testing
11. Model Selection and Averaging
12. Missing and Censored Data


Reading

8/27 Start reading Chapter 1 of Gelman, et. al. and Chapter 1 of Lavine's online book.

9/18 I recommend that you read Andrew Gelman's blog on a regular basis. Today he has a discussion of R coding style.

9/26 Andrew Gelman's blog points to a site where an R tutorial is being developed.

11/8 Here is a copy of the Dellaportas et. al. paper.

 

Problem Sets

9/7 Working with one or two others, start on the First Problem Set (Due Monday, September 17)

9/17 Again working in small groups, do the following problems from Section 1.10 of Lavine's web-based book: Problems 2, 28, 29, 35, 38. If you wish, you can use R to answer problem 2 by simulation. (Due Friday, September 28)

9/28 Working with one or two others, start on the Third Problem Set (Due Friday, October 12)

10/15 Do the Fourth Problem Set (Due Monday, October 22).

10/26 Do the Fifth Problem Set (Due Friday, November 9).

11/8 Do the Sixth Problem Set (Due Friday, November 16)

11/15 Do the Seventh Problem Set (Due Wednesday, December 5)

 

Examples from Class (Number indicates corresponding chart set)

5.1 Example 4 - Normal Inference with Metropolis-Hastings Sampling (Metropolis Within Gibbs)
5.2 Example 5 - Same, but with all-at-once sampling.
6.1 Example 6 - Errors-in-variables example
10.1 Example 7 - Hypothesis testing example with reversible-jump MCMC

 


Syllabus

Bayesian Inference (Stat 295)

This is a course in Bayesian statistics. The instructor is an astronomer by profession, so the course will emphasize applications to the physical sciences; however, the material of the course will be useful for applying Bayesian inference in a wide variety of contexts. Bayesian inference is a powerful and increasingly popular statistical approach, which allows one to deal with complex problems in a conceptually simple and unified way. The recent introduction of Markov Chain Monte Carlo (MCMC) simulation methods has made possible the solution of large problems in Bayesian inference that were formerly intractable. This course will introduce the student to the basic methods and techniques of modern Bayesian inference, including parameter estimation, MCMC simulation, hypothesis testing, and model selection/model averaging in the context of practical problems.

Books

Bayesian Data Analysis, Second Edition (Andrew Gelman, John B. Carlin, Hal S. Stein and Donald B. Rubin. London: Chapman and Hall)

Introduction to Statistical Thought (Michael Lavine), available here as a free web download.

A new book has just been published that uses R extensively to introduce the basic ideas of Bayesian inference and computation. It is too late to order copies for the course, but it can be ordered on Amazon.com and other online book suppliers. I highly recommend it. It is Jim Albert's book, Bayesian Computation with R. At under $50 from Amazon.com (I have not checked prices on other services), it is relatively inexpensive. It also comes with a web-available R package that for installation in your copy of R, that contains a large number of useful R functions.

Topics (not necessarily in this order; subtopics will be presented as appropriate)

Review of probability calculus. Interpretations of probability (e.g., frequency, degree-of-belief). Coherence. Bayes's Theorem. Joint, conditional, and marginal distribution. Independence. Prior distribution, likelihood, and posterior distribution. Bayesian estimation and inference on discrete state spaces. Likelihoods, odds and Bayes factors. Simple and composite alternatives.

Markov Chain Monte Carlo (MCMC) simulation as a method for practical calculation of Bayesian results. The Gibbs sampler. Metropolis-Hastings sampling. Metropolis-within-Gibbs sampling. Computer tools, e.g., BUGS,S+, R.

Bayesian point and interval parameter estimation. Bayesian credible intervals. Comparison with frequentist parameter estimation and confidence intervals. Bayesian inference on Gaussian distributions. Maximum Likelihood estimation as a Bayesian approximation. Laplace's approximation. Bayesian inference in non-Gaussian cases, e.g., Poisson, Cauchy, and arbitrary distributions. Linear and nonlinear models. Errors-in-variables models. Selection models. Hierarchical models

Prior selection. Subjective and objective priors. Priors as a way to encode actual prior knowledge. Sensitivity of the posterior distribution to the prior. Priors for hierarchical models.

Bayesian hypothesis testing. Comparison with frequentist hypothesis testing. Model selection and model averaging. Reversible jump MCMC for models of variable size. Approximations, e.g., AIC, BIC. Philosophical issues, likelihood principle, and the Bayesian Ockham's Razor.

Grading

The course grade will be based 80% on the assignments and 20% on class participation. By class participation, I mean that I will often leave unanswered questions in the notes that will be found on the web. You should read the notes in advance and attempt to answer these questions for yourselves. I will ask students for their answers to these questions in class. Also, I will sometimes ask for students' ideas about how they solved the assignments.

In general, I encourage students to work on the assignments in small groups of two or three (maximum). Statistics is by nature a cooperative enterprise. Statisticians act as experts in that field and advise clients (who are experts in their fields) on how to apply statistics to their problem. By working in groups, I hope to foster this sort of cooperative attitude between students in the class. If a group works on an assignment, I would like one paper turned in for the group, with everyone's name at the top. It goes without saying that I expect that everyone who works in a group will contribute roughly equally to the final result. For example, in a programming assignment, each member of the group should attempt to program the problem, and the group should then try to work out differences (e.g., if different students in the group arrive at different results, the group should try to figure out why this is so, to locate the sources of the discrepancies and fix them; if no resolution can be found, then the students should turn in a paper that displays the several different attempts with a discussion explaining the group's best understanding of the reasons for the discrepancy). Similarly, if a problem is worked and different members of the group obtain different answers, a similar resolution should be attempted, and if no agreement is obtained, the group should present a discussion. My role will be to examine what each group presents and comment on them, as well as to provide a grade.

Office Hours

My office is Lord 107. Office hours are MWF 10-10:50.

 


Web Resources

A very basic discussion of the intuitive basis of Bayesian reasoning can be found at http://yudkowsky.net/bayes/bayes.html. This contains some javascript calculators to try out simple calculations.

Tom Loredo's Bayesian Inference in the Physical Sciences (BIPS) website has a lot of useful information about Bayesian inference. Note particularly the first five items in his Bayesian Reprints page, which are very nice tutorials on practical application of Bayesian inference. He also has extensive pointers to other websites including software, reprint archives, etc.

The book by E. T. Jaynes can be found in a preliminary form here.

First Bayes is a software package that is intended to help students with the first steps in understanding Bayesian inference. It runs under Windows. It concentrates on simple, closed-form examples but may be helpful to you.

The International Society for Bayesian Analysis (ISBA) is the international Bayesian organization. It sponsors meetings and publishes a newsletter. Dues are not expensive, and for students are set at a reduced rate of $10/year.

Bayesians Worldwide contains links to the home pages of a large number of Bayesians. Many of these individuals maintain collections of their reprints. Most of the prominent Bayesians are listed.

The Bayesian Songbook contains songs that have been presented at various Bayesian meetings over the years. Just for fun. There are also links to pictures of the infamous "Cabarets" at which these songs were sung.

 


Free Software

Carnegie-Mellon University's statistics group has a library of many different statistics packages, including Bayesian packages. It can be accessed here.

Although CMU archives the R package, it's best to go to the R Project (CRAN) homepage, since you'll probably get the most recent version of it. Click here. R runs on Windows, Linux, UNIX and Macintosh (OS 9 or higher). The introductory tutorial for R can be found here. Many add-on packages for R are available at CRAN.

The BUGS project at the University of Cambridge offers the BUGS (Bayesian inference Using Gibbs Sampling) package. It does both Gibbs and Metropolis-Hastings sampling, and the software can be downloaded here. It runs on Windows and the "classic" version runs on UNIX. There is no Mac version of BUGS. However, if you purchase Virtual PC you can run it on a Mac (at reduced speed), but Virtual PC is not free. Virtual PC is sold by Microsoft. Another (cheap) system is sold by iEmulator. The new Macintosh computers (based on Intel chips) can run BUGS using Boot Camp and your own copy of Windows, and they will run it at full speed. There are other systems that will also run Windows on the new Macintoshes, for example, Parallels.

Not so free software

S Plus is not free, but there is a fairly inexpensive student package. It is sold by Mathsoft. Most of the functionality of S Plus can be found in the free R package (above) so unless you need something not available in R, you don't need to buy S Plus. Also, S Plus has some memory management problems that cause problems in large simulations. R does not have this problem.

Matlab is another software package that has become popular for MCMC simulations. It is faster than R or S Plus. A student version is available. Matlab can be instructed to produce C or C++ code, which will run very fast.

Another software package that has been used successfully in MCMC simulations is Gauss, sold by Aptech.

Minitab is used by quite a few people.

SAS is extremely powerful. It is reputed to be the most difficult of the popular packages to learn. There are versions for Windows, and UNIX. The recent Intel Macintoshes can run it under Boot Camp.

Yet another popular package is SPSS. There are versions for Windows and Macintosh.


You may send me an E-mail message right now, if you have any questions or comments about the course. My office is Lord 107. My office hours are MWF 10-10:50.

Here is a link to my Home Page.
This page is under construction. Keep tuned for new material.


All materials at this website Copyright (C) 1994-2007 by William H. Jefferys. All Rights Reserved.