Tonic is a collection of pure R tools for generating and manipulating MCMC output. It is also a nice mixer.
The current version includes the top-level functions:
- gw_sampler - sample a target density using the ensemble sampler of Goodman & Weare (2010)
- mh_sampler - sample a target density using the random walk M-H method
- contour_matrix - plot an NxN matrix summarising pairwise relationships between variables
- chain_convergence - make diagnostic plots of chain outputs
Tonic is an R package, but is still in development. To set up from GitHub first install (if you haven't already) Hadley Wickham's devtools package.
install.packages("devtools")
Now you can install tonic straight from GitHub:
devtools::install_github("svdataman/tonic")
It will also install the mvtnorm package which it depends on. Now you're good to go.
Let's define a posterior to sample. This should be an R function that
- takes a vector of parameters as its first argument,
- returns a (scalar) log density (up to some normalisation constant,
- return -Inf where the posterior is zero (e.g. because prior is zero). Here we'll use a multivariate normal as a simple example.
my_posterior <- function(theta) {
cov <- matrix(c(1,0.98,0.8, 0.98,1.0,0.97, 0.8,0.97,2.0), nrow = 3)
logP <- mvtnorm::dmvnorm(theta, mean = c(-1, 2, 0), sigma = cov, log = TRUE)
return(logP)
}
Now we can generate a sample from this posterior using the Goodman-Weare algorithm as follows.
chain <- tonic::gw_sampler(my_posterior, theta.0 = c(0,0,0), nsteps = 1e4)
The output is list (chain) containing, among other things, an array (theta) with 10,000 samples.
- You must specify the name of the posterior function and the initial values for parameters (theta.0).
- You may change the number of walkers (default: nwalkers = 100).
- There is a `burn in' period (default: burn.in = 2000). This means the first few iterations are thrown away (not returned).
- For a list of other parameters (optional), see the built-in help.
Or use the random walk Metropolis-Hastings algorithm:
chain <- tonic::mh_sampler(my_posterior, theta.0 = c(0,0,0), nsteps = 1e4)
In this case the acceptance rate (see below) is rather low.
- You may manually specify a more suitable covariance matrix (set the cov parameter equal to a valid covariance matrix).
- You may 'adapt' the proposal distribution based on the samples from the 'burn in' period (adapt = TRUE).
- By default we produce the nsteps samples by merging the output from nchains independent chains (default: nchains = 5).
- There is a `burn in' period (default: burn.in = 1000). This means the first few iterations are thrown away (not returned).
- For a list of other parameters (optional), see the built-in help.
We can visualise the multivariate samples using the contour_matrix() function.
tonic::contour_matrix(chain$theta, prob.levels = c(0.9, 0.99), smooth1d = TRUE)
The [i, j] panel shows the theta[, i] variable plotted against the theta[, j] variable. We plot density contours enclosing 90% and 99% of the mass, and points for samples beyond the outermost contour. Along the diagonal ([i, i] panels) we show the (smoothed) density (optionally: histogram) of variable theta[, i]. For more on how to customise the plot, see the built-in help.s
It is a good idea to check the output before taking too seriously any inferences based on the posterior sample. First, check the average acceptance rate:
print(chain$accept.rate)
One usually aims for a value in the range 0.1-0.8.
Now, check the behaviour of the chains. The chain_diagnosis() function plots, for each parameter of the posterior, the traces of each walker (gw) or chain (mh), the ACFs and the distributions.
tonic::chain_diagnosis(chain)
Here is an example using output from gw_sampler().
Here we see the chains seem to be long enough (ACF decays to zero, trace plots show no outlying walkers or trend in the mean, ...). For more details of what is shown, see the built-in help.
The output can also be assessed using the chain_convergence() function
tonic::chain_convergence(chain)
This plots, for each variable, the Gelman-Rubin R.hat statistic along with a comparison of the 80% intervals from each chain. (The MH method runs with nchains independent chains, the GW method runs with nwalkers walkers.) If the chains are `well mixed' R.hat should be close to 1.0 (ideally <1.1) and the intervals for each chain should share a lot of overlap.
Note that this is more useful for the output of the MH method. (The GW method works best with a large ensemble of walkers - nwalkers >= 50 - but requires fewer iterations of the full ensemble, so the inter-walker comparisons are less useful.)
The coda package provides more diagnostic functions that may be used. (If you don't have coda you will need to use install.packages("coda").) E.g.
require(coda)
summary(mcmc(chain$theta))
effectiveSize(mcmc(chain$theta))
The latter estimates the effective sample size (for each variable), i.e the sample size adjusted for the autocorrelation timescale. (The mcmc() function converts the theta array into an mcmc object ready for coda functions.)
Getting well-behaved MCMC output can be tricky. If the chains are not well-mixed, you might want to:
- check the posterior (and prior) are well-defined.
- use a longer burn-in period (burn.in).
- use a different start position (theta.0).
- provide a better choice of covariance matrix (parameter: cov) for the MH method, and/or adjust this after burn-in (adjust = TRUE).
- use a t-distribution for the proposal (MH: method = "t").
- use more walkers or chains (GW method: nwalkers; MH method: nchains).
- adjust the details of the update move in the GW method (parameters: walk.rate, atune, stune).
- let the chains run for longer (nsteps).
For more help on each function, try the in-built help, e.g.
? tonic::gw_sampler
? tonic::mh_sampler
? tonic::contour_matrix
? tonic::chain_convergence
- periodic saving of output
- configure input/output so that can pick-up chain(s) where left off.
- annealing? (with evidence calculation)
- evidence from nested sampling?
- add Laplace evidence function, MAP finder
If you find tonic useful in your work, please cite the following paper for which tonic was developed:
S. Vaughan et al., 2016, MNRAS, v461, pp3145-3152
The contour_matrix() and mh_sampler() functions are a development of the code used to produce Fig. 2 of S. Vaughan (2010, MNRAS, v402, pp307-320); the former is based on the pairs() plots of base R.