Bayesian Linear Regression

Mark Andrews

The Bayesian linear model

\[y_i \sim \mathrm{N}(\mu_i, \sigma^2)\] \[\mu_i = \beta_0 + \beta_1 x_i\]

Parameters \(\beta_0, \beta_1, \sigma^2\) receive prior distributions. MCMC samples from the posterior over all parameters jointly.

M_5 <- brm(weight ~ height + gender,
  save_pars = save_pars(all = TRUE),
  data = weight_df
)
M_5

The summary output reports posterior mean, SD, and credible intervals for each parameter.

plot(M_5)

[Diagram: four coloured wavy lines running horizontally — the “caterpillar” trace plot]

A well-mixed chain shows no trends, no stuck periods. All four chains should overlap in the same region.

mcmc_plot(M_5, type = "hist")
mcmc_plot(M_5, type = "dens")
mcmc_plot(M_5, type = "areas")

Each parameter’s marginal posterior is a distribution, not a point estimate. The width of the distribution reflects remaining uncertainty.

draws <- as_draws_df(M_5)

mean(draws$b_height)
quantile(draws$b_height, probs = c(0.025, 0.975))
mean(draws$b_height > 0)

Any posterior quantity can be computed from the samples.

[Diagram: 2D contour plot of joint posterior over two parameters. Concentric oval contours showing correlation between parameters.]

The joint posterior captures correlations between parameters. Marginal summaries do not.

fixef(M_5)
prior_summary(M_5)
stancode(M_5)