A Bayesian framework for change-point detection with uncertainty quantification
Abstract
We introduce a novel Bayesian method that can detect multiple structural breaks in the mean and variance of a length $T$ time-series. Our method quantifies the uncertainty by returning $\alpha$-level credible sets around the estimated locations of the breaks. In the case of a single change in the mean and/or the variance of an independent sub-Gaussian sequence, we prove that our method attains a localization rate that is minimax optimal up to a $\log T$ factor. For an $\alpha$-mixing sequence with dependence, we prove this optimality holds up to $\log^2 T$ factor. For $d$-dimensional mean changes, we show that if $d \gtrsim \log T$ and the mean signal is dense, then our method exactly recovers the location of the change at the optimal rate. We show that we can modularly combine single change-point models to detect multiple change-points. This approach enables efficient inference using a variational approximation of the posterior distribution for the change-points. The proposal is applicable to both continuous and count data. Extensive simulation studies demonstrate that our method is competitive with the state-of-the-art and returns credible sets that are an order of magnitude smaller than those returned by competitors without sacrificing nominal coverage guarantees. We test our method on real data by detecting i) gating of the ion channels in the outer membrane of a bacterial cell, and ii) changes in the lithological structure of an oil well.