Proposed Feature Extension for CausalPy: Automatic Detection of Intervention Timing #478

JeanVanDyk · 2025-05-26T13:47:42Z

JeanVanDyk
May 26, 2025
Collaborator

This feature aims to extend the existing functionality "Interrupted Time Series" in CausalPy by allowing users to infer the timing of an intervention, rather than requiring it to be specified in advance.

What the feature would provide

Instead of specifying when an intervention occurred, the user describes the type of effect they expect the intervention to have: a level shift, a trend change, or both. The model then uses this structure to infer the most likely time at which such a change occurred.

The user may:

Provide a range of plausible intervention times, or
Let the model assume no prior knowledge, allowing it to search across the full time span.

The model includes default pre- and post-intervention structures, but users can optionally define their own. Users may also specify the expected form of the intervention effect, which determines the model used and can improve precision. In the end, users can obtain:

The posterior distribution over possible intervention times,
And the posterior distributions of the pre- and post-intervention parameters.

Bayesian default model

One widely used and easy-to-implement Bayesian model for detecting intervention effects is presented by Xueheng Shi et al. (2022). This model, which has been tested across various scenarios, is versatile and capable of detecting both level shifts and trend changes.

The structure supports a discrete change point and allows the slope and intercept to vary before and after the intervention. However, model precision can potentially be improved by simplifying the structure when prior knowledge is available — for instance, using a pure level-shift model (by removing time dependence) or a pure trend-change model (by removing the intercept discontinuity).
The full model is expressed as:

$$ \begin{align} & \tau \sim \text{DiscreteUniform}(0, T) \\ & \sigma \sim \text{HalfNormal}(1) \\ \\ & \mu_1 \sim \text{Normal}(0, 10) \\ & \alpha_1 \sim \text{Normal}(0, 10) \\ \\ & \mu_2 \sim \text{Normal}(0, 10) \\ & \alpha_2 \sim \text{Normal}(0, 10) \\ \\ & \mu = \begin{cases} \alpha_1 + \mu_1 \cdot t & \text{if } \tau < t \\ \alpha_2 + \mu_2 \cdot t & \text{otherwise} \end{cases} \\ \\ & y \sim \text{Normal}(\mu, \sigma) \end{align} $$
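To make the role of the change point concrete, here is a minimal pure-Python sketch. It is not the proposed CausalPy implementation. It evaluates the piecewise mean exactly as the case statement is written and, holding the line parameters and σ fixed (an assumption made only to keep the example short, since the full model samples them too), enumerates the conditional posterior of τ over 0..T. The function names and all numeric values are illustrative.

```python
import math


def piecewise_mean(t, tau, alpha_1, mu_1, alpha_2, mu_2):
    """Model mean: parameters indexed 1 apply where tau < t, index 2 otherwise."""
    if tau < t:
        return alpha_1 + mu_1 * t
    return alpha_2 + mu_2 * t


def tau_posterior(y, params, sigma=1.0):
    """Posterior over tau in 0..T, conditional on fixed line parameters and sigma."""
    T = len(y) - 1
    log_post = []
    for tau in range(T + 1):
        # Gaussian log-likelihood up to a constant; the DiscreteUniform prior
        # contributes the same constant for every tau, so it can be dropped.
        log_lik = sum(
            -0.5 * ((y_t - piecewise_mean(t, tau, *params)) / sigma) ** 2
            for t, y_t in enumerate(y)
        )
        log_post.append(log_lik)
    peak = max(log_post)  # subtract the maximum for numerical stability
    weights = [math.exp(lp - peak) for lp in log_post]
    total = sum(weights)
    return [w / total for w in weights]


# Illustrative, noise-free data with a change point at tau = 12.
params = (2.0, 0.25, 0.0, 0.1)  # alpha_1, mu_1, alpha_2, mu_2 (made-up values)
y = [piecewise_mean(t, 12, *params) for t in range(30)]
posterior = tau_posterior(y, params)
most_likely_tau = posterior.index(max(posterior))
```

In the full model the sampler explores τ jointly with the slopes, intercepts, and σ, so the posterior over τ is typically wider than this conditional version suggests.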

Proposal: Integrating Automatic Intervention Time Detection into InterruptedTimeSeries

My initial idea is to add a new class in pymc_models.py that InterruptedTimeSeries would rely on to estimate the intervention time. InterruptedTimeSeries would use it automatically when given a range of times, or no time at all, for the intervention. We could also add a new optional parameter carrying the optional arguments used to initialize that class (effect, pre- and post-intervention models). Once the intervention time is estimated, InterruptedTimeSeries would continue as usual, using the most likely intervention time as the intervention time.

To support automatic inference of the intervention time, I propose creating a new class in pymc_models.py that encapsulates the logic for estimating when the intervention most likely occurred. The InterruptedTimeSeries class would automatically delegate to this new class when:

A range of possible intervention times is provided, or
The intervention time is not specified at all.

We could also introduce a new optional argument in InterruptedTimeSeries to allow users to pass in:

The expected intervention effect (e.g., level, trend, or both),
And/or custom pre- and post-intervention model structures.

The workflow would then become:

If no fixed intervention time is provided, InterruptedTimeSeries initializes and runs the intervention time estimator.
Once the most likely intervention time is inferred (based on the specified or default effect model), the regular interrupted time series workflow resumes, treating the estimated time as the intervention point.

What do you think?

Here I've gathered all my thoughts so far and outlined how I think this feature could work best. Since I'm still new to CausalPy, I’m looking forward to your feedback! Please let me know if there’s anything I should be careful about or any suggestions before I start implementing. I’m excited to get started and appreciate any guidance you can provide.

Example

Consider a time series where an intervention causes the trend to increase from 0.1 to 0.25, along with a sudden level jump of 2:
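A series of this kind can be simulated with the standard library alone. This is only a sketch of the scenario described: the series length, the switch point, the noise level, and the seed are assumptions, and `simulate_series` is a hypothetical helper rather than part of CausalPy.

```python
import random


def simulate_series(n=100, switch=60, slope_before=0.1, slope_after=0.25,
                    jump=2.0, noise_sd=0.5, seed=0):
    """Line with slope_before, then a level jump and slope_after from `switch` on."""
    rng = random.Random(seed)
    series = []
    for t in range(n):
        if t < switch:
            mean = slope_before * t
        else:
            # Continue from the pre-intervention level, add the jump, change slope.
            mean = slope_before * switch + jump + slope_after * (t - switch)
        series.append(mean + rng.gauss(0.0, noise_sd))
    return series


y = simulate_series()
```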

I’ve implemented the model, which follows the default structure described earlier:

After sampling, here are the typical plot outputs users can expect to see from this new feature:

drbenvincent · 2025-05-26T14:32:17Z

drbenvincent
May 26, 2025
Maintainer

Discrete vs continuous

In terms of parameter estimation we want to estimate continuous quantities wherever possible, to take advantage of the more efficient sampling algorithms. One way we can do this is by avoiding use of DiscreteUniform for the switch point.

If the treatment effect is modelled as a step change, this can become problematic, so a trick is to use a sigmoid function where there is a bit of smoothness to the step change, which helps the sampler get gradient information.
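The difference can be illustrated without any sampler. In the sketch below, the hard step is flat almost everywhere, so its derivative with respect to the switch point carries no information, while the sigmoid version has a non-zero derivative near the switch point. The `sharpness` parameter is an assumption added for illustration; it is not something proposed in this thread.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def hard_step(t, switchpoint):
    """0 before the switch point, 1 from it onwards; flat almost everywhere."""
    return 1.0 if t >= switchpoint else 0.0


def smooth_step(t, switchpoint, sharpness=1.0):
    """Sigmoid approximation of the step; larger sharpness is closer to a hard step."""
    return sigmoid(sharpness * (t - switchpoint))


def smooth_step_grad(t, switchpoint, sharpness=1.0):
    """Derivative of smooth_step with respect to the switch point."""
    s = smooth_step(t, switchpoint, sharpness)
    return -sharpness * s * (1.0 - s)
```

A gradient-based sampler can use `smooth_step_grad` to move the switch point towards better-fitting values, whereas with `hard_step` a small change in the switch point usually leaves the predicted mean unchanged.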

1 reply

JeanVanDyk May 26, 2025
Collaborator Author

Thanks for the feedback — that makes a lot of sense. I've replaced the DiscreteUniform switchpoint with a continuous parameter and implemented a smooth transition using a sigmoid function, as suggested. This change does indeed improve sampling efficiency.

drbenvincent · 2025-05-26T14:38:20Z

drbenvincent
May 26, 2025
Maintainer

Types of treatment effect / change being detected

The idea of doing more than just detecting a step change (as in a simple change point model) is very good. So a step change, or a slope change is good, but perhaps not sufficient to be useful in lots of situations.

I can imagine many scenarios where you might expect a change to be transient. Either because the treatment period has a start and stop date, or because the treatment creates an initial effect which then dies down despite the intervention still being in place.

How to deal with that is another matter, I'll try to drop in some ideas in response to your proposed algorithmic approach.

0 replies

drbenvincent · 2025-05-26T14:45:24Z

drbenvincent
May 26, 2025
Maintainer

Implementation / algorithm

I don't have a fixed idea about how this could work. But the default way I'd think about it is that you're trying to come up with a linear decomposition of the data. One part of that is a treatment effect component, which could be a parameterised function:

the simplest would be a step function, or sigmoid to help parameter estimation. The point of the step change would be described parametrically and could have a prior
another would be a linear increase from zero, which would capture your "change in slope" effect. That would have parameters for the point it happens at, and the slope.
another would be an exponential decay, or a step up and step down function (for effects that are transient)
I guess it's not impossible to imagine some unspecified temporal treatment effect which could perhaps be captured by a gaussian process. That is most general I guess, but probably poses some challenges with parameter identification, especially if the treatment effect is subtle.
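As a rough illustration of the parametric options listed above, the sketch below writes each treatment-effect shape as a plain function of time and adds them to a linear baseline. The function names and parameters are hypothetical. The edges are kept hard for readability; in a sampler they would likely be smoothed with a sigmoid. The Gaussian process option is not covered.

```python
import math


def step_effect(t, start, height):
    """Permanent level shift from `start` onwards."""
    return height if t >= start else 0.0


def ramp_effect(t, start, slope):
    """Linear increase from zero, i.e. a change in slope after `start`."""
    return slope * max(0.0, t - start)


def decay_effect(t, start, amplitude, rate):
    """Transient effect: jumps to `amplitude` at `start`, then decays exponentially."""
    return amplitude * math.exp(-rate * (t - start)) if t >= start else 0.0


def pulse_effect(t, start, stop, height):
    """Transient effect: step up at `start`, step back down at `stop`."""
    return height if start <= t < stop else 0.0


def decomposed_mean(t, intercept, slope, effects):
    """Baseline line plus the sum of treatment-effect components."""
    return intercept + slope * t + sum(effect(t) for effect in effects)


# Example: a level shift of 2 and a slope change of 0.15, both starting at t = 10.
effects = [
    lambda t: step_effect(t, 10, 2.0),
    lambda t: ramp_effect(t, 10, 0.15),
]
mu_20 = decomposed_mean(20, intercept=0.0, slope=0.1, effects=effects)
```

Because the components simply add, each one can carry its own parameters and priors, and a user could include only the shapes they expect.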

0 replies

JeanVanDyk · 2025-05-27T12:14:57Z

JeanVanDyk
May 27, 2025
Collaborator Author

Prototype

Taking into account your feedback, I've put together a prototype of what this model could look like.

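class InterventionEstimator:

    def __init__(self, value, time, range=None, model=None, effect=None):
        self.value = value    # observed time series
        self.time = time      # time index, same length as value
        self.effect = effect  # e.g. ["level", "trend", "impulse"]

        # Use the user's baseline model if given, otherwise build the default one
        self.model = model
        if self.model is None:
            self.build_default_model()

        # Add the switch point and the intervention effects to the model
        self.build_intervention(range, effect)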

To maintain flexibility, users can optionally provide their own PyMC model to model the base time series (i.e. without interventions). The only requirement is that the model defines a base_mu variable of the same dimension as the time series, accessible via model.named_vars["base_mu"].

If no custom model is provided, the class builds a simple linear model by default:

def build_default_model(self):

    with pm.Model() as self.model:

        # Linear component
        rate = pm.Normal("rate", mu=0, sigma=10)
        intercept = pm.Normal("intercept", mu=0, sigma=10)

        # Baseline mean of the time series (no intervention)
        base_mu = pm.Deterministic("base_mu", intercept + rate * self.time)

For now, the intervention logic is less flexible. The user specifies which types of intervention effects to include using the effect parameter, which can contain any combination of:

"level" — a discrete shift in mean after the change point

"trend" — a change in slope after the change point

"impulse" — a decaying impulse following the intervention

Here’s the current implementation:

def build_intervention(self, range=None, effect=None):

    # Avoid a mutable default argument; None means no intervention effects
    effect = effect if effect is not None else []

    with self.model:

        # Search for the intervention time in the given range, or over the full time span otherwise.
        if range is None:
            range = self.time
        switchpoint = pm.Uniform("switchpoint", lower=range.min(), upper=range.max())

        # For a smooth transition
        weight = pm.math.sigmoid(self.time - switchpoint)

        # Intervention level
        level = 0
        if "level" in effect:
            level = pm.Normal("level", mu=0, sigma=10)
        # Intervention trend
        trend = 0
        if "trend" in effect:
            trend = pm.Normal("trend", mu=0, sigma=10)
        # Intervention impulse
        impulse = 0
        if "impulse" in effect:
            impulse_amplitude = pm.Normal("impulse_amplitude", mu=0, sigma=5)
            decay_rate = pm.HalfNormal("decay_rate", sigma=1)
            impulse = impulse_amplitude * pm.math.exp(-decay_rate * (self.time - switchpoint))

        intervention = impulse + level + trend * (self.time - switchpoint)
        mu = pm.Deterministic("mu", self.model.named_vars["base_mu"] + weight * intervention)
        # Observation noise is fixed at sigma=2 in this prototype
        value = pm.Normal("value", mu=mu, sigma=2, observed=self.value)

I've added the "impulse" effect to capture short-term effects with exponential decay, as you suggested. It seems to work quite well, as shown below:

I've also added two built-in methods to visualize the model's behavior. The example below uses the COVID-19 dataset from the InterruptedTimeSeries "How-to" on the CausalPy website, with only the "impulse" effect applied. The plots show that the model struggles to converge on a clear switchpoint, likely due to unmodeled seasonality in the data. This suggests that extending the model to account for seasonal patterns may be necessary for better performance.

Let me know your thoughts on this!

Next possible steps:

Investigate how the model behaves on edge cases.
Evaluate the model's performance.
Explore strategies for handling seasonality, as the current model struggles in such cases (e.g. next plot)
Consider allowing users to define their own intervention expressions, similar to how base_mu is handled.

0 replies

JeanVanDyk · 2025-05-27T14:18:50Z

JeanVanDyk
May 27, 2025
Collaborator Author

Update on two matters

Seasonality

I implemented a seasonal component to better capture recurring patterns in the data, and the improvement in model performance is promising so far. Since I’m working with monthly data, I started with 12 season-specific parameters (one for each month):

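with pm.Model() as model:
    # Linear component
    rate = pm.Normal("rate", mu=0, sigma=10)
    intercept = pm.Normal("intercept", mu=0, sigma=10)

    # Seasonal component: one offset per month.
    # `months` is assumed to hold each observation's month index (0 to 11).
    season_effect = pm.Normal("season", mu=0, sigma=10, shape=12)
    season = season_effect[months]

    # Baseline mean (no intervention); `t` is the time index
    base_mu = pm.Deterministic("base_mu", intercept + rate * t + season)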

Next step: I plan to generalize this so the model can automatically adapt to different seasonalities — for example, weekly, quarterly, or any other cycle length — based on the number of seasons and observations per season.
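One possible way to generalize the indexing, sketched here under the assumption of regularly spaced observations with an integer time index, is to derive each observation's season from the number of seasons and the number of observations per season. `season_index` is a hypothetical helper; its output would play the role of the month index used to look up the seasonal parameters.

```python
def season_index(t, n_seasons, obs_per_season=1):
    """Map an integer time index to its season, assuming regularly spaced data."""
    return (t // obs_per_season) % n_seasons


# Monthly data, yearly cycle: 12 seasons, one observation each.
months = [season_index(t, n_seasons=12) for t in range(24)]

# Monthly data, quarterly seasons: 4 seasons, three observations each.
quarters = [season_index(t, n_seasons=4, obs_per_season=3) for t in range(24)]
```

The seasonal parameter vector would then need `shape=n_seasons` instead of the fixed 12.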

Impulse

I noticed that the impulse component I previously used was leaking into the time series before the switchpoint. To mitigate this, I modified the formulation to make the impulse symmetric and centered at the switchpoint by applying an absolute value:

impulse = impulse_amplitude * pm.math.exp(-decay_rate * abs(self.time-switchpoint))

This way, although the exponential decay (with the absolute value) creates a symmetric shape around the switchpoint, the sigmoid effectively suppresses any contribution before the switchpoint, ensuring the impulse activates only after the intervention. This makes the impulse more aligned with a realistic, one-sided causal effect.
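A small numeric check shows why the original form leaked. Before the switchpoint, the one-sided exponent grows like exp(decay_rate * distance), while the sigmoid weight only shrinks roughly like exp(-distance), so their product grows whenever decay_rate exceeds 1. With the absolute value, both factors shrink. The sketch below uses made-up values (amplitude 1, decay_rate 2) and a hypothetical helper, `gated_impulse`.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def gated_impulse(t, switchpoint, amplitude, decay_rate, symmetric):
    """weight * impulse, with the one-sided or the absolute-value exponent."""
    distance = t - switchpoint
    exponent = abs(distance) if symmetric else distance
    impulse = amplitude * math.exp(-decay_rate * exponent)
    return sigmoid(distance) * impulse


# Five time units before the switch point.
leaky = gated_impulse(t=5.0, switchpoint=10.0, amplitude=1.0,
                      decay_rate=2.0, symmetric=False)
fixed = gated_impulse(t=5.0, switchpoint=10.0, amplitude=1.0,
                      decay_rate=2.0, symmetric=True)
```

Here `leaky` is far larger than the amplitude itself, while `fixed` is negligible. After the switchpoint the two forms coincide.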

Results

Below are the updated results using this improved formulation, applied to the same dataset I mentioned in my previous post. The model now captures both seasonality and temporary shifts more reliably.

1 reply

JeanVanDyk May 27, 2025
Collaborator Author

Just to clarify: the black line represents the values of "base_mu" (the baseline trend without intervention), while the red line shows "mu" — which includes the effect of the intervention (base_mu + intervention).
