In production machine learning systems, one of the most critical operational challenges is detecting when a deployed model's behavior begins to deviate from its expected performance. This phenomenon, known as model drift or prediction drift, occurs when the statistical properties of model outputs change over time, often signaling degraded model quality, changing user behavior, or shifts in the underlying data distribution.
Effective MLOps (Machine Learning Operations) practices require continuous monitoring of model predictions to ensure reliability and enable timely model retraining or intervention. This problem focuses on implementing a comprehensive prediction distribution monitoring system.
Given two sequences of prediction scores—a reference set representing baseline model behavior (e.g., from validation or a stable production period) and a current set representing recent model outputs—compute key statistical metrics that quantify distribution differences and detect potential drift.
$$\text{Mean Shift} = \mu_{current} - \mu_{reference}$$
$$\text{Std Ratio} = \frac{\sigma_{current}}{\sigma_{reference}}$$
For the Jensen-Shannon divergence calculation:
$$\mathrm{JSD}(P \,\|\, Q) = \frac{1}{2} \cdot KL(P \,\|\, M) + \frac{1}{2} \cdot KL(Q \,\|\, M)$$
where $$M = \frac{1}{2}(P + Q)$$
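The formula above can be illustrated directly on small discrete distributions. This is a minimal sketch using the natural logarithm; the helper names `kl_divergence` and `js_divergence` are illustrative, and the problem's expected values also involve histogram binning and smoothing, covered further below.

```python
import math

def kl_divergence(p, q):
    """KL(P || Q) for discrete distributions; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def js_divergence(p, q):
    """JSD(P || Q) = 0.5 * KL(P || M) + 0.5 * KL(Q || M), with M = 0.5 * (P + Q)."""
    m = [0.5 * (pi + qi) for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

# Identical distributions diverge by exactly 0; fully disjoint ones by ln(2).
print(js_divergence([0.5, 0.5], [0.5, 0.5]))  # 0.0
print(js_divergence([1.0, 0.0], [0.0, 1.0]))  # ln(2) ≈ 0.6931
```

Unlike KL divergence, this quantity is symmetric in P and Q and always finite, which is why it is a popular drift metric.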
drift_detected is True if the Jensen-Shannon divergence exceeds the significance threshold of 0.1, signaling that the prediction distributions have diverged meaningfully.

Implement a function that accepts two lists of prediction probabilities and a bin count, then returns a dictionary containing all four monitoring metrics.
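One way the full monitor could be put together is sketched below. The function name `monitor_predictions`, the [0, 1] binning range, add-one (Laplace) smoothing, the natural logarithm, and 4-decimal rounding are all assumptions; the exact convention used to produce the sample `js_divergence` values may differ.

```python
import math

def monitor_predictions(reference_preds, current_preds, n_bins):
    """Sketch of the four drift metrics (binning/smoothing conventions assumed)."""
    def mean(xs):
        return sum(xs) / len(xs)

    def std(xs):
        mu = mean(xs)
        # Population standard deviation, matching the sigma ≈ 0.1414 in Example 1.
        return math.sqrt(sum((x - mu) ** 2 for x in xs) / len(xs))

    mean_shift = mean(current_preds) - mean(reference_preds)
    std_ratio = std(current_preds) / std(reference_preds)

    def histogram(xs):
        """Bin over [0, 1] and apply Laplace (add-one) smoothing."""
        counts = [0] * n_bins
        for x in xs:
            idx = min(int(x * n_bins), n_bins - 1)  # clamp x == 1.0 into last bin
            counts[idx] += 1
        total = len(xs) + n_bins
        return [(c + 1) / total for c in counts]

    p, q = histogram(reference_preds), histogram(current_preds)
    m = [0.5 * (pi + qi) for pi, qi in zip(p, q)]
    kl = lambda a, b: sum(ai * math.log(ai / bi) for ai, bi in zip(a, b) if ai > 0)
    js = 0.5 * kl(p, m) + 0.5 * kl(q, m)

    return {
        'mean_shift': round(mean_shift, 4),
        'std_ratio': round(std_ratio, 4),
        'js_divergence': round(js, 4),
        'drift_detected': js > 0.1,
    }
```

With identical inputs this returns mean_shift 0.0, std_ratio 1.0, js_divergence 0.0, and drift_detected False, matching Example 3 below.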
reference_preds = [0.1, 0.2, 0.3, 0.4, 0.5]
current_preds = [0.5, 0.6, 0.7, 0.8, 0.9]
n_bins = 5

{'mean_shift': 0.4, 'std_ratio': 1.0, 'js_divergence': 0.0693, 'drift_detected': False}

The reference predictions have mean 0.3 and the current predictions have mean 0.7, yielding a mean_shift of 0.4 (a significant upward shift in prediction scores). Both distributions have identical spread (σ ≈ 0.1414), so std_ratio equals 1.0. The Jensen-Shannon divergence of 0.0693 (with Laplace smoothing applied) falls below the 0.1 threshold, so drift_detected is False. Despite the mean shift, the relatively low JSD indicates the distributions overlap sufficiently when considering the histogram representation with smoothing.
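The mean and spread figures quoted in this explanation can be checked with the standard library (population standard deviation, as the σ ≈ 0.1414 figure implies):

```python
from statistics import mean, pstdev

reference_preds = [0.1, 0.2, 0.3, 0.4, 0.5]
current_preds = [0.5, 0.6, 0.7, 0.8, 0.9]

print(round(mean(current_preds) - mean(reference_preds), 4))   # 0.4
print(round(pstdev(reference_preds), 4))                       # 0.1414
print(round(pstdev(current_preds) / pstdev(reference_preds), 4))  # 1.0
```

Shifting every score by a constant changes the mean but not the spread, which is why this example has a large mean_shift yet a std_ratio of exactly 1.0.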
reference_preds = [0.2, 0.3, 0.4, 0.5, 0.6]
current_preds = [0.25, 0.35, 0.45, 0.55, 0.65]
n_bins = 5

{'mean_shift': 0.05, 'std_ratio': 1.0, 'js_divergence': 0.0121, 'drift_detected': False}

The current predictions are shifted only slightly (by 0.05) compared to the reference. Both sets maintain the same standard deviation, producing std_ratio = 1.0. The very low JS divergence of 0.0121 confirms that the distributions are nearly identical—no drift concern is flagged.
reference_preds = [0.1, 0.3, 0.5, 0.7, 0.9]
current_preds = [0.1, 0.3, 0.5, 0.7, 0.9]
n_bins = 5

{'mean_shift': 0.0, 'std_ratio': 1.0, 'js_divergence': 0.0, 'drift_detected': False}

When the reference and current prediction sets are identical, all drift metrics reflect perfect alignment: zero mean shift, unit standard deviation ratio, and zero JS divergence. This represents the ideal baseline scenario where model behavior is completely stable.
Constraints