0/318

00:00:00

Description

Editorial

Data Normalization Techniques: Z-Score and Min-Max Scaling

EASY10 pts

Feature scaling is a critical preprocessing step in machine learning that transforms numerical features to a common scale without distorting differences in the ranges of values. Raw features often have vastly different magnitudes—for instance, age might range from 0-100 while income ranges from 0-1,000,000. This discrepancy can cause significant problems for many machine learning algorithms.

This problem focuses on implementing two of the most widely-used normalization techniques:

1. Standardization (Z-Score Normalization)

Standardization transforms each feature to have a mean of 0 and a standard deviation of 1. For each value in a feature column, the transformation is computed as:

$$z = \frac{x - \mu}{\sigma}$$

Where:

x is the original value
μ (mu) is the mean of the feature column
σ (sigma) is the standard deviation of the feature column

This technique is particularly useful when:

Your data follows a Gaussian distribution (or close to it)
You're using algorithms that assume normally distributed data
You need to handle outliers while preserving their information

2. Min-Max Normalization

Min-max normalization rescales each feature to a fixed range of [0, 1], where the minimum value maps to 0 and the maximum value maps to 1. The formula is:

$$x_{scaled} = \frac{x - x_{min}}{x_{max} - x_{min}}$$

Where:

x is the original value
x_min is the minimum value in the feature column
x_max is the maximum value in the feature column

This technique is preferred when:

You need bounded values within a specific range
Your algorithm is sensitive to the magnitude of values (e.g., neural networks with sigmoid activation)
The feature distribution is not necessarily Gaussian

Your Task

Write a Python function that takes a 2D NumPy array representing a dataset (where each row is a data sample and each column represents a feature). Apply both scaling techniques column-wise and return the results as two separate 2D arrays. All values should be rounded to 4 decimal places.

Important: Apply scaling independently to each feature column, computing statistics (mean, std, min, max) for each column separately.

Example

Input

data = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]

Output

[[[-1.2247, -1.2247], [0.0, 0.0], [1.2247, 1.2247]], [[0.0, 0.0], [0.5, 0.5], [1.0, 1.0]]]

Explanation

For Column 1 (values: 1, 3, 5): • Mean = 3, Std = 1.633 • Standardization: (1-3)/1.633 = -1.2247, (3-3)/1.633 = 0, (5-3)/1.633 = 1.2247 • Min = 1, Max = 5 • Min-Max: (1-1)/(5-1) = 0, (3-1)/(5-1) = 0.5, (5-1)/(5-1) = 1

For Column 2 (values: 2, 4, 6): • Mean = 4, Std = 1.633 • Standardization: (2-4)/1.633 = -1.2247, (4-4)/1.633 = 0, (6-4)/1.633 = 1.2247 • Min = 2, Max = 6 • Min-Max: (2-2)/(6-2) = 0, (4-2)/(6-2) = 0.5, (6-2)/(6-2) = 1

The first output array shows standardized values, the second shows min-max scaled values.

Example

Input

data = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]

Output

[[[-1.2247, -1.2247, -1.2247], [0.0, 0.0, 0.0], [1.2247, 1.2247, 1.2247]], [[0.0, 0.0, 0.0], [0.5, 0.5, 0.5], [1.0, 1.0, 1.0]]]

Explanation

Each column follows a uniform arithmetic progression with equal spacing. The standardization produces z-scores of approximately -1.2247, 0, and 1.2247 for each column. The min-max normalization maps the three evenly-spaced values to 0, 0.5, and 1 respectively.

Example

Input

data = [[-1.0, 0.0, 1.0], [2.0, 3.0, 4.0]]

Output

[[[-1.0, -1.0, -1.0], [1.0, 1.0, 1.0]], [[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]]]

Explanation

With only two samples per column, the standardization produces z-scores of exactly -1 and 1 (one standard deviation below and above the mean). The min-max normalization maps the smaller value to 0 and the larger value to 1 for each feature column.

Accepted0/0·0% Acceptance

Constraints

2 ≤ number of rows (samples) ≤ 1000
1 ≤ number of columns (features) ≤ 100
-10⁶ ≤ data[i][j] ≤ 10⁶
Each column will have at least 2 distinct values (no constant features)
All values in the input array are floating-point numbers
Results must be rounded to exactly 4 decimal places

Code

Visualizer

Solutions

14px

Test Cases3

Results

Submissions

data =

[[1,2,3],[4,5,6],[7,8,9]]

Data Normalization Techniques: Z-Score and Min-Max Scaling

1. Standardization (Z-Score Normalization)

2. Min-Max Normalization

Your Task

Hints

Data Normalization Techniques: Z-Score and Min-Max Scaling

1. Standardization (Z-Score Normalization)

2. Min-Max Normalization

Your Task

Hints