The 2D convolution operation is one of the most fundamental building blocks in computer vision and deep learning. It enables neural networks to detect spatial patterns, edges, textures, and complex features within images and other grid-structured data.
In a 2D convolution, a small filter (called a kernel) slides across the input matrix, computing element-wise products and summing them at each position. This operation transforms the input into a new representation called a feature map, which highlights specific patterns based on the kernel's values.
The Convolution Process:
Given an input matrix I of dimensions H × W, a kernel K of dimensions kH × kW, padding p, and stride s, the output feature map O is computed as follows:
Padding: First, surround the input matrix with p layers of zeros on all sides, creating a padded input of dimensions (H + 2p) × (W + 2p)
Sliding Window: Starting from the top-left corner, place the kernel over the padded input. At each position (i, j), compute:
$$O_{i,j} = \sum_{m=0}^{kH-1} \sum_{n=0}^{kW-1} K_{m,n} \cdot I_{padded}[i \cdot s + m, j \cdot s + n]$$
Output Dimensions:
The output feature map has dimensions:

$$H_{out} = \left\lfloor \frac{H + 2p - kH}{s} \right\rfloor + 1, \qquad W_{out} = \left\lfloor \frac{W + 2p - kW}{s} \right\rfloor + 1$$
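The dimension formula can be checked directly in a few lines (the helper name `conv_output_dims` is an assumption, not part of the required interface):

```python
def conv_output_dims(H, W, kH, kW, p, s):
    # Integer (floor) division mirrors the floor in the dimension formula.
    return ((H + 2 * p - kH) // s + 1,
            (W + 2 * p - kW) // s + 1)

# The three sample configurations used in this problem:
print(conv_output_dims(4, 4, 2, 2, 1, 2))  # (3, 3)
print(conv_output_dims(3, 3, 2, 2, 0, 1))  # (2, 2)
print(conv_output_dims(4, 4, 3, 3, 1, 1))  # (4, 4)
```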
Your Task: Implement a Python function that performs a 2D convolution operation. The function should properly apply zero-padding, slide the kernel with the specified stride, and compute the element-wise products and sums to produce the output feature map.
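One plain-Python sketch of such a function follows (the name `conv2d` and the list-of-lists input/output format are assumptions about the expected interface):

```python
def conv2d(input_matrix, kernel, padding, stride):
    """2D cross-correlation with zero-padding and stride, per the formula above."""
    H, W = len(input_matrix), len(input_matrix[0])
    kH, kW = len(kernel), len(kernel[0])

    # Step 1: surround the input with `padding` layers of zeros on all sides.
    pH, pW = H + 2 * padding, W + 2 * padding
    padded = [[0.0] * pW for _ in range(pH)]
    for i in range(H):
        for j in range(W):
            padded[i + padding][j + padding] = input_matrix[i][j]

    # Step 2: slide the kernel with the given stride and accumulate
    # element-wise products at each position.
    out_h = (pH - kH) // stride + 1
    out_w = (pW - kW) // stride + 1
    output = [[0.0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            acc = 0.0
            for m in range(kH):
                for n in range(kW):
                    acc += kernel[m][n] * padded[i * stride + m][j * stride + n]
            output[i][j] = acc
    return output
```

Note that, like the formula above, this slides the kernel without flipping it (cross-correlation), which is the convention used by most deep learning frameworks.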
input_matrix = [[1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0], [9.0, 10.0, 11.0, 12.0], [13.0, 14.0, 15.0, 16.0]]
kernel = [[1.0, 0.0], [-1.0, 1.0]]
padding = 1
stride = 2

Output: [[1.0, 1.0, -4.0], [9.0, 7.0, -4.0], [0.0, 14.0, 16.0]]

After adding 1 layer of zero-padding, the input becomes 6×6. The 2×2 kernel slides with stride 2, producing a 3×3 output.
• Position (0,0): Covers the padded top-left corner and the first input element → (1×0) + (0×0) + (-1×0) + (1×1) = 1.0
• Position (0,1): With stride 2, the window covers padded columns 2–3 → (1×0) + (0×0) + (-1×2) + (1×3) = 1.0; continuing, position (0,2) yields (1×0) + (0×0) + (-1×4) + (1×0) = -4.0 at the right edge
• The kernel detects diagonal patterns, with the -1 capturing differences between adjacent rows.
The resulting 3×3 feature map captures spatial relationships at half the resolution due to stride 2.
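This example can also be reproduced vectorized in NumPy (an alternative sketch, not the required plain-Python solution; `sliding_window_view` requires NumPy ≥ 1.20):

```python
import numpy as np

I = np.arange(1.0, 17.0).reshape(4, 4)       # the 4x4 sample input
K = np.array([[1.0, 0.0], [-1.0, 1.0]])

padded = np.pad(I, 1)                        # 6x6 zero-padded input
# All 2x2 windows, then keep every second one in each axis (stride 2).
windows = np.lib.stride_tricks.sliding_window_view(padded, K.shape)[::2, ::2]
out = np.einsum('ijmn,mn->ij', windows, K)   # dot each window with the kernel
print(out)                                   # 3x3 feature map from the example
```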
input_matrix = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
kernel = [[1.0, 0.0], [0.0, 1.0]]
padding = 0
stride = 1

Output: [[6.0, 8.0], [12.0, 14.0]]

With no padding and stride 1, the 2×2 kernel slides across the 3×3 input, producing a 2×2 output.
• Position (0,0): (1×1) + (0×2) + (0×4) + (1×5) = 1 + 5 = 6.0
• Position (0,1): (1×2) + (0×3) + (0×5) + (1×6) = 2 + 6 = 8.0
• Position (1,0): (1×4) + (0×5) + (0×7) + (1×8) = 4 + 8 = 12.0
• Position (1,1): (1×5) + (0×6) + (0×8) + (1×9) = 5 + 9 = 14.0
This diagonal kernel extracts the sum of diagonal elements, useful for detecting diagonal features.
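Because there is no padding and the stride is 1, this case reduces to a compact nested comprehension (a minimal sketch for this example only, hard-coding the 2×2 kernel and 2×2 output):

```python
I = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0], [7.0, 8.0, 9.0]]
K = [[1.0, 0.0], [0.0, 1.0]]

# Valid (no-padding), stride-1 cross-correlation over the 3x3 input.
out = [[sum(K[m][n] * I[i + m][j + n] for m in range(2) for n in range(2))
        for j in range(2)] for i in range(2)]
print(out)  # [[6.0, 8.0], [12.0, 14.0]]
```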
input_matrix = [[1.0, 2.0, 3.0, 4.0], [5.0, 6.0, 7.0, 8.0], [9.0, 10.0, 11.0, 12.0], [13.0, 14.0, 15.0, 16.0]]
kernel = [[0.0, 1.0, 0.0], [1.0, -4.0, 1.0], [0.0, 1.0, 0.0]]
padding = 1
stride = 1

Output: [[3.0, 2.0, 1.0, -5.0], [-4.0, 0.0, 0.0, -9.0], [-8.0, 0.0, 0.0, -13.0], [-29.0, -18.0, -19.0, -37.0]]

This kernel is the Laplacian filter, a classic edge detection operator. It approximates the second derivative of the image, highlighting areas of rapid intensity change.
• For interior points (like position (1,1)), the kernel sums the four neighbors and subtracts 4× the center: (2 + 5 + 7 + 10) − 4×6 = 0
• Edge and corner positions show non-zero values due to the zero-padding boundary effects
• The large negative values in the last row reflect the sharp transition from large pixel values to the zero border at the bottom of the image
With padding=1 and stride=1, the output maintains the same 4×4 dimensions as the input, typical for 'same' padding convolutions.
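For an odd kernel size k and stride 1, the padding that preserves the input dimensions is p = (k − 1) / 2, which can be sanity-checked against the dimension formula (the helper name `same_padding` is illustrative):

```python
def same_padding(kernel_size):
    # For odd kernel_size and stride 1: (H + 2p - k) + 1 == H when p == (k-1)//2.
    return (kernel_size - 1) // 2

H = 4
for k in (3, 5, 7):
    p = same_padding(k)
    out_h = (H + 2 * p - k) + 1  # stride 1, so no floor division needed
    print(k, p, out_h)           # output height stays 4 for every odd k
```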
Constraints