Memory and ILP handling in 2D convolutions

Memory and ILP handling in 2D convolutions involves understanding convolution as an integral expression of signal overlap. In image processing, the continuous convolution integral is converted into a function in a two-dimensional spatial domain. Due to limited resources, images are stored digitally as tensors with discrete values. Filters are used to extract features, with trainable parameters represented by arrays of floats. The code implementation uses SIMD vector instructions instead of x87 precision ones for efficiency. Multiple stack variables are utilized for storing addresses in memory. The program optimizes by filling registers with broadcasted filter values, minimizing load instructions from memory. The process results in parallel computation of convolutions.

https://riemani.ca/conv

To top