Posts Tagged ‘SIMD Performance’

SSE Intrinsics Tutorial

UPDATE: For those interested, I’ve created a full-on assembly/SSE version here.

SSE SIMD Programming is a fascinating subject, but also one that can be a bit difficult to approach. In this post I’m going to create a SIMD version on my RGB->CMYK algorithm, and in the process, show a bunch of handy tricks for working with SIMD.

This post deals with some of the problems and challenges we face when implementing SIMD code, paying close attention to intrinsics, basic SIMD code setup, and buffer type conversion.

Read More