Computing Adler32 Checksums at 41 GB/s (
from to on 30 Apr 04:32

While looking through the fpng source code, I noticed that its vectorized adler32 implementation seemed somewhat complicated, especially given how simple the scalar version of adler32 is. I was curious to see if I could come up with a simpler method, and in doing so, I came up with an algorithm that can be up to 7x faster than fpng’s version, and 109x faster than the simple scalar version.

#c #perf #programming

#c #inks #perf #programming

threaded - newest