Aras Pranckevičius @aras

**thomastc | frozenfractal** @thomastc · Mar 18

Hmm, generating good hashes for use in simplex noise is surprisingly expensive. Has anyone ever tried using the CRC32 instructions that come with SSE4.2 for this? They only take 1 clock cycle, much faster than anything I've come up with.
#ProcGen #assembly
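For reference, here is a minimal sketch of what a CRC32-based lattice hash could look like in Rust. It is not the code from this thread: the names `crc_hash`, `crc32c_u32` and `crc32c_u32_soft` are made up for illustration, and the seed is narrowed to 32 bits to keep it short. It uses the `_mm_crc32_u32` intrinsic when SSE4.2 is detected at runtime and a bit-by-bit software CRC32C otherwise.

```rust
/// Software CRC32C (Castagnoli) update for one 32-bit word, one bit at a time.
/// Uses the reflected polynomial 0x82F63B78 with no initial or final inversion,
/// which is the same raw accumulate step the SSE4.2 `crc32` instruction performs.
pub fn crc32c_u32_soft(mut crc: u32, value: u32) -> u32 {
    crc ^= value;
    for _ in 0..32 {
        // All ones if the lowest bit is set, all zeros otherwise.
        let mask = (crc & 1).wrapping_neg();
        crc = (crc >> 1) ^ (0x82F6_3B78 & mask);
    }
    crc
}

/// Hardware path, only compiled on x86_64.
#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "sse4.2")]
unsafe fn crc32c_u32_hw(crc: u32, value: u32) -> u32 {
    std::arch::x86_64::_mm_crc32_u32(crc, value)
}

/// Dispatches to the instruction when available, else to the software version.
#[cfg(target_arch = "x86_64")]
pub fn crc32c_u32(crc: u32, value: u32) -> u32 {
    if std::arch::is_x86_feature_detected!("sse4.2") {
        // SAFETY: the runtime check above guarantees SSE4.2 is present.
        unsafe { crc32c_u32_hw(crc, value) }
    } else {
        crc32c_u32_soft(crc, value)
    }
}

#[cfg(not(target_arch = "x86_64"))]
pub fn crc32c_u32(crc: u32, value: u32) -> u32 {
    crc32c_u32_soft(crc, value)
}

/// Hypothetical lattice hash: fold x and y into a CRC seeded with `seed`.
pub fn crc_hash(x: i32, y: i32, seed: u32) -> u32 {
    let h = crc32c_u32(seed, x as u32);
    crc32c_u32(h, y as u32)
}
```

One property worth knowing before relying on this: with a zero seed the CRC step is linear over GF(2), so `crc(a ^ b) == crc(a) ^ crc(b)`. Structured inputs such as grid coordinates can therefore produce structured outputs, which may be one reason a hash like this shows visible patterns in noise.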

**thomastc | frozenfractal** @thomastc · Mar 18 *

I tried. It's bad. It *is* faster than permutation tables though.

Bottom center panel is my attempt.

A grid of 3x3 images of simplex noise. The one at the bottom center has a repeating pattern.

**thomastc | frozenfractal** @thomastc · Mar 18

AES instructions don't fare any better.

A similar image as before, but with a different repeating pattern.

**thomastc | frozenfractal** @thomastc · Mar 18

I guess I should bring out a profiler and actually see where the bottlenecks are.

Meanwhile, if someone knows a fast hash that takes 128 bits as input (x: i32, y: i32, seed: u64), works in AVX `__m256i` registers and has good entropy in the lower bits, I'm all ears.
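To make the requested signature concrete, here is a scalar sketch of one possible candidate. It is a swapped-in technique, not something proposed in the thread: it chains a 32-bit multiply-xorshift mixer using the "lowbias32" constants published by Chris Wellons. The function names and the seed-folding order are assumptions made for illustration, and its quality as a noise hash is not verified here. It uses only 32-bit xor, shift and multiply, each of which has an 8-lane counterpart on `__m256i` (`_mm256_xor_si256`, `_mm256_srli_epi32`, `_mm256_mullo_epi32`), though those integer operations require AVX2 rather than plain AVX.

```rust
/// 32-bit integer mixer using the "lowbias32" constants.
/// Every step is invertible, so the whole function is a bijection on u32.
pub fn mix32(mut h: u32) -> u32 {
    h ^= h >> 16;
    h = h.wrapping_mul(0x7feb_352d);
    h ^= h >> 15;
    h = h.wrapping_mul(0x846c_a68b);
    h ^= h >> 16;
    h
}

/// Hypothetical hash over (x, y, seed), 128 bits in and 32 bits out.
/// The two seed rounds do not depend on x or y, so they can be computed
/// once per noise call and reused for every lattice point.
pub fn hash_xy_seed(x: i32, y: i32, seed: u64) -> u32 {
    // The xor constant keeps an all-zero seed from starting at a zero state.
    let mut h = mix32((seed as u32) ^ 0x9e37_79b9);
    h = mix32(h ^ (seed >> 32) as u32);
    h = mix32(h ^ x as u32);
    h = mix32(h ^ y as u32);
    h
}
```

Because the last operation is an xorshift after a multiply, the low bits depend on the high bits of the product, which is the property asked for above. Whether that is good enough in practice would have to be checked by rendering the noise.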

thomastc | frozenfractal @thomastc@mastodon.gamedev.place

#[inline] ALL THE THINGS!

From 82 ms to 34 ms, that's an insane 2.4x speedup for such a trivial change. That'll teach me to trust the compiler to optimize properly.

My implementation now beats most of the others, with the exception of `fastnoise2` (C++) and `simdnoise` (pure Rust). But I have something they don't: tiling.

#RustLang #GameDev

Mar 18, 2025, 04:00 PM

1 boost · 7 favorites

**Mark IJbema** @mark@tacobelllabs.net · Mar 18

@thomastc wouldn't inlining always speed everything up, but result in a larger executable?

**thomastc | frozenfractal** @thomastc · Mar 18

@mark No, because it can cause instruction cache thrashing.

Honestly I was surprised that these functions weren't all inlined automatically by the compiler, because most of them consist of just one call to a compiler intrinsic.

https://doc.rust-lang.org/stable/reference/attributes/codegen.html#the-inline-attribute

doc.rust-lang.org: Code generation - The Rust Reference
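As an illustration of the kind of function being discussed, here is a minimal sketch of a thin wrapper type whose methods are each a single small operation. The type `F32x4` and its methods are hypothetical, and plain arrays stand in for real SIMD intrinsics so the example runs on any target. The Rust Reference describes `#[inline]` as a hint rather than a guarantee. One situation where it commonly matters is a non-generic function called from another crate, although the thread does not say whether that was the case here.

```rust
/// Hypothetical 4-lane float wrapper. In real SIMD code each method body
/// would typically be one call to a compiler intrinsic.
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct F32x4([f32; 4]);

impl F32x4 {
    /// Broadcast one value to all lanes.
    #[inline]
    pub fn splat(v: f32) -> Self {
        F32x4([v; 4])
    }

    /// Lane-wise addition.
    #[inline]
    pub fn add(self, other: Self) -> Self {
        let mut out = [0.0f32; 4];
        for i in 0..4 {
            out[i] = self.0[i] + other.0[i];
        }
        F32x4(out)
    }

    /// Lane-wise multiplication.
    #[inline]
    pub fn mul(self, other: Self) -> Self {
        let mut out = [0.0f32; 4];
        for i in 0..4 {
            out[i] = self.0[i] * other.0[i];
        }
        F32x4(out)
    }

    /// Copy the lanes out as a plain array.
    #[inline]
    pub fn to_array(self) -> [f32; 4] {
        self.0
    }
}
```

A call such as `F32x4::splat(1.5).add(F32x4::splat(2.0))` costs a function call per operation if the wrappers are not inlined, which can dominate when the body is a single instruction. As noted in the reply above, inlining larger functions everywhere is not free, so measuring with a profiler or benchmark remains the safer way to decide.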
