NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Floor and Ceil versus Denormals on CPU and GPU (asawicki.info)
crote 57 minutes ago [-]
Another thing to keep in mind is that CPU processing of denormals tends to be extremely slow - I vaguely recall running into something like a 10x slowdown a decade ago.

For a lot of applications the difference between a denormal and zero is small enough to be irrelevant, so if you expect near-zero values to be common, enabling a denormals-to-zero compiler flag might give you a pretty nice performance boost for free.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 13:03:42 GMT+0000 (Coordinated Universal Time) with Vercel.