v0.9.1 (patch release)

@mariogeiger mariogeiger released this 16 Mar 17:25
· 17 commits to main since this release

Bug fixes

  • [Torch/JAX] Fixed a rare integer overflow in the uniform 1d kernel that occurred when indices are large. Index arithmetic is now done in 64-bit instead of 32-bit.
  • [Torch/JAX] Disabled parallel compilation of uniform 1d kernels by default (CUEQUIVARIANCE_OPS_PARALLEL_COMPILE now defaults to 0). Parallel compilation caused issues in multi-GPU setups. It will be re-enabled by default in a future release once the underlying bugs are resolved.
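
To illustrate the kind of overflow the first fix addresses, here is a minimal sketch in plain Python. It does not reproduce the actual kernel code: the `row * stride + col` index formula and the names `flat_index_i32` and `flat_index_i64` are hypothetical, chosen only to show how a flat index can wrap around in 32-bit arithmetic and stay exact in 64-bit arithmetic.

```python
import ctypes


def flat_index_i32(row: int, stride: int, col: int) -> int:
    """Flat index truncated to a signed 32-bit integer (hypothetical formula)."""
    # ctypes integer types do no overflow checking, so the value wraps
    # around the same way a signed 32-bit register would.
    return ctypes.c_int32(row * stride + col).value


def flat_index_i64(row: int, stride: int, col: int) -> int:
    """Same flat index, held in a signed 64-bit integer."""
    return ctypes.c_int64(row * stride + col).value


# Illustrative sizes: 70_000 * 40_000 + 5 = 2_800_000_005,
# which is larger than the int32 maximum of 2_147_483_647.
row, stride, col = 70_000, 40_000, 5

wrapped = flat_index_i32(row, stride, col)
exact = flat_index_i64(row, stride, col)

print("32-bit index:", wrapped)  # negative: the value wrapped around
print("64-bit index:", exact)    # the intended index
```

A wrapped index like this would point outside the intended buffer, which is why such bugs only show up once inputs are large enough for the product to exceed the 32-bit range.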