GPU new features

V100 new features

  • Volta SIMT model: __syncwarp() to force reconvergence
  • Cooperative Groups
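
The two V100 items above can be illustrated with a small sketch. It sums 32 integers within one warp in two ways: once through shared memory with __syncwarp() between rounds, and once with a Cooperative Groups tile and shfl_down. The kernel names, the helper run_warp_sums, and the input data are made up for illustration; error checking is omitted for brevity, and the code assumes a Volta-or-later device and a launch of exactly 32 threads.

```cuda
#include <cstdio>
#include <cuda_runtime.h>
#include <cooperative_groups.h>

namespace cg = cooperative_groups;

// Warp sum through shared memory. With Volta's independent thread scheduling,
// lanes of a warp are not guaranteed to run in lockstep, so the warp is
// explicitly synchronized between rounds.
__global__ void warp_sum_syncwarp(const int *in, int *out) {
    __shared__ int buf[32];
    int lane = threadIdx.x;              // assumes a launch of exactly 32 threads
    buf[lane] = in[lane];
    __syncwarp();                        // all 32 values stored before any read
    for (int offset = 16; offset > 0; offset /= 2) {
        if (lane < offset) buf[lane] += buf[lane + offset];
        __syncwarp();                    // this round's writes visible to the next round
    }
    if (lane == 0) *out = buf[0];
}

// The same reduction with a Cooperative Groups tile of 32 threads.
__global__ void warp_sum_cg(const int *in, int *out) {
    cg::thread_block block = cg::this_thread_block();
    cg::thread_block_tile<32> warp = cg::tiled_partition<32>(block);
    int v = in[block.thread_rank()];
    for (int offset = 16; offset > 0; offset /= 2)
        v += warp.shfl_down(v, offset);  // tile-wide shuffle, synchronizes the tile
    if (warp.thread_rank() == 0) *out = v;
}

// Hypothetical host helper: runs both kernels on the values 1..32.
void run_warp_sums(int result[2]) {
    int h_in[32];
    for (int i = 0; i < 32; ++i) h_in[i] = i + 1;   // 1 + 2 + ... + 32 = 528
    int *d_in = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, sizeof(h_in));
    cudaMalloc(&d_out, 2 * sizeof(int));
    cudaMemcpy(d_in, h_in, sizeof(h_in), cudaMemcpyHostToDevice);
    warp_sum_syncwarp<<<1, 32>>>(d_in, d_out);
    warp_sum_cg<<<1, 32>>>(d_in, d_out + 1);
    cudaMemcpy(result, d_out, 2 * sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
}

int main() {
    int result[2] = {0, 0};
    run_warp_sums(result);
    printf("syncwarp: %d, cooperative groups: %d\n", result[0], result[1]);
    return 0;
}
```

The __syncwarp() call sits outside the `if (lane < offset)` branch so that all 32 lanes reach it with the default full mask. The Cooperative Groups version needs no shared memory because shfl_down exchanges registers directly within the tile.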

A100 new features

  • Asynchronous copy

  • Asynchronous barrier

  • Task graph acceleration

  • 2:4 structured sparsity
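
As one example from the A100 list, asynchronous copy can be sketched with the Cooperative Groups memcpy_async API (CUDA 11 or later). Each block stages a tile of global memory in shared memory, waits for the copy, and then uses it. The tile size, the kernel name scale_tile, the helper run_scale_check, and the doubling operation are assumptions chosen for illustration; error checking is omitted. On devices older than A100 the same code should still compile and run, but the copy is not hardware-accelerated.

```cuda
#include <cstdio>
#include <cuda_runtime.h>
#include <cooperative_groups.h>
#include <cooperative_groups/memcpy_async.h>

namespace cg = cooperative_groups;

constexpr int TILE = 256;      // hypothetical tile size, one element per thread
constexpr int NUM_TILES = 4;   // hypothetical problem size: 4 tiles

// Each block copies its tile of `in` into shared memory asynchronously,
// waits for completion, then writes the doubled values to `out`.
__global__ void scale_tile(const float *in, float *out) {
    __shared__ float tile[TILE];
    cg::thread_block block = cg::this_thread_block();
    const float *src = in + blockIdx.x * TILE;

    // Collective call: every thread of the block participates in issuing the copy.
    cg::memcpy_async(block, tile, src, sizeof(float) * TILE);

    // Work that does not touch `tile` could overlap with the copy here.

    cg::wait(block);           // copy finished and visible to the whole block

    out[blockIdx.x * TILE + threadIdx.x] = 2.0f * tile[threadIdx.x];
}

// Hypothetical host helper: returns the number of mismatching output elements.
int run_scale_check() {
    const int n = TILE * NUM_TILES;
    static float h_in[TILE * NUM_TILES], h_out[TILE * NUM_TILES];
    for (int i = 0; i < n; ++i) { h_in[i] = static_cast<float>(i); h_out[i] = -1.0f; }

    float *d_in = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, n * sizeof(float));
    cudaMalloc(&d_out, n * sizeof(float));
    cudaMemcpy(d_in, h_in, n * sizeof(float), cudaMemcpyHostToDevice);
    scale_tile<<<NUM_TILES, TILE>>>(d_in, d_out);
    cudaMemcpy(h_out, d_out, n * sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);

    int mismatches = 0;
    for (int i = 0; i < n; ++i)
        if (h_out[i] != 2.0f * h_in[i]) ++mismatches;
    return mismatches;
}

int main() {
    printf("mismatches: %d\n", run_scale_check());
    return 0;
}
```

The asynchronous barrier listed above offers a lower-level way to wait on such copies; this sketch uses cg::wait only because it keeps the example short.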