diff options
author | AMD Toolchain Support <73240730+amd-toolchain-support@users.noreply.github.com> | 2023-01-16 16:35:31 +0530 |
---|---|---|
committer | GitHub <noreply@github.com> | 2023-01-16 12:05:31 +0100 |
commit | b9048dbee994c5943c0b59945612bc30e3e73479 (patch) | |
tree | 38b91504ea70d7f15e993cae6178a7e115aa19ac /.github/ISSUE_TEMPLATE | |
parent | 55f71e41d574797ba2d79a30f31aae01c757d13d (diff) | |
download | spack-b9048dbee994c5943c0b59945612bc30e3e73479.tar.gz spack-b9048dbee994c5943c0b59945612bc30e3e73479.tar.bz2 spack-b9048dbee994c5943c0b59945612bc30e3e73479.tar.xz spack-b9048dbee994c5943c0b59945612bc30e3e73479.zip |
AMD Optimized CPU Libraries: add v4.0 (#34681)
What's in AOCL 4.0:
1. amdblis
LPGEMM variants with post-ops support
AMD "Zen4" support for BLIS
2. amdlibflame
Upgrade to LAPACK 3.10.1 specification
Improvements in a few more variants of SVD and Eigen Value routines
Multithread support enabled for selected APIs
3. amdfftw
AVX-512 enablement of DFT kernels
AVX-512 optimization of copy and transpose routines
5. amdlibm
Black & Scholes support (logf, expf, erff, both scalar and vector)
AVX-512 variants of vector functions
6. aocl-sparse
New Iterative Solver APIs
AVX-512 support for SPMV API
7. amdscalapack
Upgrade to Netlib ScaLAPACK 2.2.0
Co-authored-by: Massimiliano Culpo <massimiliano.culpo@gmail.com>
Diffstat (limited to '.github/ISSUE_TEMPLATE')
0 files changed, 0 insertions, 0 deletions