Skip to content
#

blas

Here are 70 public repositories matching this topic...

SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐

  • Updated Nov 4, 2024
  • C

The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor's hardwares/OS. Out-of-the-box easy as MSVC, MinGW, Linux(CentOS) x86_64 binary provided. Ở bất đồng Ma trận lớn nhỏ / phần cứng / thao tác hệ thống hạ tương đối mấy cái BLAS kho sgemm hàm số tính năng, cung cấp binary, khai hộp tức dùng.

  • Updated Mar 28, 2019
  • C

Improve this page

Add a description, image, and links to the blas topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the blas topic, visit your repo's landing page and select "manage topics."

Learn more