blas
Here are 70 public repositories matching this topic...
BLAS-like Library Instantiation Software Framework
-
Updated
Nov 4, 2024 - C
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
-
Updated
Nov 4, 2024 - C
🤖 A portable, header-only, artificial neural network library written in C99
-
Updated
Oct 29, 2023 - C
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
-
Updated
Sep 12, 2024 - C
libdspl-2.0 is opensource cross-platform digital signal processing algorithm library, written in C language.
-
Updated
Jul 14, 2024 - C
Using PLT trampolines to provide a BLAS and LAPACK demuxing library.
-
Updated
Oct 10, 2024 - C
The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Intel MKL(CPU) and cuBLAS(CUDA) on different matrix sizes/vendor's hardwares/OS. Out-of-the-box easy as MSVC, MinGW, Linux(CentOS) x86_64 binary provided. 在不同矩阵大小/硬件/操作系统下比较几个BLAS库的sgemm函数性能,提供binary,开盒即用。
-
Updated
Mar 28, 2019 - C
A library for Erlang/OTP (and Elixir language) of numerical routines on single and double-precision real and complex number vectors and matrices
-
Updated
Jun 7, 2017 - C
C computation graph, AutoGrad with OpenCL support [WIP]
-
Updated
Feb 26, 2019 - C
A small, header-only, fast C shared library with ML/nonparametrix algorithms for researchers and developers
-
Updated
Oct 22, 2018 - C
Examples of using OpenMP offload with dgemm in the target region
-
Updated
May 26, 2020 - C
phiGEMM: CPU-GPU hybrid matrix-matrix multiplication library
-
Updated
Oct 26, 2014 - C
Improve this page
Add a description, image, and links to the blas topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the blas topic, visit your repo's landing page and select "manage topics."