news 2026/6/24 6:14:18

CANN/catlass GEMM Block API Documentation


张小明

Front-end Development Engineer


Gemm/Block Class Template Overview

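catlass is the operator template library of CANN. It provides template samples for high-performance matrix multiplication and related fused operators on the NPU. Project URL: https://gitcode.com/cann/catlass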

API List

blockMmad List

| Component | Description |
| --- | --- |
| block_mmad | Basic template, including BlockMmad. |
| block_mmad_pingpong | Partial specialization of BlockMmad implementing ping-pong matrix multiplication. |

Swizzle List

| Component | Description |
| --- | --- |
| block_swizzle | Basic swizzle methods. |
| GemmIdentityBlockSwizzle | Basic swizzle policy for the GEMM operator. |

API Breakdown

blockMmad

The blockMmad structure encapsulates the MMAD computation at the Block layer, mapping directly to execution on a single AI Core of the Ascend NPU. Through template parameters, it receives configuration details defining the matrix shapes, tensor layouts (such as row-major or column-major), and data types (DType).
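To make the idea of configuration through template parameters concrete, here is a minimal host-side sketch. Every name in it (`TileShape`, `OperandType`, `RowMajor`, `ColumnMajor`, `ToyBlockMmad`) is a hypothetical stand-in chosen for illustration and is not taken from the catlass headers. It only shows how shape, layout, and data type can be fixed at compile time and then queried by the block-level code.

```cpp
#include <cstddef>
#include <cstdint>
#include <type_traits>

// Hypothetical layout tags (assumption: not the catlass layout types).
struct RowMajor {};
struct ColumnMajor {};

// Hypothetical compile-time tile shape for one block task (M x N x K).
template <std::size_t M_, std::size_t N_, std::size_t K_>
struct TileShape {
    static constexpr std::size_t M = M_;
    static constexpr std::size_t N = N_;
    static constexpr std::size_t K = K_;
};

// Hypothetical pairing of an element type (DType) with a layout for one operand.
template <class Element_, class Layout_>
struct OperandType {
    using Element = Element_;
    using Layout = Layout_;
};

// Illustrative block-level template: all configuration arrives as template parameters,
// so buffer sizes and layout decisions are known at compile time.
template <class Shape, class AType, class BType, class CType>
struct ToyBlockMmad {
    using ElementA = typename AType::Element;
    using ElementB = typename BType::Element;
    using ElementC = typename CType::Element;

    // Bytes needed to hold one tile of each operand.
    static constexpr std::size_t kBytesA = Shape::M * Shape::K * sizeof(ElementA);
    static constexpr std::size_t kBytesB = Shape::K * Shape::N * sizeof(ElementB);
    static constexpr std::size_t kBytesC = Shape::M * Shape::N * sizeof(ElementC);

    // Layout queries resolved at compile time.
    static constexpr bool kAIsRowMajor =
        std::is_same<typename AType::Layout, RowMajor>::value;
    static constexpr bool kBIsRowMajor =
        std::is_same<typename BType::Layout, RowMajor>::value;
};

// Example instantiation: int8 inputs, int32 accumulator, 128 x 256 x 64 tile.
using ExampleMmad = ToyBlockMmad<
    TileShape<128, 256, 64>,
    OperandType<std::int8_t, RowMajor>,
    OperandType<std::int8_t, ColumnMajor>,
    OperandType<std::int32_t, RowMajor>>;

static_assert(ExampleMmad::kAIsRowMajor, "A is expected to be row-major here");
static_assert(!ExampleMmad::kBIsRowMajor, "B is expected to be column-major here");
```

Because every quantity is a compile-time constant, a different shape or data type produces a distinct instantiation rather than a runtime branch, which is the usual motivation for this style of template library.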

The namespace is Catlass::Gemm::Block. Core members:

| Type | Name | Function |
| --- | --- | --- |
| Constructor | BlockMmad() | Initializes buffers, registers event IDs, and inserts setFlag primitives for pipeline synchronization. |
| Destructor | ~BlockMmad() | Inserts waitFlag primitives for pipeline synchronization. |
| Function | void operator() | Executes the matrix multiplication for a Block task. |
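The constructor/destructor pairing described above follows an RAII pattern: the synchronization that the constructor opens is closed by the destructor. The sketch below mimics that lifecycle on the host only. `EventLog`, `ToyBlockMmad`, and `RunOneBlockTask` are hypothetical names, the strings "setFlag" and "waitFlag" are merely recorded in a log rather than issued as real primitives, and the naive triple loop stands in for the hardware MMAD computation. It is an illustration under those assumptions, not the catlass implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical event log standing in for hardware synchronization primitives.
struct EventLog {
    std::vector<std::string> events;
};

// Illustrative block object: the constructor records a "setFlag", the destructor
// records the matching "waitFlag", and operator() performs one block task.
class ToyBlockMmad {
public:
    explicit ToyBlockMmad(EventLog& log) : log_(log) {
        log_.events.push_back("setFlag");
    }

    ~ToyBlockMmad() {
        log_.events.push_back("waitFlag");
    }

    // Non-copyable, so each construction is matched by exactly one destruction.
    ToyBlockMmad(const ToyBlockMmad&) = delete;
    ToyBlockMmad& operator=(const ToyBlockMmad&) = delete;

    // Computes C += A * B for row-major A (m x k), B (k x n), and C (m x n).
    void operator()(const std::vector<float>& a, const std::vector<float>& b,
                    std::vector<float>& c,
                    std::size_t m, std::size_t n, std::size_t k) {
        assert(a.size() == m * k && b.size() == k * n && c.size() == m * n);
        log_.events.push_back("mmad");
        for (std::size_t i = 0; i < m; ++i) {
            for (std::size_t j = 0; j < n; ++j) {
                float acc = 0.0f;
                for (std::size_t p = 0; p < k; ++p) {
                    acc += a[i * k + p] * b[p * n + j];
                }
                c[i * n + j] += acc;
            }
        }
    }

private:
    EventLog& log_;
};

// Runs a single 2 x 2 x 2 block task and returns the recorded event order.
std::vector<std::string> RunOneBlockTask(std::vector<float>& c) {
    EventLog log;
    const std::vector<float> a = {1.0f, 2.0f, 3.0f, 4.0f};
    const std::vector<float> b = {5.0f, 6.0f, 7.0f, 8.0f};
    c.assign(4, 0.0f);
    {
        ToyBlockMmad blockMmad(log);  // constructor runs here
        blockMmad(a, b, c, 2, 2, 2);  // one block task
    }                                 // destructor runs at end of scope
    return log.events;
}
```

Calling `RunOneBlockTask` records the events in the order setFlag, mmad, waitFlag, which shows why tying the wait to the destructor keeps the two halves of the synchronization balanced even when the block object leaves scope early.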
