SpV8_Pursuing_Optimal_Vectorization_and_Regular_Computation_Pattern_in_SpMV
In my earlier paper reading, I found that after some time had passed, even when looking at my notes ...
Algorithm and hardware co optimized solution for large SpMV problems
This paper relies on specific hardware for its co-optimization, so it does not generalize well. It did, however, remind me that by row ...
HPC-Game
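HPC Game 0th. I did this as a small HPC exercise. Since the 0th edition was a year ago, writing up my own solutions should be fine. The supercomputing platform that was provided is no longer available and the related documentation can no longer be accessed, so I worked on the problems directly. With registration you can still see ...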
Performance analysis and optimization for SpMV based on ARM
Performance analysis and optimization for SpMV based on aligned storage formats on an ARM processor. This ...
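NEON Programming
The source books are Arm's NEON Programmer’s Guide and NEON Programmer Guide for Armv8-A.
In my actual experience, this documentation was not much help; it only gives a basic understanding of A64, and in practice ...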
WISE:Predicting the Performance of Sparse Matrix
Abstract: Sparse matrix-vector multiplication (SpMV) is an important sparse kernel. Many methods have been developed to accelerate SpM ...
Efficiently Running SpMV on Long Vector Architectures
Abstract: Sparse matrix-vector multiplication (SpMV) is an important kernel in parallel numerical applications. SpMV involves sparse ...
CS149-Assignment-4(2023FALL)
CS149: https://gfxcourses.stanford.edu/cs149/fall23/
Assignment 4: Nano ...
An OpenMP Runtime for Transparent Work Sharing across Cache-Incoherent Heterogeneous Nodes
Key Words: ...
CMU15-418notes(19-23)
Video: https://www.bilibili.com/video/BV1Rh4y1F7aU/?spm_id_from=333.788&vd_source=463e5b3e4b18e54534 ...