Personal information

No personal information available

Activities

Works (9)

FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units

2025-02-28 | Conference paper
Contributors: Haozhi Han; Kun Li; Wei Cui; Donglin Bai; Yiwei Zhang; Liang Yuan; Yifeng Chen; Yunquan Zhang; Ting Cao; Mao Yang
Source: check_circle
Crossref

Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers

2025-02-28 | Conference paper
Contributors: Yiwei Zhang; Kun Li; Liang Yuan; Haozhi Han; Yunquan Zhang; Ting Cao; Mao Yang
Source: check_circle
Crossref

IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs

IEEE Transactions on Parallel and Distributed Systems
2024-09 | Journal article
Contributors: Cunyang Wei; Haipeng Jia; Yunquan Zhang; Jianyu Yao; Chendi Li; Wenxuan Cao
Source: check_circle
Crossref

HAM-SpMSpV: an Optimized Parallel Algorithm for Masked Sparse Matrix-Sparse Vector Multiplications on multi-core CPUs

2024-06-03 | Conference paper
Contributors: Lei Xu; Haipeng Jia; Yunquan Zhang; Luhan Wang; Xianmeng Jiang
Source: check_circle
Crossref

Stencil Computation with Vector Outer Product

2024-05-30 | Conference paper
Contributors: Wenxuan Zhao; Liang Yuan; Baicheng Yan; Penghao Ma; Yunquan Zhang; Long Wang; Zhe Wang
Source: check_circle
Crossref

ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores

2024-03-02 | Conference paper
Contributors: Yuetao Chen; Kun Li; Yuhao Wang; Donglin Bai; Lei Wang; Lingxiao Ma; Liang Yuan; Yunquan Zhang; Ting Cao; Mao Yang
Source: check_circle
Crossref

Redesigning OpenKMC for Multi-Component Trillion-Atom Simulations on the New Sunway Supercomputer

IEEE Transactions on Parallel and Distributed Systems
2023 | Journal article
Contributors: Lei Xu; Honghui Shang; Xin Chen; Yunquan Zhang; Lifang Wang; Xingyu Gao; Haifeng Song
Source: check_circle
Crossref

OpenFFT: An Adaptive Tuning Framework for 3D FFT on ARM Multicore CPUs

2023-06-21 | Conference paper
Contributors: Tun Chen; Haipeng Jia; Yunquan Zhang; Kun Li; Zhihao Li; Xiang Zhao; Jianyu Yao; Chendi Li
Source: check_circle
Crossref

AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format

IEEE Transactions on Parallel and Distributed Systems
2023-03-01 | Journal article
Contributors: Hang Cao; Liang Yuan; He Zhang; Yunquan Zhang; Baodong Wu; Kun Li; Shigang Li; Minghua Zhang; Pengqi Lu; Junmin Xiao
Source: check_circle
Crossref