Activities

Works (9)

Performance Portability Evaluation of Blocked Stencil Computations on GPUs

2023-11-12 | Conference paper
Contributors: Oscar Antepara; Samuel Williams; Hans Johansen; Tuowen Zhao; Samantha Hirsch; Priya Goyal; Mary Hall
Source: Crossref

Polyhedral Specification and Code Generation of Sparse Tensor Contraction with Co-iteration

ACM Transactions on Architecture and Code Optimization
2023-03-31 | Journal article
Contributors: Tuowen Zhao; Tobi Popoola; Mary Hall; Catherine Olschanowsky; Michelle Strout
Source: Crossref

Optimizing Data Movement and Achieving Performance Portability with Fine-Grained Data Blocking

2022 | Dissertation or Thesis
Contributors: Tuowen Zhao
Source: Self-asserted source
Tuowen Zhao

Improving communication by optimizing on-node data movement with data layout

Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
2021-02-17 | Conference paper
Contributors: Tuowen Zhao; Mary Hall; Hans Johansen; Samuel Williams
Source: Self-asserted source
Tuowen Zhao

Exploiting reuse and vectorization in blocked stencil computations on CPUs and GPUs

Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
2019-11-17 | Conference paper
Contributors: Tuowen Zhao; Protonu Basu; Samuel Williams; Mary Hall; Hans Johansen
Source: Self-asserted source
Tuowen Zhao

Delivering Performance-Portable Stencil Computations on CPUs and GPUs Using Bricks

2018 IEEE/ACM International Workshop on Performance, Portability and Productivity in HPC (P3HPC)
2018-11 | Conference paper
Contributors: Tuowen Zhao; Samuel Williams; Mary Hall; Hans Johansen
Source: Self-asserted source
Tuowen Zhao

SIMD code generation for stencils on brick decompositions

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
2018-02-10 | Conference poster
Contributors: Tuowen Zhao; Mary Hall; Protonu Basu; Samuel Williams; Hans Johansen
Source: Self-asserted source
Tuowen Zhao

A Novel Variable-Blocking Representation for Efficient Sparse Matrix-Vector Multiply on GPUs

Supercomputing
2016 | Conference poster
Contributors: Tuowen Zhao; Tharindu Rusira; Khalid Ahmad; Mary Hall
Source: Self-asserted source
Tuowen Zhao

Chapel With Polyhedral Transformation Using Autotuning

The 3rd Annual Chapel Implementers and Users Workshop
2016 | Conference abstract
Contributors: Tuowen Zhao; Mary Hall
Source: Self-asserted source
Tuowen Zhao