Personal information

United States

Activities

Employment (2)

Nvidia (United States): Santa Clara, California, US

2022-03-30 to present
Employment
Source: Self-asserted source
Wei Wu

Los Alamos National Laboratory: Los Alamos, NM, US

2017-06-10 to 2022-03-30 | Research Scientist
Employment
Source: Self-asserted source
Wei Wu

Education and qualifications (1)

University of Tennessee: Knoxville, TN, US

2011-08-15 to 2017-05-30 | Ph.D. (Computer Science)
Education
Source: Self-asserted source
Wei Wu

Works (21)

O3BNN: An Out-of-order Architecture for High-performance Binarized Neural Network Inference with Fine-grained Pruning

Proceedings of the ACM International Conference on Supercomputing
2019 | Conference paper
Part of ISBN: 978-1-4503-6079-1
Source: Self-asserted source
Wei Wu

Task Bench: A Parameterized Benchmark for Evaluating Parallel Runtime Performance

2019-08-15 | Preprint
Source: Self-asserted source
Wei Wu

ADAPT: An Event-based Adaptive Collective Communication Framework

Proceedings of the 27th International Symposium on High-Performance Parallel and Distributed Computing
2018 | Conference paper
Part of ISBN: 978-1-4503-5785-2
Source: Self-asserted source
Wei Wu

Superneurons: Dynamic GPU Memory Management for Training Deep Neural Networks

Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
2018 | Conference paper
Part of ISBN: 978-1-4503-4982-6
Source: Self-asserted source
Wei Wu

A Web-based Visual Analytic Framework for Understanding Large-scale Environmental Models: A Use Case for the Community Land Model

Procedia Computer Science
2017 | Conference paper
EID:

2-s2.0-85027366038

Contributors: Xu, Y.; Wang, D.; Janjusic, T.; Wu, W.; Pei, Y.; Yao, Z.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Compiler technologies for understanding legacy scientific code: A case study on an ACME land module

Procedia Computer Science
2017 | Conference paper
EID:

2-s2.0-85027317814

Contributors: Wang, D.; Pei, Y.; Hernandez, O.; Wu, W.; Yao, Z.; Kim, Y.; Wolfe, M.; Kitchen, R.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Efficient Communications in Training Large Scale Neural Networks

Proceedings of the on Thematic Workshops of ACM Multimedia 2017
2017 | Conference paper
Part of ISBN: 978-1-4503-5416-5
Source: Self-asserted source
Wei Wu

BLASX: A High Performance Level-3 BLAS Library for Heterogeneous Multi-GPU Computing

Proceedings of the 2016 International Conference on Supercomputing
2016 | Conference paper
Part of ISBN: 978-1-4503-4361-9
Source: Self-asserted source
Wei Wu

GPU-aware non-contiguous data movement in open MPI

HPDC 2016 - Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing
2016 | Conference paper
EID:

2-s2.0-84978536097

Contributors: Wu, W.; Bosilca, G.; VandeVaart, R.; Jeaugey, S.; Dongarra, J.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Implementing directed acyclic graphs with the heterogeneous system architecture

9th Workshop on General Purpose Processing using GPUs, GPGPU 2016 - Proceedings
2016 | Conference paper
EID:

2-s2.0-84966670212

Contributors: Puthoor, S.; Aji, A.M.; Che, S.; Daga, M.; Wu, W.; Beckmann, B.M.; Rodgers, G.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

A scientific function test framework for modular environmental model development: Application to the community land model

Proceedings - 2015 International Workshop on Software Engineering for High Performance Computing in Science, SE4HPCS 2015
2015 | Conference paper
EID:

2-s2.0-84989216702

Contributors: Wang, D.; Janjusic, T.; Iversen, C.; Thornton, P.; Karssovski, M.; Wu, W.; Xu, Y.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Hierarchical DAG Scheduling for Hybrid Distributed Systems

2015 IEEE International Parallel and Distributed Processing Symposium
2015 | Conference paper
Source: Self-asserted source
Wei Wu

A feasibility study on porting the community land model onto accelerators using OpenACC

International Journal of Advanced Computer Science and Applications
2014 | Journal article
Source: Self-asserted source
Wei Wu

Multifractal and singularity analysis of weighted road networks

International Journal of Modern Physics B
2014 | Journal article
Source: Self-asserted source
Wei Wu

Algorithms for modeling structural changes in human chromosomes

Computer Methods and Programs in Biomedicine
2013 | Journal article
EID:

2-s2.0-84875893848

Contributors: Yang, X.; Wu, W.; Tseng, C.C.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Effective algorithms for altering human chromosome shapes

The 2011 International Conference on Modeling, Simulation and Visualization Methods
2011 | Conference paper
Source: Self-asserted source
Wei Wu

Simulation of human abnormal chromosomes: An innovative tool for teaching

2011 International Conference on Control, Automation and Systems Engineering, CASE 2011
2011 | Conference paper
EID:

2-s2.0-80052879448

Contributors: Wu, W.; Yang, X.; Tseng, C.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Virtual chromosome modeling for learning human cytogenetics

2010 8th IEEE International Conference on Control and Automation, ICCA 2010
2010 | Conference paper
EID:

2-s2.0-77957882206

Contributors: Yang, X.; Wu, W.; Wen, D.; Chen, B.; Lacny, J.; Tseng, C.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

Computer Based Simulation of Chromosome Abnormality

BIOCOMP
2009 | Conference paper
Source: Self-asserted source
Wei Wu

Virtual reality based robotics learning system

Proceedings of the IEEE International Conference on Automation and Logistics, ICAL 2008
2008 | Conference paper
EID:

2-s2.0-56449126551

Contributors: Yang, X.; Zhao, Y.; Wu, W.; Wang, H.
Source: Self-asserted source
Wei Wu via Scopus - Elsevier

A Framework for Analyzing the Community Land Model within the Communty Earth System Models

Journal article
Source: Self-asserted source
Wei Wu

Peer review (12 reviews for 1 publication/grant)

Review activity for IEEE transactions on parallel and distributed systems : (12)