- 👋 Hi, I’m Lingqi ZHANG, a Post Doctoral Researcher at RIKEN.
- 👀 My expertise spans node-level performance modeling and performance engineering in memory-bound problems.
- 🌱 I am currently interested in:
- Memory hierarchy
- Task Graph & CUDA Graph
- Tensor Core
- 🌱 My previous works:
- Exploring how the latest GPU hardware influences programming.
- Device-wide synchronization introduced from CUDA 9.0 -> Details pls refer to PERKS repository and Reduction Case Study.
- Other features like async shared memory copy and large cache systems -> Details pls refer to EBISU repository.
- Exploring how the latest GPU hardware influences programming.
RIKEN-RCCS Post Doc
Tokyo Tech Ph.D.
Interested in parallel programming and GPU programming.
Currently working on code generation and machine learning topic
-
RIKEN
- Tokyo
-
00:16
(UTC -12:00)
Pinned Loading
-
EBISU-ICS23
EBISU-ICS23 PublicThis is a repo to keep the experimental implementation of EBISU used in ICS23
Cuda 3
-
SyncMicrobenchmark
SyncMicrobenchmark PublicThis work aims at characterizing the synchronization methods in CUDA.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.