LOG-Means: Efficiently Estimating the Number of Clusters in Large Datasets []

GPUPool: A Holistic Approach to Fine-Grained GPU Sharing in the Cloud [GPU]

On-the-Fly Elimination of Dynamic Irregularities for GPU Computing [GPU, ASPLOS, Compiler]

CXL over Ethernet: A Novel FPGA-based Memory Disaggregation Design in Data Centers [CXL, Network, arXiv]

Exploring the Use of WebAssembly in HPC [arXiv]