Challenge: The bandwidth limitations and low communication efficiency faced by traditional bus architectures in on-chip multi-core & many-core systems.
Solution: NoC provides a flexible and high-bandwidth communication infrastructure for heterogeneous chiplets; enables efficient data movement and processing in heterogeneous many-core architectures.
Ring Road: A Scalable Polar-Coordinate-based 2D Network-on-Chip Architecture
Ring Road topology based on isolated cycles and trees; polar coordinate DOR(dimension-order-routing); inter/intra-chip decouple routing algorithm
2024
arXiv
WSU
Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures
heterogeneous 3D NoC; pipeline design across heterogeneous resources; crossbar-wise quantization
2024
ISLPED
WSU
HeTraX: Energy Efficient 3D Heterogeneous Manycore Architecture for Transformer Acceleration
3D integration; distinct planar tiers where each tier is tailor-made for either MHA or the FF network; alleviate memory bottlenecks while preventing frequent rewrites on ReRAM crossbars
StreamGrid: Streaming Point Cloud Analytics via Compulsory Splitting and Deterministic Termination
compulsory splitting for reducing on-chip buffer size; deterministic termination for regularizing non-deterministic operations; line buffer optimization for point cloud pipelines; ILP-based buffer size minimization
CXL over Ethernet: A Novel FPGA-based Memory Disaggregation Design in Data Centers
combining CXL and Ethernet for low-latency remote memory access; FPGA-based prototype with cache optimization; switch-independent congestion control algorithm; native memory semantics for transparent access
2024
arXiv
UCSD
Telepathic Datacenters: Fast RPCs using Shared CXL Memory
pointer-passing RPC over CXL; MPK-based sandboxing for RPC safety; Seal/Release mechanism for RPC safety; RDMA fallback for RPC scalability; Lease/Quota shared memory management