Memory Mapping Tutorial

TEMP: A Memory Efficient Physical-Aware Tensor Partition-Mapping Framework on Wafer-Scale Chips

Abstract: Large language models (LLMs) demand significant memory and computation resources. Wafer-scale chips (WSCs) provide high computation power and die-to-die (D2D) bandwidth but face a unique ...

IEEE

Input Mapping Design for Batch-to-Batch Optimization With Limited Memory

Abstract: This brief discusses data-driven design techniques for batch-to-batch optimization problems and proposes a new input-mapping-based online uncertainty compensation method for ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

TEMP: A Memory Efficient Physical-Aware Tensor Partition-Mapping Framework on Wafer-Scale Chips

Input Mapping Design for Batch-to-Batch Optimization With Limited Memory

Trending now