» Front Page » Permalink » Source ↑ 102 ↓ Matrix-vector multiplication implemented in off-the-shelf DRAM for Low-Bit LLMs