## Driving HW-SW Co-Design for Software Defined Memory Systems



## SDM System Workstreams & Scope of Deliverables

|                                                                    | Frontier A<br>Local Memory Expansion                                                                                        |  |
|--------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------|--|
| Solution blue-prints [primary use cases]                           | Databases / Caching / Virtualization<br>Al/ML, HPC, others                                                                  |  |
| HW configurations<br>[new capabilities to<br>improve solution ROI] | CPU / GPU / Mem. expanders / Accelerators                                                                                   |  |
| HW specifications<br>[for emerging devices]                        | Form factor, thermal, device mgmt., security                                                                                |  |
| SW ecosystem                                                       | Use case specific SW platform readiness                                                                                     |  |
| SW & HW integration<br>[specification gaps closure]                | Caching controls<br>Page migrations for memory tiering<br>(for emerging memory, low-cost memory)<br>Hot / cold page mapping |  |
| Open-source benchmarking<br>[ecosystem consistency]                | Cachebench, Deepspeed, others                                                                                               |  |
| Compute near memory                                                | Basic Semantics                                                                                                             |  |





## SDM System Workstreams & Scope of Deliverables

|                                                                    | Frontier A<br>Local Memory Expansion                                                                                        | Frontier B<br>Pooled Memory Expansion                                                    | Frontier C<br>Switched Memory Fabrics                                                                                                                         |
|--------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Solution blue-prints [primary use cases]                           | Databases / Caching / Virtualization<br>Al/ML, HPC, others                                                                  | Virtualization                                                                           | To be developed                                                                                                                                               |
| HW configurations<br>[new capabilities to<br>improve solution ROI] | CPU / GPU / Mem. expanders / Accelerators                                                                                   | Multi-port and/or Multi-host<br>configuration & specifications<br>for memory controllers | Switch based fabrics<br>(config. differences for<br>within & across racks)                                                                                    |
| HW specifications<br>[for emerging devices]                        | Form factor, thermal, device mgmt., security                                                                                | Device failure handling<br>RAS definitions<br>Memory fencing and other such issues       | +<br>Switch definitions & specs<br>Multi-protocol (network, CXL, others)<br>Variable payload efficiency (byte to block)<br>Electrical & Optical interconnects |
| SW ecosystem                                                       | Use case specific SW platform readiness                                                                                     | +<br>Dynamic memory allocation capability<br>Memory overcommit capability                |                                                                                                                                                               |
| SW & HW integration<br>[specification gaps closure]                | Caching controls<br>Page migrations for memory tiering<br>(for emerging memory, low-cost memory)<br>Hot / cold page mapping | +<br>Pooled memory system management<br>Including interconnect config & mgmt.            | Needs focused work to sharpen scope                                                                                                                           |
| Open-source benchmarking<br>[ecosystem consistency]                | Cachebench, Deepspeed, others                                                                                               | mlc, others                                                                              | Pick compating                                                                                                                                                |
| Compute near memory                                                | Basic semantics                                                                                                             |                                                                                          | Rich semantics                                                                                                                                                |

