As high-performance computing (HPC) and artificial intelligence (AI) workloads scale in complexity and volume, energy efficiency has become a defining challenge for data centres worldwide. Enter Compute Express Link (CXL) 3.0—a groundbreaking interconnect standard that is reshaping how memory and compute resources are pooled, shared, and managed across heterogeneous systems. Among its most impactful innovations is the L0p low-power operational state, designed to maximize energy savings without sacrificing bandwidth or responsiveness. In this post, we’ll explore the evolving role of L0p in CXL 3.0, its importance in modern computing architectures, and how Synopsys Verification IP (VIP) empowers engineers to validate these features for next-generation deployments.
Compute Express Link (CXL) 3.0 is the backbone of next-gen HPC and AI systems, enabling seamless, high-speed, low-latency communication between CPUs, memory, accelerators, and other devices. The three protocols—CXL.io, CXL.cache, and CXL.mem—work together to enable resource sharing, memory pooling, and cache coherence, which are vital for cloud-scale and AI-centric environments.
L0p is a strategic low-power state where devices remain operational with reduced energy draw. Unlike deep sleep states that incur latency penalties, L0p enables rapid transitions back to full performance (L0) as soon as workload demands spike—a crucial capability for bursty AI inference or fluctuating HPC simulations. The Figure 1 illustrates a high-level architecture for HPC and AI acceleration using CXL 3.0 L0p. It shows CPUs connected to switches, which in turn link to CXL 3.0 devices.
Here are some key characteristics that make the L0p low-power state valuable in CXL 3.0 devices:
Figure 1: High-level block diagram illustrating HPC and AI acceleration using CXL 3.0 L0p for energy efficiency.
L0p plays an important role in CXL 3.0 for several reasons:
With memory pooling, CXL 3.0 enables flexible allocation of memory across CPUs and accelerators. L0p ensures that devices are always ready to serve memory requests without unnecessary power drain. For example, in cloud-native environments, L0p allows accelerators to “wake up” instantly when shared memory is needed for real-time inference or training.
The combination of L0p and CXL 3.0 is particularly beneficial for applications that require high performance but are sensitive to energy consumption:
Cloud Data Centers: CXL 3.0 with L0p supports elastic scaling while maintaining green computing standards.
Verification challenges continue to grow with each generation of evolving protocols. Specific challenges for CXL 3.0 L0p include:
Synopsys CXL 3.0 VIP offers a comprehensive suite for L0p verification, supporting the latest CXL 3.0 and PCIe specs. It leverages ARB/MUX ALMPs for dynamic link width negotiation, allowing for priority-based requests and real-time width adjustments based on aggregate demands. Engineers can orchestrate L0p transitions, configure NAK responses for robustness testing, and monitor trace logs for detailed debugging.
Synopsys CXL 3.0 VIP offers several features to streamline L0p verification:
Figure 2: L0p Downsize Sample Trace
Figure 3: L0p Up-size Sample Trace
Synopsys CXL 3.0 Verification IP offers industry-leading solutions for verifying new architectural features, Handling Higher Bandwidth & Speed, and ensuring compliance with latest CXL 3.0 standards. Key advantages include:
Figure 4: CXL 3.0 VIP Architecture Diagram
CXL 3.0’s L0p state is a game-changer for energy-efficient, high-performance computing environments. By intelligently balancing power and performance, L0p enables sustainable growth for AI and HPC workloads. Synopsys CXL 3.0 VIP provides a comprehensive solution for engineers to debug and validate the L0p feature efficiently. It offers flexibility to create dynamic and complex corner-case scenarios, enabling thorough verification of power management behaviour under various conditions. By supporting configurable L0p requests, NAK handling, and simultaneous request resolution, the VIP ensures robust testing of link width negotiation and fallback mechanisms. Combined with trace logging and in-built compliance checks, it accelerates the verification process while maintaining adherence to the CXL 3.0 specification.
Synopsys is partnering with early customers and collaborators to enhance the standard architecture for their next-generation designs, incorporating new features now available with the latest specifications.
Synopsys VIP is natively integrated with the Synopsys Verdi® Protocol Analyzer debug solution as well as Synopsys Verdi® Performance Analyzer. Running system-level payload on SoCs requires a faster hardware-based pre-silicon solution. Synopsys transactors, memory models, hybrid and virtual solutions based on Synopsys IP enable various verification and validation use-cases on the industry’s fastest verification hardware, Synopsys ZeBu® emulation and Synopsys HAPS® prototyping systems.