Artificial Intelligence is transforming data center design. As models grow and compute clusters scale, networking inside the rack has become just as critical as the accelerators themselves. Traditional Ethernet was built for scale-out, connecting racks and buildings, but AI workloads demand something different: scale-up networking. This means ultra-fast, low-latency links between hundreds or even thousands of accelerators within a single pod.
AI pods have changed the rules of networking. Inside a rack, accelerators must share data with sub-microsecond latency and deterministic performance while keeping tail latency in check. The industry is aligning behind open, Ethernet-based approaches that deliver scale-up performance without vendor lock-in, most notably: ESUN (an OCP workstream), SUE (an OCP-published framework), and UALink (a consortium standard for memory semantics).
In this article, we go deeper into each Ethernet-based initiative: its scope, the problems it solves, how it complements the others, and how Synopsys 224G Ethernet PHY IP provides the common physical layer for all three.
Scale-out Ethernet connects racks and even entire data centers, handling traffic across large distances. In contrast, scale-up Ethernet operates inside a single pod, linking tens to thousands of accelerators over short, single-hop paths. This environment demands extremely low latency and high bandwidth because accelerators need to exchange data rapidly for AI workloads.
Figure 1. To process AI workloads effectively, the entire accelerator cluster must operate as one computer
To meet these requirements, industry groups are adapting Ethernet’s lower layers (L1 & L2) and transport behaviors for scale-up scenarios. The goal is to keep Ethernet’s operational familiarity and broad supply chain while optimizing for small headers, lossless transport, in-order delivery, and tight Quality of Service (QoS)—all critical for collective operations like all-reduce and shared-memory patterns in AI training.
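To see why small headers matter at pod scale, the short Python calculation below compares payload efficiency for a conventional Ethernet/IPv4/UDP encapsulation against a compact scale-up header. The 16-byte compact header is an illustrative assumption, not a size taken from any of the specifications discussed here:

```python
# Rough illustration of why header overhead matters for small AI-collective
# messages. Header sizes are illustrative assumptions, not spec values.

ETH_IP_UDP_OVERHEAD = 14 + 20 + 8   # classic Ethernet + IPv4 + UDP headers, bytes
COMPACT_OVERHEAD = 16               # assumed compact scale-up header, bytes

def efficiency(payload_bytes: int, overhead_bytes: int) -> float:
    """Fraction of each frame that carries actual payload."""
    return payload_bytes / (payload_bytes + overhead_bytes)

for payload in (64, 256, 1024):
    std = efficiency(payload, ETH_IP_UDP_OVERHEAD)
    opt = efficiency(payload, COMPACT_OVERHEAD)
    print(f"{payload:5d} B payload: standard {std:.0%} vs compact {opt:.0%} efficient")
```

At 64-byte payloads, the conventional stack spends roughly 40% of every frame on headers, and small payloads are exactly the regime that collective operations live in.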
ESUN (Ethernet for Scale-Up Networking) is an OCP workstream launched at the OCP Global Summit 2025. It's an open technical forum to advance Ethernet for the scale-up domain, focusing on how switches and NICs handle AI pod traffic: framing, lossless delivery, header optimization, and interoperability across vendors. ESUN explicitly coordinates with IEEE 802.3 and the Ultra Ethernet Consortium (UEC).
Figure 2. ESUN diagram, taken from the Open Compute Project blog post "Introducing ESUN: Advancing Ethernet for Scale-Up AI Infrastructure at OCP"
ESUN creates the standards-coordination layer many operators want: open, vendor-neutral L2/L3 behavior that they can implement on different switch silicon and NICs without abandoning Ethernet. This complements endpoint/transport efforts (e.g., SUE-T in OCP and UEC profiles) and makes Ethernet the connective tissue for scale-up as well as scale-out.
SUE (Scale-Up Ethernet) is a framework specification contributed to OCP (v1.0 released September 5, 2025) that spells out how an Ethernet-based pod should work. It introduces an AI Fabric Header (AFH) to minimize per-packet overhead and defines Link-Level Retry (LLR) and Credit-Based Flow Control (CBFC) for hop-by-hop reliability with deterministic tail latency.
Figure 3. Example mesh-deployment use case, taken from the OCP SUE specification
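To make CBFC concrete, here is a minimal Python sketch of the idea, assuming a simple one-credit-per-receive-buffer scheme. The class name, credit counts, and method names are invented for illustration and do not reflect the SUE wire protocol:

```python
# Conceptual sketch of hop-by-hop credit-based flow control (CBFC).
# A sender may only transmit while it holds credits; the receiver
# returns a credit for every buffer it frees, so the link never
# drops frames for lack of buffer space. Names are illustrative.

from collections import deque

class CreditedLink:
    def __init__(self, receiver_buffers: int = 8):
        self.credits = receiver_buffers      # one credit per receive buffer
        self.rx_queue = deque()

    def send(self, frame: bytes) -> bool:
        """Transmit only if a credit is available; otherwise back-pressure."""
        if self.credits == 0:
            return False                     # sender waits; frame is not dropped
        self.credits -= 1
        self.rx_queue.append(frame)
        return True

    def receiver_drain(self) -> None:
        """Receiver processes one frame and returns a credit to the sender."""
        if self.rx_queue:
            self.rx_queue.popleft()
            self.credits += 1

link = CreditedLink(receiver_buffers=2)
assert link.send(b"frame-0") and link.send(b"frame-1")
assert not link.send(b"frame-2")             # out of credits: lossless back-pressure
link.receiver_drain()                        # credit returned...
assert link.send(b"frame-2")                 # ...so the sender proceeds
```

The point of the credit loop is that loss is prevented before it happens, rather than detected and repaired afterward, which is what keeps tail latency deterministic.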
SUE's LLR/CBFC mechanisms align with UEC's Ethernet extensions, which document control ordered sets and preamble signaling for LLR/CBFC at the PHY/PCS boundary, evidence of cross-community convergence on scale-up reliability.
SUE gives operators and silicon vendors a concrete how-to for building Ethernet pods today. AFH reduces header overhead; LLR/CBFC deliver lossless, deterministic transport without heavyweight end-to-end retries; and the single-hop model fits the core AI collective traffic shape.
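Link-level retry can be pictured the same way. Below is a toy Python model of the replay-buffer idea, assuming per-frame sequence numbers and explicit ack/nack callbacks; the actual SUE encoding of these signals differs, so treat this as a conceptual sketch only:

```python
# Toy model of link-level retry (LLR): every frame carries a sequence
# number and stays in a replay buffer until the link partner acks it.
# On a NACK the sender replays from the failed sequence number,
# recovering within one hop instead of end to end.

class LlrSender:
    def __init__(self):
        self.next_seq = 0
        self.replay = {}                 # seq -> frame, kept until acked

    def transmit(self, frame: bytes) -> int:
        seq = self.next_seq
        self.replay[seq] = frame
        self.next_seq += 1
        return seq                       # frame goes on the wire tagged with seq

    def on_ack(self, seq: int) -> None:
        self.replay.pop(seq, None)       # partner got it; free the buffer

    def on_nack(self, seq: int) -> list[bytes]:
        # Replay everything from the corrupted frame onward, in order.
        return [self.replay[s] for s in sorted(self.replay) if s >= seq]

sender = LlrSender()
for payload in (b"a", b"b", b"c"):
    sender.transmit(payload)
sender.on_ack(0)
print(sender.on_nack(1))                 # -> [b'b', b'c'], replayed hop-locally
```

Because recovery stays within a single hop, the worst-case stall is one link round trip rather than a full end-to-end retransmission timeout.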
UALink is a consortium standard enabling load/store/atomic operations directly between accelerators over UALink switches, turning a pod into a shared-memory domain. The UALink 200G 1.0 specification (publicly released April 2025) supports 200G-per-lane signaling and x1/x2/x4 link configurations, and scales to 1,024 accelerators per pod with deterministic performance.
Figure 4. Scalable multi-node accelerator system with UALink high-speed interconnect. Taken from: UALink™ 200G 1.0 Specification Overview – UALink Consortium
OCP collaboration: OCP and UALink have announced joint work to integrate UALink into community-delivered AI clusters.
UALink provides memory semantics and transaction-layer (TL)/protocol behaviors; ESUN/SUE address L2/L3 framing, reliability, and switch behavior. Many deployments will use UALink inside the pod and ESUN/SUE-aligned Ethernet behaviors for interoperability and operations, with standard Ethernet for scale-out between pods.
AI workloads increasingly rely on shared-memory programming models, which traditional Ethernet cannot efficiently support. UALink addresses this by enabling load/store and atomic operations across accelerators, making a pod behave like a unified memory domain. This reduces software complexity, improves performance for memory-intensive models, and scales to 1,024 accelerators per pod—all while leveraging standard Ethernet physical components for cost efficiency and ecosystem compatibility.
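As a mental model for what memory semantics buy software, consider the hypothetical sketch below. A real UALink stack exposes load/store/atomics through the accelerator's memory system rather than a software API; the PodMemoryDomain class and its methods are invented purely for illustration:

```python
# Hypothetical sketch of load/store/atomic semantics across a pod as
# seen by software. All names here are invented for illustration and
# are not part of the UALink specification.

class PodMemoryDomain:
    """Models every accelerator's memory as one addressable domain."""

    def __init__(self, accelerators: int, words_per_accel: int):
        self.mem = [[0] * words_per_accel for _ in range(accelerators)]

    def load(self, accel: int, addr: int) -> int:
        return self.mem[accel][addr]         # remote read, no send/recv code

    def store(self, accel: int, addr: int, value: int) -> None:
        self.mem[accel][addr] = value        # remote write

    def atomic_add(self, accel: int, addr: int, delta: int) -> int:
        # Fabric-level atomics let many accelerators update one counter
        # (e.g., a gradient-ready flag) without explicit locking messages.
        old = self.mem[accel][addr]
        self.mem[accel][addr] = old + delta
        return old

pod = PodMemoryDomain(accelerators=4, words_per_accel=16)
pod.store(accel=2, addr=0, value=42)         # write into accelerator 2's memory
print(pod.load(accel=2, addr=0))             # any peer reads it back directly
pod.atomic_add(accel=2, addr=0, delta=1)
```

The contrast with message passing is the whole point: no explicit send/receive choreography appears in the code, which is what "reduces software complexity" means in practice.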
| Dimension | ESUN (OCP Workstream) | SUE (OCP Framework) | UALink (Consortium Spec) |
| --- | --- | --- | --- |
| Primary Scope | Open L2/L3 behaviors, headers, lossless delivery, interop across vendors | Pod-level encapsulation + reliability (AFH, LLR, CBFC) for single-hop fabrics | Memory semantics (load/store/atomic), accelerator-to-accelerator transactions |
| Standardization Host | OCP workstream coordinating with IEEE 802.3 and UEC | OCP-published spec (v1.0) | UALink Consortium |
| Key Benefits | Vendor-neutral interop; open headers and QoS for AI pods | Deterministic tail latency; efficient small-payload handling | Shared-memory programming model; deterministic bandwidth/latency |
Figure 5. Networking layers, showing common L1 leveraging Synopsys IP
No matter which approach you choose, ESUN, SUE, or UALink, the physical layer (L1) is critical. It must deliver high signal integrity, margin, and jitter tolerance, with zero post-FEC BER across real-world channels. Synopsys 224G Ethernet PHY IP meets these demands, enabling up to 1.6T Ethernet and supporting UALink 200G signaling, all while complying with evolving IEEE 802.3 and OIF CEI-224G electrical specifications.
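To put the zero post-FEC BER requirement in perspective, the back-of-the-envelope Python calculation below compares error arrival rates on a single 224 Gb/s lane at two bit error rates. The 1e-4 raw-BER figure is an illustrative assumption for a PAM4 link before FEC, not a number from the IEEE or OIF specifications:

```python
# Back-of-the-envelope: how often bit errors would appear on one
# 224 Gb/s lane at different bit error rates. Targets are illustrative.

LANE_RATE_BPS = 224e9                        # bits per second on one lane

for ber in (1e-4, 1e-15):
    errors_per_sec = LANE_RATE_BPS * ber
    if errors_per_sec >= 1:
        print(f"BER {ber:.0e}: ~{errors_per_sec:.1e} bit errors every second")
    else:
        print(f"BER {ber:.0e}: one bit error every {1/errors_per_sec:.1e} seconds")
```

At a raw BER of 1e-4, errors arrive tens of millions of times per second on a single lane, which is why strong FEC plus a PHY with ample electrical margin is non-negotiable at these rates.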
Open Ethernet is emerging as the foundation for scale-up AI networking. Together, ESUN, SUE, and UALink form a complementary stack: ESUN coordinates open Ethernet behaviors, SUE provides a pod-level blueprint with features like AFH and LLR/CBFC to reduce overhead and latency, and UALink introduces memory semantics for shared-memory workloads—scaling up to 1,024 accelerators per pod using Ethernet physical components. These efforts deliver lossless, deterministic performance without proprietary lock-in.
At the physical layer, high-speed Ethernet PHY technology enables all three approaches. Synopsys' silicon-proven 224G Ethernet PHY IP delivers the performance, interoperability, and robustness needed to accelerate deployment of open, multi-vendor scale-up fabrics, ensuring readiness for next-generation AI clusters.