diff --git a/README.md b/README.md index 48c6484..3008473 100644 --- a/README.md +++ b/README.md @@ -2,6 +2,8 @@ Tools, libraries and statistical software for automating, managing, monitoring and testing Umber Fi‑Wi networks. +**Where it runs:** FiWiControl is built to run **on the Umber concentrator** — the Fi‑Wi control plane described in the architecture spec (**`html/Fi-Wi-L4S.php`** in this repo) — for **lab and customer** automation. Day-to-day development still uses **workstation** installs and **lab rigs** (e.g. Raspberry Pi) as in **`docs/install.md`**. + **Naming:** The **Git repository** and checkout directory are **FiWiControl** (mixed case). The **Python distribution** and **import package** are **`fiwicontrol`** (all lowercase, PEP 8) — same project, different casing rules for Git vs Python. Use **`fiwicontrol`** for `pip install` / `import`, not `FiWiControl`. This repository ships that distribution (**`fiwicontrol`** on PyPI / `pip`) with import root **`fiwicontrol`**: @@ -47,6 +49,8 @@ FiWiControl/ ├── LICENSE ├── README.md ├── pyproject.toml +├── html/ +│ └── Fi-Wi-L4S.php ├── docs/ │ ├── install.md │ ├── node-control-asyncio-design.md diff --git a/html/Fi-Wi-L4S.php b/html/Fi-Wi-L4S.php new file mode 100644 index 0000000..e965d2b --- /dev/null +++ b/html/Fi-Wi-L4S.php @@ -0,0 +1,16280 @@ + + + + + + + + Umber Fi-Wi Architecture: Cellularized Wi-Fi with Dynamic Point Selection + + + + + + +
+
+ Umber Networks Proprietary Architecture +
+ +

+ Umber Fi-Wi Architecture: Cellularized Wi-Fi, L4S, and RF Coordination +

+ +

+ Timestamp-synchronized control loops, dynamic RF grouping, and multi-RRH + operation
+ Umber Networks Fi-Wi Technical Architecture Overview (Version 1.1, + December 2025) +

+
+ +
+ Zebras look like horses, but they are not the same... Zebras, despite + man's best efforts, cannot be tamed. The Wi-Fi we have + engineered today remains fundamentally a collection of autonomous, + uncoordinated things—zebras that simply cannot be + harnessed.
+
+ Fi-Wi is architected from the ground up to be controllable, coordinated, + and directed — the horse we need for in-building communications and + sensing. As latency demands tighten and building densities increase, Fi-Wi + isn't just a better future; it's the future we can build today. +
+ +
+

0. Technical Disclaimer

+ +

+ The material presented in this document describes the Fi-Wi architecture + and associated engineering concepts. It is provided "as is" for + discussion and exploratory design purposes only. Nothing in this + document constitutes a formal specification, performance guarantee, + regulatory assertion, or commitment to implement any feature described. +

+ +

+ Several sections use simplified or idealized assumptions to illustrate + architectural differences between Wi-Fi, Multi-Link Operation (MLO), Low + Latency Low Loss Scalable throughput (L4S), and Fi-Wi queueing and + scheduling behavior. These examples are intended to clarify concepts + rather than fully model the non-linear and stochastic dynamics present + in operational wireless systems. +

+ +

+ Real system behavior depends on hardware characteristics, RF topology, + firmware behavior, congestion patterns, environmental conditions, and + interactions with legacy Wi-Fi devices. Actual performance may differ + from the representative models and examples described here. +

+ +

+ Important Note on Capabilities: This document describes + an architecture using Commercial Off-The-Shelf (COTS) Wi-Fi chipsets. + The system provides dynamic point selection, intelligent frequency + reuse, and centralized MAC scheduling. It does not provide RF phase + control, distributed MIMO, or coordinated simultaneous + transmission—capabilities that would require custom ASIC development. + All described features are achievable with commodity Wi-Fi hardware and + comply with unlicensed spectrum regulations. +

+
+ +
+

0.1 L4S Foundation and References

+ +

+ Low Latency, Low Loss, Scalable Throughput (L4S) is a + suite of IETF standards that extend the Internet's congestion control + mechanisms through + Explicit Congestion Notification (ECN) to support very + low queuing delays. L4S is a ratified protocol stack with multiple + production implementations. +

+ +

+ Fi-Wi is architected specifically to provide the deterministic + underlying transport required to satisfy the strict queuing mandates + defined in these standards. +

+ +

Core L4S Specifications

+ + + +

Transport & Production Status

+ +

+ L4S replaces capacity-seeking behavior (Reno/Cubic) with + pacing-based rate control. It is currently deployed in + production environments including: +

+ + + +

Further Reading

+ + +
+ +
+ + + +
+ +

1. Motivation and Problem Statement

+ +
+

+ "Perfection is achieved, not when there is nothing more to add, but when + there is nothing left to take away." — Antoine de Saint-Exupéry +

+ +

+ "Everything should be made as simple as possible, but not simpler." — + Albert Einstein +

+
+ +

+ With 23.3 billion Wi-Fi devices in use worldwide and 5.5 billion people
  depending on internet connectivity, with both numbers still growing, Wi-Fi
  has become the primary way we access the internet. So much so that many
  people think Wi-Fi is the internet. It's how a home
  healthcare worker video-calls to check on a patient, or a cancer patient
  connects to their support group. It's how a parent works remotely while
  their child attends school online, and how lifelong learners access the
  information they need to grow. It's how a grandmother monitors her heart
  condition through a telehealth app. It's how a family member finds their
  next job, or how a neighbor orders a meal.

+ +

+ Running quietly in the background are autonomous systems we've come to + depend on: security cameras that alert us to threats, medical monitors + that track vital signs, smart home systems that manage climate and safety, + IoT sensors that detect water leaks or carbon monoxide. These systems + don't wait for us to notice problems—they operate continuously, silently, + keeping people safe. +

+ +

+ We've moved far beyond entertainment and convenience. Wi-Fi now carries + the infrastructure of daily survival. When it breaks down under density or + congestion, it's not just buffering that fails. It's jobs, healthcare + access, human connection, and the life-safety systems we trust to work + when we're not watching. The $4.9 trillion Wi-Fi contributes to the global + economy isn't an abstract number. It's the cumulative value of billions of + human activities and critical systems that simply stop working when the + network fails. +

+ +

Why Traditional Wi-Fi Cannot Support L4S

+ +

+ The infrastructure supporting all of this is failing at scale, and it must
  be addressed for everyone. The industry is moving toward L4S and ECN-based
  control to eliminate bufferbloat, but traditional Wi-Fi makes this
  impossible. Legacy congestion-control loops fail by design once a single
  flow saturates the bottleneck queue, and even modern ECN-based systems
  such as L4S cannot converge when Wi-Fi hides queue depth, induces
  collision storms, injects firmware-created delays that look like queues,
  and constantly shifts transmission (PHY) rates through its rate-control
  and aggregation machinery. Mesh networks and additional APs make the
  experience worse by injecting more uncoordinated radios into an already
  chaotic RF environment. And because the AP industry understands these
  limits, it is no surprise that even major vendors publicly state that L4S
  cannot operate correctly over the products they sell.

+ +

+ Adding more Ethernet-attached APs makes it worse by creating more + overlapping contention domains. Hidden queues in SoCs, rate-control + firmware, and aggregation pipelines obscure the true bottleneck. In + control-theory terms: the bottleneck queue cannot expose its state, the + PHY rate is not stationary, and the closed loop cannot stabilize. This is + why user experience fails in many apartments and homes, in hotels, MDUs, + stadiums, and high-density buildings long before “capacity” is reached. +

+ +

+ QoS cannot rescue this architecture. Because the bottleneck queue inside a + Wi-Fi AP has no information about actual flow urgency or priority, no QoS + mechanism can operate meaningfully. The only real solution is to avoid + congestion altogether — which is exactly what L4S researchers have + designed for and exactly what Fi-Wi supports. +

+ +

Why Copper Infrastructure Has Reached Its Limits

+ +

+ While the protocol fails in the air, the physical infrastructure fails in
  the walls. The industry's traditional answer, running copper Ethernet to
  APs, simply extends the lifetime of an architecture that has reached its
  limits. Copper requires periodic rip-and-replace cycles: Cat5 becomes
  Cat6, then Cat7, then Cat8. A home builder has no idea what communications
  wiring to install. The RJ45 connector and its plastic tab are fragile,
  outdated, and at end of life. And at 25G, 40G, or 100G, physics takes
  over: copper loses signal in dB per inch. Data centers have abandoned
  structured cabling (long-run copper) for core transport, restricting
  copper to short-reach intra-rack DACs. Fi-Wi applies this same logic to
  the building: fiber for the long haul (halls/walls), radio for the short
  hop.

+ +

How Fi-Wi Breaks Both Cycles

+ +

+ Fi-Wi breaks the cycle. Install fiber once — and never revisit behind + walls or ceilings again. The glass is permanent; only the optics evolve. + Fiber is already the universal medium for 100G/400G data centers, DWDM + long-haul transport, and now PCIe throughout a building with Fi-Wi. Remote + Radio Heads simply convert between fiber and 802.11, eliminating embedded + routing, rate-control SoCs, switching silicon, and the security-patch + treadmill they require. When Wi-Fi standards evolve, you replace the small + radio module(s) — that's all. +

+ +
+ What is C-RAN?
+ Fi-Wi adapts the + Centralized/Cloud Radio Access Network (C-RAN) + architecture from 4G/5G cellular systems. In C-RAN, intelligence (baseband + processing) is centralized while radio heads are distributed. Fi-Wi + applies this proven approach to Wi-Fi, enabling building-scale + coordination impossible with autonomous access points. +
+ +

+ Fi-Wi turns fiber combined with 802.11 into the permanent, predictable, + control-theory-friendly transport that the L4S control loop requires, and + treats 802.11 radio heads as the small, disposable, last-meters, + connector-free interface where the in-building network behaves + deterministically. And because fiber increases the long-term value of a + building, the investment is not just technically durable — it is + financially durable. +

+ +

The Opportunity Is Here

+ +

+ There is no law of physics that says Wi-Fi cannot work at scale. The + collapse we're seeing in apartments, hotels, and high-density buildings + isn't inevitable. The researchers have shown engineers how to proceed. We + know how to build stable control loops. We know how to coordinate radios. + We know how to deploy permanent infrastructure. +

+ +

+ The conditions for solving this are here, now. Engineering talent exists + across our industry. The market has already validated the foundation: + China's FTTR deployments have installed fiber to millions of rooms, + proving that permanent infrastructure at this scale is not just + feasible—it's already happening at volume. What's missing is capital + directed at the right architecture. Investors are essential to this + challenge. Their capital will enable the engineering to serve the market. + And, once proven, market signals will sustain the development, directing + human resources toward building what humanity needs for continued + advancement. +

+ +

+ Fi-Wi is Umber's answer, but the underlying challenge belongs to all of + us. The 5.5 billion people depending on this infrastructure deserve better + than a system designed for convenience that we've repurposed for survival. + This is solvable engineering—the talent is ready, the manufacturing + exists, and the market is waiting. It's time we came together and fixed + this. +

+ +
+ About Umber Networks
+ Umber Networks was founded by Bob McMahon, a networking engineer with 35 + years of experience building internet infrastructure. Bob created and + maintains Iperf2, the industry-standard network performance measurement + tool with over 3 million downloads worldwide. His career spans + foundational work on FDDI for the International Space Station (1989), + development of the Cisco Catalyst RSM routing module deployed worldwide, + and wireless chipset testing using statistical process controls at + Broadcom. Fi-Wi represents the culmination of decades solving congestion + control, wireless scaling, and real-time transport challenges at the + protocol and silicon level. +
+ +
+ +

+ 2. The Wi-Fi Crisis: Why Evolution Failed and Control Was Lost +

+ +

+ The failure of modern Wi-Fi to support low-latency applications (L4S) is
  not a failure of bandwidth; it is a failure of control.
  With 23.3 billion Wi-Fi devices deployed globally, the protocol has hit an
  asymptotic limit where adding complexity yields diminishing returns.

+ +

+ As density rises, autonomous contention scales super-linearly—effectively + operating as the inverse of Metcalfe's Law. The result is a rising noise + floor and media access collisions that render unlicensed spectrum unusable + for the deterministic performance required by next-generation + applications. +

+ +

+ 2.1 The Evolutionary Trap: Why Incremental Improvements Failed +

+ +

+ Evolutionary engineering is powerful; it gave us twenty-five years of + Wi-Fi speed improvements. But every evolutionary curve eventually hits an + asymptote—a point where adding more complexity yields diminishing returns. + We have reached that point. +

+ +
+ "The IEEE 802.11 working group behaves like a composer writing a symphony + that effectively cannot be played. They continually add + instruments—4096-QAM, Puncturing, MLO—without considering that the + musician (the silicon) has only microseconds to react." +
+ +

+ The decision matrix for a Wi-Fi chip has exploded combinatorially. We can + trace this through the + Modulation and Coding Scheme (MCS) Table: +

+ + + +

+ The Physical Trap: When the firmware engineer fails to + optimize the radio, can we simply redesign the chip? No, because of + RTL (Register Transfer Level) Accretion. In software, + engineers "refactor" unwieldy code. In hardware, refactoring is + economically forbidden. A complex SoC takes 18–24 months to validate; + removing "dead" logic risks breaking obscure corner cases. Consequently, + vendors only add; they never subtract. 802.11be logic wraps around + 802.11ax logic, which wraps around 802.11ac logic—twenty-five years of + accumulated technical debt consuming area and leakage power. +

+ +

+ The Market Signal: The ultimate proof that the standard + has reached gridlock is the behavior of market leaders like Samsung and + Apple. They no longer rush to support every new feature—they aggressively + whitelist features and blacklist others because complexity drains battery + and destabilizes connections. When the two largest consumers of wireless + silicon effectively stop buying the complexity argument, the evolutionary + roadmap is broken. +

+ +

+ 2.2 The Density Paradox: More Capacity, Less Performance +

+ +

+ The fundamental instability of 802.11 stems from the + Birthday Paradox applied to media access. In an + autonomous system, as the number of contending stations (n) + increases linearly, the probability of collision increases + combinatorially: +

+ +
+ Collision Pairs = n(n-1)/2
+ For n=100 devices: 4,950 potential collision pairs
+ With per-pair collision probability p: P(≥1 collision) = 1 − (1 − p)^(n(n−1)/2) → 1 as n → ∞
+ +

+ Simulation data confirms that even with moderate client density, collision + probability quickly exceeds 50%, forcing the network into a state of + "Drift" where latency becomes unbounded. Under these conditions, the + network is no longer constrained by PHY capacity, but by the probability + of successful media access. +

+ +
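The scaling argument above can be sketched numerically. A toy model, assuming a fixed, independent per-pair collision probability p (an illustrative constant, not a measured value; real contention is bursty and correlated):

```python
# Toy Birthday-Paradox model of media-access contention.
# Assumption: each of the n(n-1)/2 station pairs collides
# independently with a fixed probability p per contention round.

def collision_pairs(n: int) -> int:
    """Number of distinct contending station pairs."""
    return n * (n - 1) // 2

def aggregate_collision_probability(n: int, p: float = 0.001) -> float:
    """P(at least one collision) = 1 - (1 - p)^pairs."""
    return 1.0 - (1.0 - p) ** collision_pairs(n)

for n in (10, 50, 100, 200):
    print(n, collision_pairs(n),
          round(aggregate_collision_probability(n), 3))
```

With the assumed p = 0.001, the aggregate probability passes 50% at roughly 38 stations and exceeds 99% by n = 100 (the 4,950-pair case above), illustrating why media access, not PHY capacity, becomes the binding constraint.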

+ This is Metcalfe's Law in reverse: instead of each new + node increasing the value of the network, each new node increases the + chance of interference and reduces usable capacity. +

+ +

2.3 The Three Technical Failure Modes

+ +

+ The collapse of the operator model is driven by three distinct + architectural failures inherent to the 802.11 standard. +

+ +

2.3.1 Protocol Tax: The Hidden Node Penalty

+ +

+ Standard Wi-Fi relies on Carrier Sense Multiple Access (CSMA), which + assumes that all stations can hear each other. In real-world MDU + (Multi-Dwelling Unit) environments, this assumption fails + catastrophically. +

+ +

+ Field measurements using ESP32-based sensors reveal that hidden node + contention consumes 30-50% of available airtime in typical MDU + deployments—airtime paid for in spectrum acquisition costs but lost to + protocol overhead invisible to traditional monitoring. This represents a + massive protocol tax where significant airtime is consumed by retries and + backoff slots rather than payload delivery. +

+ +

2.3.2 The MCS Matrix: Un-Engineerable Complexity

+ +

+ The most critical failure for a network operator is the + loss of state control. Modern 802.11ax supports 12 MCS + indices × 4 bandwidth options × 8 spatial stream configurations × 3 guard + intervals = >1,000 valid PHY states. Autonomous rate + selection must navigate this space at sub-millisecond timescales under + non-stationary noise. +

+ +

This creates a Non-Stationary System:

+ + + +

+ Because Wi-Fi is non-stationary, autonomous rate selection under + contention has no bounded outcome. The IEEE 802.11 standard has allowed + the MCS table to explode into hundreds of valid permutations—a chaotic + state space that firmware must navigate in microseconds with incomplete + information. +

+ +
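The size of that state space can be checked by direct enumeration. A sketch using the dimension counts quoted above (real chipsets prune many combinations, e.g. some MCS/stream pairings are invalid, so 1,152 is a nominal upper bound):

```python
from itertools import product

# Dimension counts taken from the text; hardware prunes many entries.
MCS_INDICES = range(12)                 # MCS 0-11
BANDWIDTHS_MHZ = (20, 40, 80, 160)      # 4 bandwidth options
SPATIAL_STREAMS = range(1, 9)           # 1-8 spatial stream configurations
GUARD_INTERVALS_US = (0.8, 1.6, 3.2)    # 3 guard intervals

phy_states = list(product(MCS_INDICES, BANDWIDTHS_MHZ,
                          SPATIAL_STREAMS, GUARD_INTERVALS_US))
print(len(phy_states))  # 12 * 4 * 8 * 3 = 1152 nominal PHY states
```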

2.3.3 The Spatial Contention Cascade

+ +

+ As load increases, the spatial precision of the network degrades. + Mathematical modeling shows that the condition number (κ)—a measure of how + well-conditioned the MIMO channel matrix is—degrades from 6 dB (excellent + spatial separation) to >12 dB (severe interference) under load. This + collapse means that 4×4 MIMO effectively degrades to 2×2 or worse, turning + additional spatial streams into self-interference rather than capacity. +

+ +
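The condition-number metric itself is straightforward to compute. A sketch with NumPy; the two 2×2 channel matrices are invented for illustration and are not field data:

```python
import numpy as np

def condition_number_db(H: np.ndarray) -> float:
    """Kappa in dB: ratio of largest to smallest singular value of H."""
    s = np.linalg.svd(H, compute_uv=False)
    return 20.0 * np.log10(s[0] / s[-1])

# Illustrative channels (assumed values):
well_separated = np.array([[1.0, 0.1],
                           [0.1, 1.0]])   # nearly orthogonal spatial paths
correlated     = np.array([[1.0, 0.9],
                           [0.9, 1.0]])   # paths almost collinear

print(round(condition_number_db(well_separated), 1))  # ~1.7 dB: both streams usable
print(round(condition_number_db(correlated), 1))      # ~25.6 dB: second stream is noise
```

As the off-diagonal correlation grows under load, κ climbs past the 12 dB mark and the second spatial mode carries interference rather than capacity.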

+ This degradation collapses the theoretical gains of Mu-MIMO, transforming + high-order spatial streams into interference rather than usable capacity. + The "Efficiency Paradox" emerges: Wi-Fi evolution has focused on shrinking + Payload Duration (faster PHY rates like 4096-QAM) while MAC Overhead (LBT, + Backoff, Preamble) remains constant. To amortize the overhead, chips must + build massive Aggregates (A-MPDUs). This destroys latency. We have + engineered a Ferrari engine (the PHY) inside a garbage truck (the MAC). +

+ +

2.4 The Operator's Dilemma

+ +

+ For network operators—whether cable MSOs, telcos, or fiber providers—this + architectural chaos presents a fundamental business risk: + You own the customer experience, but not the air interface. +

+ + + +

2.5 Why Conventional Solutions Don't Scale

+ +

+ Traditional attempts to solve Wi-Fi density problems fail because they + address symptoms rather than the underlying architectural failure: +

+ + + +

+ The Trillion-Dollar Context: The mobile industry spent + $600 billion building 5G to get scheduled, deterministic performance + outdoors. They understand that unlicensed spectrum + autonomous contention + = chaos. The genius of 5G is its architecture; its Achilles heel is its + cost. In recent auctions, 20 MHz of licensed mid-band spectrum sold for + over $17 billion for U.S. rights alone. +

+ +

+ Fi-Wi applies the cellular C-RAN architecture indoors—but on unlicensed + spectrum that costs nothing. This is the arbitrage opportunity. +

+ +

+ 2.6 The Client Side: L4S and the End of Uplink Contention +

+ +

+ The architectural reset is not limited to the infrastructure; it + fundamentally alters the behavior of the Station (STA). In legacy Wi-Fi, + the STA is an autonomous agent that fights for upstream airtime using EDCA + (Enhanced Distributed Channel Access). It maintains its own local WMM + queues and blindly transmits whenever it wins a contention window, often + oblivious to the fact that the AP's receive buffer is already full. +

+ +

+ The L4S Inversion: With L4S, the "Quality of Service" + decision moves from the Wi-Fi card's firmware to the application's + congestion control algorithm. We replace the rigid, static categories of + WMM with the dynamic, adaptive responsiveness of + TCP Prague and other L4S-compliant congestion controls. +

+ + + +

+ Eliminating the "Uplink Queue": This effectively + virtualizes the queue. Instead of a deep buffer sitting on the Wi-Fi chip + waiting to be transmitted, the packets are held in user-space memory on + the client device, waiting for the "go" signal (or rather, the absence of + a "stop" signal). The traffic never enters the contention domain until + there is guaranteed capacity to service it. The STA no longer needs + complex internal QoS schedulers because it is no longer trying to force + more data than the pipe can hold. +

+ +
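A minimal sketch of this "virtualized queue" idea (the class name, rate, and polling interface are invented for illustration, not an API from this document): packets wait in user-space memory and are released one at a time at the paced rate, so nothing accumulates below the application.

```python
from collections import deque

class PacedSender:
    """Toy L4S-style sender: the backlog lives in user space and is
    drained at the paced rate, so driver/hardware queues stay empty."""

    def __init__(self, rate_pps: float):
        self.interval = 1.0 / rate_pps  # seconds between releases
        self.backlog = deque()          # user-space memory, not a TX ring
        self.next_tx = 0.0              # earliest time the next packet may go

    def enqueue(self, packet) -> None:
        self.backlog.append(packet)

    def poll(self, now: float):
        """Release one packet if the pacing gate is open, else None."""
        if self.backlog and now >= self.next_tx:
            self.next_tx = now + self.interval
            return self.backlog.popleft()
        return None

s = PacedSender(rate_pps=1000.0)        # 1 packet per millisecond
for i in range(3):
    s.enqueue(f"pkt{i}")
print(s.poll(0.0))     # pkt0 released immediately
print(s.poll(0.0))     # None: pacing gate closed
print(s.poll(0.001))   # pkt1 after one pacing interval
```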
+

+ Technical Insight: The "Driver Queue" Trap +

+ +

+ In legacy systems, flow control happens at the driver level. When the + Wi-Fi card's hardware buffer fills up (the TX Ring), it signals the + Operating System to "Stop the Queue." The OS then buffers packets in + software (qdisc) until the hardware signals "Go." +

+ +

+ This is catastrophic for latency. It creates a hidden + reservoir of old data sitting in the kernel, waiting for the hardware to + clear. By the time the hardware is ready, the packets in the OS queue + are already stale. +

+ +

+ L4S eliminates this layer of buffering entirely. + Because TCP Prague adjusts the send rate to match the + actual airtime capacity (signaled via ECN), the application + never sends enough data to fill the hardware ring buffer. The driver + never has to assert flow control, the OS queue remains empty, and every + packet that hits the driver is fresh, ensuring immediate transmission. +

+
+ +
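The rate adaptation described above can be sketched with a DCTCP-style control law, a simplification of what TCP Prague does. The gain g and the additive-increase step are assumed constants, and real Prague operates on a congestion window per RTT rather than a scalar rate:

```python
def prague_like_step(rate: float, alpha: float, marked_fraction: float,
                     g: float = 1.0 / 16, step: float = 1.0):
    """One RTT of a DCTCP-style loop: scale back in proportion to the
    smoothed ECN-mark fraction, otherwise probe upward additively.
    Returns (new_rate, new_alpha)."""
    alpha = (1.0 - g) * alpha + g * marked_fraction  # EWMA of mark signal
    if marked_fraction > 0.0:
        rate *= 1.0 - alpha / 2.0                    # proportional decrease
    else:
        rate += step                                 # additive increase
    return rate, alpha

rate, alpha = 100.0, 0.0
rate, alpha = prague_like_step(rate, alpha, marked_fraction=0.0)
print(rate)   # 101.0: no marks, probe upward
rate, alpha = prague_like_step(rate, alpha, marked_fraction=1.0)
print(round(rate, 3), round(alpha, 4))
```

The key property is that the reduction is proportional to the smoothed marked fraction, so the loop settles near the marking threshold instead of oscillating between full and half rate as loss-based controllers do.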

2.7 The Strategic Reset: Splitting the Graph

+ +

+ Solving this requires a "Subtractive Architecture." Instead of adding more + features to the radio, we must remove them. The architectural breakthrough + of Fi-Wi is decoupling the MCS State Graph described in + Section 2.3.2 into its constituent parts: +

+ + + +

+ This architectural shift—from distributed chaos to centralized + control—mirrors the evolution from + analog transmission systems (noise-prone, + operator-invisible) to digital QAM (deterministic, + monitorable). + Fi-Wi completes this transformation for the last 10 meters, moving the network from a model of probabilistic negotiation to one of + deterministic execution. +

+ +

+ Section 13 describes the Concentrator's scheduling algorithm that + implements this graph traversal, while Appendix C details the RRH's + scatter-gather DMA mechanism that executes the chosen state transitions at + microsecond timescales. +

+ +
+

+ Technical Insight: The QoS Fallacy +

+ +

+ Traditional QoS mechanisms in Wi-Fi—WMM access categories, priority + queues, and traffic shaping—reflect a fundamental architectural flaw: + treating contention as inevitable and attempting to optimize it + through priority classes. + This approach attempts to infer urgency by classifying packets, then + granting probabilistic access to the medium—essentially rolling dice + with weighted odds. +

+ +

+ L4S changes the premise entirely. Flows signal their + tolerance for delay using ECN, allowing the network to signal sources to + control their own send rates. Across many flows, this controls the + aggregate arrival rates at the forwarding plane based on real-time queue + feedback rather than static classes. +

+ +

+ In a Fi-Wi architecture, where all wireless transmissions are centrally + scheduled with unified state, traffic no longer competes through + contention. The Concentrator controls arrival rates to each Remote Radio + Head, ensuring packets are transmitted at the precise moment they are + needed. + This deterministic scheduling replaces the probabilistic contention + that WMM attempts to optimize. + Consequently, the complex web of traditional QoS queues is rendered + obsolete; we replace "Priority" (deciding who waits) with "Isolation" + (ensuring no one waits). +

+
+ +
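The contrast with contention can be made concrete with a toy schedule builder: every frame receives an exact start time instead of rolling dice for the medium. The RRH names, queue contents, and the naive serialization policy are invented; a fixed 250 µs TXOP is assumed for illustration.

```python
TXOP_US = 250  # assumed fixed TXOP length in microseconds

def build_schedule(rrh_queues: dict, start_us: int = 0) -> list:
    """Serialize per-RRH frame queues into back-to-back, non-overlapping
    TXOPs: deterministic start times replace probabilistic contention."""
    schedule, t = [], start_us
    for rrh, frames in rrh_queues.items():
        for frame in frames:
            schedule.append((t, rrh, frame))  # (start_us, radio, frame)
            t += TXOP_US
    return schedule

plan = build_schedule({"rrh-1": ["dl-a", "dl-b"], "rrh-2": ["dl-c"]})
for slot in plan:
    print(slot)
# (0, 'rrh-1', 'dl-a'), (250, 'rrh-1', 'dl-b'), (500, 'rrh-2', 'dl-c')
```

A real Concentrator would interleave RRHs by RF group and deadline rather than draining queues in order; the point is only that "who transmits when" is computed, not contended.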

+ 2.8 Interactive Visualization: The MCS Collapse Under Load +

+ +

+ The following interactive simulation demonstrates the architectural + differences between Fi-Wi, autonomous APs, and mesh networks under varying + load conditions. It visualizes the + MCS State Graph discussed in Section 2.7, showing how + autonomous systems fail to navigate this state space under density. +

+ +

+ Each "room" represents a device with a 4 × 12 grid of MCS states (4 + spatial streams × 12 MCS indices). The + ghost node (dashed) shows the ideal state based on + channel quality, while the active node shows the actual + state selected by the rate control algorithm. +

+ +
+
+ +
+ Click anywhere to open interactive version ↗ +
+ +
+ +
+

How to Use the Simulation

+ +

Quick Start - Try These Scenarios:

+ + + +

Interactive Controls:

+ + + +

What to Watch For:

+ + +
+ +

Technical Details: Understanding the Visualization

+ +

+ MCS Grid: Each 4×12 grid shows all possible MCS states. + Top rows = Mu-MIMO (multi-user), bottom rows = standard 2×2 MIMO. Columns + = MCS index (0-11, higher = faster but needs better SNR). +

+ +

+ Eigenvalues (λ₁, λ₂): Strength of spatial modes in the + MIMO channel. As density increases in autonomous mode, λ₂ collapses → + spatial interference. +

+ +

+ Condition Number (κ): Ratio λ₁/λ₂ in dB. Low (~6 dB) = + good. High (>12 dB) = Mu-MIMO degraded to single-stream. This directly + demonstrates the "Spatial Contention Cascade" from Section 2.3.3. +

+ +

+ Collision Probability: Computed from the Birthday Paradox:
  n(n-1)/2 collision pairs. When the resulting aggregate collision
  probability exceeds 50%, the network enters "Drift" state with unbounded
  latency.

+ +

Why This Matters for Network Operators

+ +

+ This visualization proves the loss of control described + in Section 2.4. In autonomous mode, operators cannot engineer performance + because the system navigates a 1,000+ state MCS graph with no global + coordination. +

+ +

+ In Fi-Wi mode, the Concentrator's global state visibility allows it to: +

+ + + +

+ The result: predictable, engineerable performance that + scales with density instead of collapsing. The difference becomes visceral + when you watch autonomous mode turn red under the same load that Fi-Wi + handles in green. +

+ +
+ +

3. System Picture

+ +
+

+ System Diagram: Fi-Wi Concentrator, Central Packet Memory, and Multiple + RRHs +

+ +
+                        ┌────────────────────────────────────────────┐
+                        │              Fi-Wi Concentrator            │
+                        │────────────────────────────────────────────│
+   L4S/ECN-aware        │                                            │
+   traffic from LAN/    │   ┌────────────────────────────────────┐   │
+   WAN (IP/802.3)  ─────┼─▶│    Central Packet Memory & Queues  │   │
+                        │   │  • Per-flow / per-tenant queues    │   │
+                        │   │  • Per-airtime-domain queues       │   │
+                        │   │  • Enqueue timestamps (µs)         │   │
+                        │   └───────────────┬────────────────────┘   │
+                        │                   │                        │
+                        │   ┌───────────────▼────────────────────┐   │
+                        │   │   L4S/AQM & Scheduler              │   │
+                        │   │  • Sojourn-time based ECN marking  │   │
+                        │   │  • TXOP length control (≈250 µs)   │   │
+                        │   │  • RF grouping & spatial streams   │   │
+                        │   └───────────────┬────────────────────┘   │
+                        │                   │ PCIe over fiber        │
+                        └───────────────────┼────────────────────────┘
+                                            │
+        ┌───────────────────────────────────┼───────────────────────────────────┐
+        │                                   │                                   │
+        │                                   │                                   │
+┌───────▼─────────┐                ┌────────▼─────────┐                ┌────────▼─────────┐
+│   RRH #1        │                │   RRH #2         │                │   RRH #3         │
+│ (Thin MAC/PHY)  │                │ (Thin MAC/PHY)   │                │ (Thin MAC/PHY)   │
+│  • RF front end │                │  • RF front end  │                │  • RF front end  │
+│  • DFE + FFT    │                │  • DFE + FFT     │                │  • DFE + FFT     │
+│  • Minimal MAC  │                │  • Minimal MAC   │                │  • Minimal MAC   │
+│  • DMA engine   │                │  • DMA engine    │                │  • DMA engine    │
+│  • PTP sync     │                │  • PTP sync      │                │  • PTP sync      │
+└───────┬─────────┘                └────────┬─────────┘                └────────┬─────────┘
+        │                                   │                                   │
+        │                                   │                                   │
+        │                 PCIe-over-fiber links (no deep queues in RRHs)        │
+        │                                   │                                   │
+        │                                   │                                   │
+┌───────▼─────────┐                ┌────────▼────────┐                 ┌────────▼─────────┐
+│   RRH #4        │     ...        │   RRH #N        │                 │   Wi-Fi STAs     │
+│ (Thin MAC/PHY)  │                │ (Thin MAC/PHY)  │     (Rooms, AP-like cells, clients)│
+│  • RF front end │                │  • RF front end │                 │  • Phones        │
+│  • DFE + FFT    │                │  • DFE + FFT    │                 │  • Laptops       │
+│  • Minimal MAC  │                │  • Minimal MAC  │                 │  • IoT devices   │
+│  • DMA engine   │                │  • DMA engine   │                 │                  │
+│  • PTP sync     │                │  • PTP sync     │                 │                  │
+└─────────────────┘                └─────────────────┘                 └──────────────────┘
+  
+

+ Key properties: Central packet memory and queues live + entirely in the concentrator, where L4S-aware AQM and scheduling operate + on true bottleneck queues. RRHs are kept as simple hardware endpoints + (RF + minimal MAC + DMA + PTP), with no deep local buffering or + autonomous AP logic. This enables stable L4S behavior, explicit TXOP + control, and software-defined evolution of queueing and RF policies. +

+
+ +
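The "enqueue timestamps" and "sojourn-time based ECN marking" in the diagram can be sketched as follows. The 1 ms threshold and hard step are assumed simplifications; the DualQ Coupled AQM specified for L4S in RFC 9332 uses a probabilistic ramp rather than a step.

```python
from collections import deque

MARK_THRESHOLD_US = 1000  # assumed: mark CE when sojourn exceeds 1 ms

class CentralQueue:
    """Concentrator-side queue that stamps packets on enqueue so the
    AQM can mark on sojourn time (time spent queued), not queue length."""

    def __init__(self):
        self._q = deque()

    def enqueue(self, packet, now_us: int) -> None:
        self._q.append((now_us, packet))  # record enqueue timestamp (us)

    def dequeue(self, now_us: int):
        """Return (packet, ce_marked) based on measured sojourn time."""
        t_in, packet = self._q.popleft()
        return packet, (now_us - t_in) > MARK_THRESHOLD_US

q = CentralQueue()
q.enqueue("fresh", now_us=0)
q.enqueue("stale", now_us=0)
print(q.dequeue(now_us=500))    # ('fresh', False): sojourn 500 us
print(q.dequeue(now_us=2000))   # ('stale', True): sojourn 2000 us
```

Marking on sojourn time works only because the queue is the true bottleneck and the timestamps are trustworthy, which is exactly what centralizing the packet memory provides.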

3.1 Classical Stack vs. Fi-Wi (The C-RAN Shift)

+ +

+ To understand Fi-Wi, we must first unlearn the definition of an "Access + Point." +

+ +
+ Reality Check 1: The RRH is a Micro-Bridge, Not an Access Point
+ The industry treats the AP as a "Router on the Ceiling." Fi-Wi replaces + this with a + Tunneling Bridge. + + The Shift: The RRH does not "process" the network; it + "extends" it. It is a transparent pipe that bridges the airgap to the + fiber, leaving all decision-making to the central brain. +
+ +
+ Reality Check 2: Coordination vs. Control
+ Traditional "Centralized Controllers" (like Cisco/Aruba) provide + Coordination. They tell APs which channels to use or + which clients to kick, but the AP still decides exactly when to transmit + every packet. The "Control Loop" is still distributed.
+
+ Fi-Wi provides Control. The Concentrator does not + "suggest" a schedule; it executes it. It tells the RRH: + "Transmit these specific bytes at exactly microsecond T." There is no + disagreement, no race condition, and no distributed chaos. +
+ +

+ In a typical + controller-managed enterprise Wi-Fi deployment, a + centralized controller (e.g., Cisco WLC, Aruba Mobility Controller, + Ubiquiti UniFi Controller) coordinates AP configuration: channel + assignment, transmit power, client steering recommendations, and SSID + management. However, + each AP remains autonomous at the data plane: +

+ + + +

+ These systems are loosely-coupled: the controller manages + the control plane (configuration, policy) but the data plane — queuing, + MAC scheduling, aggregation, and packet forwarding — remains + distributed and autonomous across individual APs. +

+ +

+ In Umber Fi-Wi (C-RAN for Wi-Fi), we split the AP and + cellularize the RF domain, down to room-level. The concentrator sees all + flows, all queues, and all RRHs. The RRHs handle 802.11 MAC/PHY but are + tightly time-synchronized and behave as DMA-driven PHY/MAC endpoints + rather than autonomous APs. A set of RRHs and their shared queues form a + cellularized Wi-Fi domain within the building, often at + “cell per room” granularity. +

+ +

+ Fi-Wi centralizes both control plane AND data plane with + shared state across all RRHs. The concentrator doesn't just configure + RRHs; it directly manages their queues, schedules their TXOPs, and + maintains unified timestamp-synchronized state across the entire + cellularized RF domain. +

+ +

3.2 Dual-Loop Control Model

+ +

+ Conceptually, Fi-Wi decouples the system into two nested feedback loops, + separated by timescale: +

+ +
+ Outer loop (End-to-End Latency): [ L4S Sender ] ──(ms)──> [ Group Queue ] ──> [ Feedback (ECN) ]
+ Inner loop (MAC Efficiency):     [ Aggregation Buffer ] ──(µs)──> [ Airtime / PHY ]
+ +

+ The Outer Loop manages congestion and end-to-end latency + (Internet speed). The Inner Loop manages MAC efficiency + and radio timing (Airtime). +

+ +

+ The Problem with Legacy Wi-Fi: Traditional APs couple + these loops unpredictably, creating "sawtooth" latency patterns that + confuse TCP. +

+ +

+ The Fi-Wi Solution: By centralizing both loops in the + Concentrator, Fi-Wi enforces a strict + Time-Scale Separation. The Inner Loop runs so fast (3–5 + kHz) that it appears as "constant service" to the slower Outer Loop (10–20 + Hz), allowing L4S to stabilize perfectly. +

+ +

+ (See Section 5: Control Architecture for the + rigorous control-theoretic analysis and stability criteria.) +

+ +
+ +

4. Key Fi-Wi Mechanisms

+ +

4.1 Time Synchronization

+ +

+ Fi-Wi operates across two distinct time domains simultaneously. The first
  is the concentrator's internal master clock, disciplined via PTP/802.1AS
  over the PCIe fronthaul (detailed in Section 4.7). The second is the
  802.11 TSF (Timing Synchronization Function) domain that 802.11 clients use
  to coordinate with the MAC layer. In a traditional AP this relationship is
  trivial — the AP runs one TSF and one local clock. In Fi-Wi, with 24 RRHs each
  presenting a TSF-aware BSS, managing the relationship between them is a
  foundational architectural responsibility of the concentrator.

+ +

4.1.1 The Fronthaul Clock: PTP/802.1AS

+ +

+ The concentrator distributes its master clock to all attached RRHs,
  achieving synchronization on the order of microseconds (and substantially
  tighter when using PCIe-native timing mechanisms such as PTM — see Section
  4.7 for the full hardware chain). This master clock gives every packet:

+ + + +

+ This clock lives entirely inside the Fi-Wi domain. Clients never see it + directly. It is the coordinate system in which shim header timestamps + (Section 4.2), AQM marking decisions (Section 4.3), and the ML training + corpus (Section 15) are all expressed. Because all packet timestamps, + service events, and queue measurements are expressed in this single master + time domain, Fi-Wi can compute precise per-packet sojourn times + independent of the TSF domain, enabling stable ECN marking and L4S control + across the system. +

+ +

4.1.2 The 802.11 TSF Domain

+ +

+ The 802.11 TSF is a 64-bit microsecond counter maintained per BSS. Clients
  set their local TSF from beacons and use it to wake from power save at the
  right moment, to interpret TBTT (Target Beacon Transmission Time), and to
  coordinate TXOP timing. The TSF is the only clock the 802.11 standard
  exposes at the MAC layer.

+ +

+ In a traditional single-AP deployment this is trivial: one AP, one TSF, + one beacon stream. In Fi-Wi it is not. Consider a client in a room served + by two RRHs in the same airtime domain. That client will receive beacons + from both RRHs. If those beacons carry inconsistent TSF values, even small + inconsistencies can lead to misaligned power-save wakeups, ambiguous TBTT + interpretation, and in some implementations degraded performance or + reassociation. The coherence of the TSF domain across all RRHs in a BSS is + not optional; it is a hard correctness requirement. +

+ +

+ Fi-Wi satisfies this requirement by construction: + the concentrator generates all beacon frames. No RRH + constructs its own beacon. The concentrator writes the TSF value into + every beacon before dispatching it to the appropriate RRH for + transmission. Because all TSF values originate from the same source and + are derived from the same master clock, they are consistent by design + rather than by coordination protocol. Within a given BSS, TSF values are + identical across all participating RRHs; multiple TSF domains arise only + when multiple BSS instances are present. +
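The stamping step itself is mechanically simple. A hedged C sketch (the 8-byte little-endian Timestamp field immediately follows the 24-byte management MAC header, per the 802.11 beacon frame format; the function name and drift handling are illustrative, not taken from this codebase):

```c
#include <stdint.h>

#define MGMT_HDR_LEN 24  /* 802.11 management-frame MAC header length */

/* Map master-clock time into RRH i's TSF domain (Section 4.1.3) and
 * stamp the beacon. The Timestamp field is the first 8 bytes of the
 * beacon body, transmitted least-significant byte first.            */
static void stamp_beacon_tsf(uint8_t *frame, uint64_t t_master_us,
                             uint64_t epoch_us, int64_t drift_us)
{
    uint64_t tsf = (t_master_us - epoch_us) + (uint64_t)drift_us;
    for (int i = 0; i < 8; i++)
        frame[MGMT_HDR_LEN + i] = (uint8_t)(tsf >> (8 * i));
}
```

Because every beacon for a given BSS is stamped from the same epoch and drift terms, TSF consistency across RRHs falls out of the write path itself.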

+ +

4.1.3 The Concentrator as Time Origin

+ +

+ The concentrator maintains 25 simultaneous time references: its own + PTP-disciplined master clock and one 802.11 TSF per RRH. Each TSF has its + own epoch (established at BSS creation) and its own drift correction term, + derived from periodic synchronization updates over the fronthaul + (PTP/802.1AS or PCIe PTM), which bound long-term drift. The concentrator + knows the exact affine mapping between the master clock and every + client-visible TSF domain at all times: +

+ +
TSF_i(t) = (t_master - epoch_i) + drift_correction_i(t)
+
+ +

+ Any event — a packet enqueue, an ECN mark, a TXOP start, a beacon + transmission — can be expressed in any of the 25 frames without loss of + precision. This is the time-domain analog of a coordinate transformation: + the concentrator is the origin from which all other reference frames are + derived, and any event timestamp can be mapped between frames via a known, + invertible affine transform, updated continuously via the fronthaul + synchronization loop. +

+ +
+

Figure 4.1-1: The Concentrator as Time Origin

+
+Concentrator master clock (PTP-disciplined)
+  │
+  ├─ Master frame: all shim timestamps, sojourn times, AQM marks, ML labels
+  │
+  ├─ TSF_1:  epoch_1, drift_1(t)  →  beacon stream for RRH 1  ┐
+  ├─ TSF_2:  epoch_2, drift_2(t)  →  beacon stream for RRH 2  │ identical within
+  ├─ TSF_3:  epoch_3, drift_3(t)  →  beacon stream for RRH 3  │ a given BSS
+  │   ...                                                       ┘
+  └─ TSF_24: epoch_24, drift_24(t) → beacon stream for RRH 24
+
+Any event E has coordinates in all 25 frames simultaneously.
+Mapping between any two frames: affine transform, known at the concentrator,
+updated continuously via the fronthaul sync loop.
+    
+

+ The concentrator as the origin of 25 simultaneous time reference frames + (for a 24-RRH deployment). Client-visible TSF domains are derived from + the master clock via known affine transforms. Within a BSS, TSF values + are identical across participating RRHs. +

+
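The affine mapping between frames can be sketched directly. A minimal C illustration (structure and function names are hypothetical; drift is treated as piecewise-constant between fronthaul sync updates):

```c
#include <stdint.h>

/* Per-RRH clock relation: TSF_i(t) = (t_master - epoch_i) + drift_i(t). */
typedef struct {
    uint64_t epoch_us;   /* TSF epoch, set at BSS creation            */
    int64_t  drift_us;   /* correction term from the last sync update */
} tsf_frame_t;

static uint64_t master_to_tsf(uint64_t t_master, const tsf_frame_t *f)
{ return (t_master - f->epoch_us) + (uint64_t)f->drift_us; }

static uint64_t tsf_to_master(uint64_t tsf, const tsf_frame_t *f)
{ return (tsf - (uint64_t)f->drift_us) + f->epoch_us; }

/* Map an event stamped in frame a into frame b: an invertible
 * composition through the master frame — the "time origin".       */
static uint64_t tsf_to_tsf(uint64_t tsf_a, const tsf_frame_t *a,
                           const tsf_frame_t *b)
{ return master_to_tsf(tsf_to_master(tsf_a, a), b); }
```

Every mapping passes through the master frame, which is why the concentrator's clock is the origin rather than merely one peer among 25.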
+ +
+ Why Distributed APs Cannot Do This
+ +

+ In a controller-managed AP deployment, each AP runs its own TSF + independently. The controller can nudge APs toward a common time + reference via 802.11v BSS Transition Management or out-of-band NTP, but + it does not generate beacon frames — each AP does. This means TSF values + across APs can diverge by the inter-AP sync error (typically tens to + hundreds of microseconds with Ethernet-based PTP, more without it). +

+ +

+ A client roaming between two such APs may see a TSF discontinuity at + handoff. Power-save state, TBTT alignment, and any MAC-layer timing + assumption the client holds must be renegotiated. In Fi-Wi, roaming + between RRHs within the same concentrator domain is a TSF-transparent + event: the client's TSF counter simply continues, because the new RRH's + beacon carries the same TSF value the old one would have carried at that + moment. The client does not know a handoff occurred at the MAC layer. +

+
+ +

+ This unified time model also enables the concentrator to schedule + transmissions across RRHs against a single global timeline, rather than + relying on independent per-RRH contention processes. TSF continuity across + RRH handoffs is a direct consequence of centralized beacon generation, and + it is what makes Fi-Wi's active redundancy claims in Section 8 + operationally credible: per-packet steering between RRHs is transparent to + clients because the client's MAC-layer time reference never changes. This + unified time model enables not only precise measurement, but coordinated + control of transmission behavior across RRHs, as described in Section + 4.1.4. +

+ +

4.1.4 Time-Driven EDCA Orchestration

+ +

+ The unified time model described above is not only a measurement + framework; it is the foundation for Fi-Wi's centralized MAC scheduling. In + conventional 802.11 deployments, EDCA (Enhanced Distributed Channel + Access) operates as a stochastic contention mechanism: each AP + independently selects random backoff values within its CWmin/CWmax range, + and medium access emerges probabilistically. +

+ +

+ In Fi-Wi, EDCA is not treated as a distributed random process. It is + treated as a centrally orchestrated actuation layer, + driven by the concentrator's master time reference. +

+ +

Because the concentrator maintains:

+ + + +

+ it can shape medium access behavior across RRHs by dynamically controlling + EDCA parameters on a per-radio basis. The key parameters are: +

+ + + +

+ By assigning narrowly bounded contention windows and staggered AIFS values + across RRHs, the concentrator can bias contention outcomes such that one + RRH is overwhelmingly likely to win access at a given moment. Rotating + these parameters over time creates a + soft time-division multiplexing (TDM) effect using + standard EDCA semantics. +

+ +

+ This transformation is only possible because all RRHs share a common time + reference. The concentrator can schedule EDCA parameter updates relative + to the master clock and ensure that all RRHs apply them in a coordinated + manner. Without this shared time base, independent EDCA processes would + quickly decorrelate and revert to stochastic contention. +

+ +

Conceptually, the concentrator executes a scheduling loop:

+ +
for each scheduling interval:
+  observe queue state across RRHs        // centralized visibility
+  select next RRH (or RF group) to serve // queue-aware decision
+  assign EDCA parameters (CWmin, CWmax, AIFS, TXOP)
+  enforce timing relative to master clock // coordinated application
+
+ +
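A more concrete rendering of this scheduling loop, as a hedged C sketch — the per-RRH state layout and the EDCA values are illustrative assumptions, not tuned parameters from this document:

```c
#include <stdint.h>

/* Hypothetical per-RRH state visible to the concentrator. */
typedef struct {
    uint32_t qlen_pkts;        /* group-queue depth, observed centrally */
    uint8_t  cwmin, cwmax, aifsn;
    uint16_t txop_limit_us;
} rrh_state_t;

/* One scheduling interval: bias EDCA so the most-backlogged RRH is
 * overwhelmingly likely to win the next contention round while the
 * others defer — a soft-TDM rotation, not strict TDMA.             */
static int schedule_interval(rrh_state_t *rrh, int n)
{
    int winner = 0;
    for (int i = 1; i < n; i++)
        if (rrh[i].qlen_pkts > rrh[winner].qlen_pkts) winner = i;

    for (int i = 0; i < n; i++) {
        if (i == winner) {             /* near-immediate access */
            rrh[i].cwmin = 1;   rrh[i].cwmax = 3;
            rrh[i].aifsn = 2;   rrh[i].txop_limit_us = 4000;
        } else {                       /* back off this interval */
            rrh[i].cwmin = 63;  rrh[i].cwmax = 255;
            rrh[i].aifsn = 7;   rrh[i].txop_limit_us = 0;
        }
    }
    return winner;  /* all RRHs apply at the same master-clock edge */
}
```

The coordinated-application step (all RRHs switching parameters at the same master-clock edge) is the part distributed controllers cannot reproduce, as the following box explains.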

+ The result is not strict TDMA — 802.11 contention semantics are preserved + and the system remains compliant with standard client behavior — but the + distribution of outcomes is shaped by the concentrator. Over short time + horizons, access becomes highly predictable and service intervals can be + bounded. This has two critical consequences: +

+ + + +

+ Because TSF values are consistent across RRHs, these scheduling decisions + are MAC-transparent to clients. From the client's perspective, the network + behaves as a single, coherent AP with stable timing characteristics, even + as transmissions are steered across multiple physical radios. +

+ +
+ Why Distributed AP Systems Cannot Replicate This
+ +

+ Controller-based Wi-Fi systems can configure EDCA parameters on + individual APs, but they cannot coordinate their application in time + with sufficient precision. Each AP maintains its own clock, its own + contention process, and its own transmit queues. +

+ +

+ Without a shared time origin and centralized queue visibility, EDCA + remains a probabilistic mechanism. Attempts to tune contention + parameters across APs produce statistical bias at best, not + deterministic scheduling. The lack of a unified time domain prevents + coordinated rotation of access privileges across radios. +

+ +

+ Fi-Wi's ability to treat EDCA as a controllable scheduling primitive is + a direct consequence of the concentrator's role as both the time origin + and the sole owner of transmit queues. +

+
+ +

+ This time-driven EDCA orchestration is the mechanism by which Fi-Wi + converts the inherently stochastic 802.11 MAC into a + predictable, centrally scheduled system — completing the + chain from time synchronization through queue observability to stable L4S + control. +

+ +

4.2 Fi-Wi Shim Header

+ +

+ Between 802.3/IP and the fronthaul link we add a small internal metadata + header. Conceptual form: +

+ +
struct FiWiMeta {
+  uint64_t seq;          // fronthaul sequence number
+  uint64_t t_ingress_us; // time packet enqueued into group queue (central DRAM)
+  uint32_t txop_id;      // TXOP this MSDU is in
+  uint8_t  mpdu_idx;     // index within aggregate
+  uint8_t  mpdu_cnt;     // total MPDUs in this aggregate
+  uint8_t  ecn_flags;    // CE applied? which queue? reason bits
+  uint32_t qlen_pkts;    // queue depth snapshot at TXOP start
+};
+
+

This header is visible only inside the Fi-Wi domain. It lets us:

+ + + +
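As one illustration of what the shim header enables, a hedged C sketch computing the tail sojourn time of a TXOP's aggregate (the struct mirrors `FiWiMeta` above; the helper function is hypothetical):

```c
#include <stdint.h>

/* Mirrors the conceptual FiWiMeta struct above (Section 4.2). */
typedef struct {
    uint64_t seq;
    uint64_t t_ingress_us;   /* enqueue time, master clock domain */
    uint32_t txop_id;
    uint8_t  mpdu_idx, mpdu_cnt, ecn_flags;
    uint32_t qlen_pkts;
} fiwi_meta_t;

/* Tail (worst-case) sojourn across the MPDUs of one TXOP: because both
 * timestamps live in the concentrator's master time domain (Section
 * 4.1.1), this is a plain subtraction — no clock translation needed. */
static uint64_t txop_tail_sojourn_us(const fiwi_meta_t *m, int n,
                                     uint64_t t_txop_start_us)
{
    uint64_t worst = 0;
    for (int i = 0; i < n; i++) {
        uint64_t s = t_txop_start_us - m[i].t_ingress_us;
        if (s > worst) worst = s;
    }
    return worst;
}
```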

4.3 AQM / L4S Marking Placement

+ +

+ We choose the group queues in the concentrator—each + corresponding to a cellularized airtime domain shared by one RRH or by + multiple interfering RRHs—as the only places where deep queues + are allowed and where we apply ECN: +

+ + + +

+ Other queues (within RRH hardware, on the fiber/fronthaul link) are kept
  shallow via pacing and controlled descriptor posting. The group queue thus
  becomes the sole bottleneck in each cellularized
  airtime domain, which is exactly what L4S wants: a small number of stable,
  well-behaved bottlenecks with known behavior. The control policy is
  explicitly tuned to keep both average and
  tail queueing delay low.
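A minimal sketch of the per-packet marking decision, in the spirit of L4S immediate (step) marking on sojourn time — the threshold constant is illustrative, not a tuned value from this document:

```c
#include <stdint.h>
#include <stdbool.h>

/* Illustrative shallow target for the L4S queue (~1 ms). */
#define L4S_STEP_THRESH_US 1000

/* ECN marking at the group queue: mark CE when the packet's sojourn
 * time in the central queue exceeds the threshold. Both timestamps
 * come from the shim header (Section 4.2) and are expressed in the
 * master time domain (Section 4.1.1).                              */
static bool mark_ce(uint64_t t_ingress_us, uint64_t t_dequeue_us)
{
    uint64_t sojourn_us = t_dequeue_us - t_ingress_us;
    return sojourn_us >= L4S_STEP_THRESH_US;
}
```

A production AQM (e.g., a DualQ-style coupled design) would add a ramp and coupling to classic traffic; the point here is only that marking keys off sojourn time measured at the single central bottleneck.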

+ +

4.4 Centralized Packet Memory and DMA

+ +
+ DMA (Direct Memory Access): Why RRHs Can Be Simple
+ +

+ The Standard AP Architecture: Traditional Wi-Fi chips + already use DMA to move packets from host memory to the radio without + CPU involvement. But they require a local CPU to create + descriptors, manage buffers, and run the network stack. Every AP is a + complete computer running millions of lines of Linux. +

+ +

The Fi-Wi Innovation: DMA Over Distance (not RDMA)

+ +

+ Fi-Wi extends the PCIe bus over fiber, allowing the RRH's DMA engine to + read and write remote memory in the Concentrator. To + the RRH silicon, memory 100 meters away appears "local"—accessible with + the same PCIe transactions a traditional Wi-Fi chip uses to access DRAM + 10 millimeters away on the motherboard. +

+ +

+ Result: The local CPU, local DRAM, and entire Linux + stack can be eliminated. The RRH becomes a pure "micro-bridge"—just DMA + + MAC/PHY logic. +

+ +

The Silicon Cost Difference:

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
ComponentTraditional APFi-Wi RRH
+ MAC/PHY Silicon
+ (802.11 Radio Logic) +
+ ~15-20M gates
+ MIMO, error correction, etc.
+ Complexity dictated by physics +
+ ~15-20M gates
+ Same physics, same complexity
+ No savings here +
+ Host SoC / CPU
+ (The "Brains") +
+ ~50-100M gates
+ Multi-core ARM CPU
+ DDR4 controller
+ Peripherals, caches, etc. +
+ ~100K-500K gates
+ Simple DMA state machine
+ Descriptor buffer only
+ 100-1000x simpler +
DRAM + 256MB - 1GB DDR4
+ (Required for OS + buffers) +
+ 16-64KB SRAM
+ (Descriptor storage only) +
Operating System + Linux (millions of LOC)
+ Requires security patches +
+ None
+ Zero software attack surface +
Total Silicon~70-120M gates~15-20M gates
+ +

Direct Implications:

+ + + +

The Economic Model:

+ +

+ Traditional Architecture: 50 APs = 50 CPUs, 50 DRAM modules, 50 power + supplies, 50 Linux installations, 50 security update cycles. +

+ +

+ Fi-Wi Architecture: 1 powerful Concentrator (workstation-class) + 50 + simple RRHs (DMA + radio only). +

+ +

+ Total system cost is lower because you're paying for + intelligence once, not 50 times. +

+ +

Why Incumbents Cannot Do This:

+ +

+ Traditional AP vendors have already optimized their SoC designs—the CPU, + DRAM controller, and peripherals are as efficient as they can be. But + their architecture requires these components at every radio + because each AP operates autonomously. Even if they wanted to simplify, + the distributed control model forces complexity at the edge. +

+ +

+ Fi-Wi's centralized architecture enables the per-radio simplification. + This is a structural cost advantage, not a manufacturing + efficiency. + Replicating it would require incumbents to abandon their entire product + line and business model—a classic Innovator's Dilemma. +

+ +

+ Bottom Line: C-RAN works because + silicon economics favor centralized intelligence. The + gate count difference isn't cosmetic—it's the foundation of Fi-Wi's + cost, power, and reliability advantages. +

+
+ +

In Fi-Wi, packet memory is centralized in the concentrator:

+ + + +
+ Central DRAM (Fi-Wi Concentrator)
+ ──────────────────────────────────
+ Group queue A → RRH1, RRH2  (shared RF cell)
+ Group queue B → RRH3        (isolated cell)
+ Group queue C → RRH4–RRH7   (shared RF cell)
+ ...
+ Queues live centrally; RRHs are DMA clients draining those queues into airtime.
+ +

This design:

+ + + +

4.5 RRH Edge Control via Beacon Power Shaping

+ +

+ Because the Fi-Wi concentrator maintains shared state for + the entire RF domain, it can directly control the + RF footprint of each RRH by adjusting per-RRH beacon + transmit power. This alters: +

+ + + +

+ Beacon power is one of the most effective tools for + dynamic RF cell shaping because it affects STA + association and roaming decisions without modifying data-plane PHY rates. + By lowering beacon power at certain RRHs and raising it at others, the + concentrator can: +

+ + + +

+ Traditional controller+AP systems attempt similar behavior but lack true + shared state because each AP maintains its own queueing and PHY + decisions. In Fi-Wi, beacon shaping is coordinated with: +

+ + + +

+ This makes beacon power a first-class control variable in defining and + stabilizing the boundaries of each cellularized RF domain. +
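One plausible shaping step, sketched in C — the hysteresis thresholds, step size, and power bounds are illustrative assumptions, not values from this document:

```c
#include <stdint.h>

/* Illustrative per-RRH beacon power bounds (dBm). */
#define PWR_MIN_DBM  5
#define PWR_MAX_DBM 20

/* Shrink the RF footprint of an overloaded RRH and grow that of a
 * lightly loaded one, one step per control interval. Because the
 * concentrator owns the group queues, qlen_pkts is the true load
 * signal, not an estimate reported by an autonomous AP.           */
static int8_t shape_beacon_power(int8_t pwr_dbm, uint32_t qlen_pkts,
                                 uint32_t lo_thresh, uint32_t hi_thresh)
{
    if (qlen_pkts > hi_thresh && pwr_dbm > PWR_MIN_DBM) pwr_dbm -= 1;
    if (qlen_pkts < lo_thresh && pwr_dbm < PWR_MAX_DBM) pwr_dbm += 1;
    return pwr_dbm;   /* applied coherently via the master clock */
}
```

The hysteresis band (no change while `lo_thresh ≤ qlen ≤ hi_thresh`) is what keeps cell boundaries stable rather than oscillating.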

+ +

4.6 Fronthaul Requirements and Feasibility

+ +

+ The Fi-Wi architecture requires deterministic, low-latency fronthaul links + between the concentrator and RRHs. Because RRHs function as DMA engines + accessing centralized packet memory (Section 4.4), Umber's implementation + uses PCIe (PCI Express) over fiber rather than Ethernet. + This section quantifies bandwidth, latency, and jitter requirements, and + demonstrates that PCIe over fiber not only meets these requirements but + provides superior performance compared to network-based alternatives. +

+ +

4.6.1 Why PCIe Over Fiber?

+ +

+ The choice of PCIe over fiber instead of Ethernet is driven by the Fi-Wi + architectural model: +

+ +

+ RRHs as DMA engines: Each RRH directly reads packet + descriptors from concentrator DRAM, fetches packet data, and writes + received packets back to memory. This is native PCIe behavior—exactly how + a network card or storage controller operates. +

+ +

+ Latency advantage: PCIe avoids the network stack + entirely: +

+ + + +

+ Determinism: PCIe provides guaranteed bandwidth + allocation and predictable latency through: +

+ + + +

+ Simplicity: The RRH sees the concentrator's memory space + directly. No protocol translation, no socket APIs, no network + configuration. +

+ +

4.6.2 PCIe Bandwidth Requirements

+ +

Each RRH requires bandwidth for:

+ +

1. Downlink packet DMA (concentrator → RRH)

+ +

+ For an RRH serving one or more STAs with aggregate capacity + Ceff: +

+ +
+BWDL = Ceff · (1 + OHdesc)                    (4.1)
+
+

+ where OHdesc accounts for DMA descriptors, metadata, and PCIe + TLP (Transaction Layer Packet) overhead (typically 10-20%). +

+ +

+ Example: For Ceff = 600 Mbps (typical 802.11ax + 2×2 MIMO) with OHdesc = 0.15: +

+ +
+BWDL = 600 · 1.15 = 690 Mbps
+
+

2. Uplink packet DMA (RRH → concentrator)

+ +

+ Typically symmetric or slightly higher than downlink due to ACKs and + control frames: +

+ +
+BWUL ≈ BWDL · 1.1 ≈ 760 Mbps                   (4.2)
+
+

3. CSI and status updates

+ +

+ Channel State Information and MAC statistics are written to concentrator + memory via PCIe: +

+ +
+BWCSI = Nsta · Nsc · Ntx · Nrx · Bsample · fCSI    (4.3)
+
+

+ For Nsta=4, Nsc=234, Ntx=2, + Nrx=2, Bsample=24 bits, fCSI=50 Hz: +

+ +
+BWCSI = 4.49 Mbps per RRH
+
+

4. Control and command traffic (concentrator → RRH)

+ +

+ Configuration updates, timing sync corrections, power/channel commands: +

+ +
+BWcontrol ≈ 1-5 Mbps per RRH                         (4.4)
+
+

Total bidirectional bandwidth per RRH:

+ +
+BWtotal = BWDL + BWUL + BWCSI + BWcontrol           (4.5)
+BWtotal ≈ 690 + 760 + 4.5 + 2 = 1456 Mbps ≈ 1.5 Gbps
+
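Equations (4.1)–(4.5) can be checked with a few lines of C (a sketch; the function names are ours, not from any codebase):

```c
/* Eq. (4.3): raw CSI telemetry rate, result in Mbps. */
static double bw_csi_mbps(int n_sta, int n_sc, int n_tx, int n_rx,
                          int b_sample_bits, int f_csi_hz)
{
    return (double)n_sta * n_sc * n_tx * n_rx
         * b_sample_bits * f_csi_hz / 1e6;
}

/* Per-RRH fronthaul budget, Eqs. (4.1)-(4.5). All rates in Mbps. */
static double bw_total_mbps(double c_eff, double oh_desc,
                            double bw_csi, double bw_ctrl)
{
    double dl = c_eff * (1.0 + oh_desc);  /* (4.1) downlink DMA        */
    double ul = dl * 1.1;                 /* (4.2) uplink + ACK traffic */
    return dl + ul + bw_csi + bw_ctrl;    /* (4.5) total per RRH       */
}
```

Plugging in the worked values above (Ceff = 600 Mbps, OHdesc = 0.15, 4 STAs of 2×2 CSI at 50 Hz, 2 Mbps control) reproduces the ~1.5 Gbps per-RRH figure.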
+

4.6.3 PCIe Link Configuration

+ +

PCIe bandwidth is determined by generation and lane count:

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
PCIe GenPer-Lane Ratex1 Linkx4 Linkx8 Link
Gen 3~8 GT/s + ~985 MB/s
+ (7.88 Gbps) +
+ ~3.94 GB/s
+ (31.5 Gbps) +
+ ~7.88 GB/s
+ (63 Gbps) +
Gen 4~16 GT/s + ~1.97 GB/s
+ (15.75 Gbps) +
+ ~7.88 GB/s
+ (63 Gbps) +
+ ~15.75 GB/s
+ (126 Gbps) +
Gen 5~32 GT/s + ~3.94 GB/s
+ (31.5 Gbps) +
+ ~15.75 GB/s
+ (126 Gbps) +
+ ~31.5 GB/s
+ (252 Gbps) +
+ +

+ Note: Effective bandwidth accounts for 128b/130b encoding (Gen 3+) and + protocol overhead. +

+ +

RRH link sizing: For 1.5 Gbps per RRH requirement:

+ + + +

+ A single PCIe Gen 3 x1 lane is sufficient per RRH with substantial + headroom. +

+ +

4.6.4 Concentrator PCIe Topology

+ +

+ The concentrator must aggregate multiple RRH connections. Consider a + 50-RRH deployment: +

+ +

Total aggregate bandwidth requirement:

+ +
+BWaggregate = NRRH · BWtotal                      (4.6)
+BWaggregate = 50 · 1.5 Gbps = 75 Gbps (peak)
+
+

With 40% average utilization (typical for building-wide traffic):

+ +
+BWtypical = 75 · 0.40 = 30 Gbps
+
+

Architecture Options:

+ +

Option 1: PCIe switch fabric

+ + + +

Option 2: Multi-host server (Dual Socket)

+ + + +
+ Option 3: The Fi-Wi Choice — Workstation-Class Single-Socket
+ To achieve perfect determinism, Fi-Wi standardizes on + High-End Desktop (HEDT) / Workstation silicon (e.g., AMD + Threadripper Pro or Intel Xeon W-3400 series). + + This "Goldilocks" topology enables the + Non-Blocking Architecture detailed in + Section 13. +
+ +

4.6.5 PCIe Over Fiber: Physical Layer

+ +

+ Standard PCIe uses copper traces on motherboards (limited to ~30cm at Gen + 3/4 speeds). To reach RRHs distributed throughout a building, PCIe signals + are carried over fiber using optical transceivers. +

+ +

Technologies:

+ +

1. Active Optical Cables (AOC)

+ + + +

2. Optical PCIe adapter cards

+ + + +

3. PCIe fabric extenders

+ + + +

+ Recommended approach for Fi-Wi: Optical PCIe adapter + cards with standard fiber infrastructure, providing flexibility and + leveraging commodity fiber installation. +

+ +

4.6.6 Latency Analysis

+ +

PCIe over fiber latency components:

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
ComponentLatency
PCIe TLP formation (concentrator)0.2-0.5 µs
Optical transceiver (TX)0.1-0.3 µs
Fiber propagation (100m)0.5 µs
Optical transceiver (RX)0.1-0.3 µs
PCIe TLP processing (RRH)0.2-0.5 µs
PCIe switch (if used)0.1-0.3 µs per hop
Total one-way1.2-2.4 µs
Round-trip (DMA read)2.4-4.8 µs
+ +

Comparison to Ethernet:

+ + + + + + + + + + + + + + + + + + + + + + + + + +
Fronthaul TypeRound-Trip LatencyDeterminism
PCIe over fiber2.4-4.8 µsExcellent (credit-based)
10GbE (cut-through)10-30 µsGood (with QoS)
10GbE (store-forward)20-100 µsFair (subject to congestion)
+ +

+ PCIe over fiber provides 5-10× lower latency than even
  optimized Ethernet, which is critical for the inner control loop (Appendix
  B) operating at 200-500 µs timescales.

+ +

4.6.7 Jitter and Determinism

+ +

+ PCIe's credit-based flow control eliminates congestion drops and provides + deterministic latency: +

+ + + +

+ Measured jitter: PCIe over fiber typically exhibits + <50 ns jitter, well under the 200 ns budget for 1 µs time + synchronization (Section 4.1). +

+ +

+ This determinism is impossible to achieve with Ethernet without + time-sensitive networking (TSN) extensions, which add complexity and cost. +

+ +

4.6.8 Distance Limitations

+ +

+ PCIe over fiber distance depends on optical budget and signal integrity: +

+ + + + + + + + + + + + + + + + + + + + + + + + + +
PCIe GenMulti-Mode FiberSingle-Mode Fiber
Gen 3 (8 GT/s)300 m10 km
Gen 4 (16 GT/s)100 m2-10 km
Gen 5 (32 GT/s)50-100 m2 km
+ +

+ Fi-Wi requirement: Building-scale deployments require + ≤100 m reach, easily achieved with Gen 3/4 over multi-mode fiber or any + generation over single-mode fiber. +

+ +

4.6.9 Cost Analysis

+ +

PCIe over fiber cost per RRH:

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
ComponentCost (approx.)
RRH-side PCIe optical adapter$150-300
Fiber pair (50m installed)$50-100
Optical transceiver pair$50-100
PCIe switch port allocation$100-200
Total per RRH$350-700
+ +

Comparison to network alternatives:

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
ApproachCost per RRHLatencyDeterminism
PCIe over fiber$350-7002-5 µsExcellent
10GbE + TSN$300-60010-30 µsGood
Standard 10GbE$200-40020-100 µsFair
+ +

+ PCIe over fiber costs moderately more than standard Ethernet but delivers + 5-10× better latency and superior determinism. For Fi-Wi's DMA-based + architecture, this cost is justified by the performance and architectural + simplicity gains. +

+ +

+ For context: a typical enterprise AP costs $500-2000, and a cellular small + cell costs $1000-5000. The fronthaul cost is comparable to or less than + the radio cost difference, making it economically viable. +

+ +

4.6.10 Alternative: Hybrid PCIe + Ethernet

+ +

+ For deployments where PCIe over fiber infrastructure is unavailable, a + hybrid approach is possible: +

+ + + +

+ This reduces PCIe bandwidth requirements (only packet data, not + CSI/control) and allows leveraging existing Ethernet infrastructure for + non-latency-critical traffic. +

+ +

+ However, the pure PCIe approach is architecturally cleaner and avoids the + complexity of dual-protocol RRH implementation. +

+ +

4.6.11 Comparison to Cellular Fronthaul Standards

+ +

For context, cellular systems use:

+ +

CPRI (Common Public Radio Interface):

+ + + +

eCPRI (Enhanced CPRI) / Fronthaul Gateway:

+ + + +

Fi-Wi (PCIe over fiber):

+ + + +

+ Fi-Wi's functional split and PCIe transport provides a unique balance: + lower bandwidth than CPRI, lower latency than eCPRI, and native + integration with the DMA-based architecture. +

+ +

4.6.12 Summary: PCIe Over Fiber Enables Fi-Wi Architecture

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
RequirementTargetAchieved with PCIe Gen 3 x1
Bandwidth per RRH~1.5 Gbps✓ 7.88 Gbps (5× margin)
Aggregate (50 RRH)~30 Gbps avg✓ PCIe switch or multi-CPU
Round-trip latency<10 µs✓ 2.4-4.8 µs
Jitter<200 ns✓ <50 ns (credit-based)
Distance≤100 m✓ 300m MM / 10km SM
DeterminismNo drops, predictable✓ Credit-based flow control
Cost per RRH<$700✓ $350-700
+ +

Why PCIe over fiber is the right choice for Fi-Wi:

+ +
+
+  1. Native DMA model: RRHs are DMA engines—PCIe is the natural transport
+  2. Lowest latency: 2-5 µs vs. 10-100 µs for Ethernet
+  3. Perfect determinism: Credit-based flow control eliminates jitter and drops
+  4. Architectural simplicity: No network stack, no protocol translation
+  5. Proven technology: Used in HPC, storage (NVMe-oF), and telecom
+

+ The deterministic, sub-5-microsecond fronthaul is what enables Fi-Wi's + centralized control, time synchronization, and single-bottleneck queueing + architecture. Unlike Wi-Fi mesh, controller-based systems with + over-the-air backhaul, or even Ethernet-based approaches, PCIe over fiber + provides the predictable substrate needed for the control loops described + in Appendices A and B to operate with the precision required for + sub-millisecond tail latency control. +

+ +
+

+ 4.7 Precision Clock Synchronization over Fronthaul +

+ +

+ The "cellularization" of Wi-Fi relies on a unified timebase. In the + Fi-Wi architecture, time is not merely used for logging; it is a + control variable. To achieve coordinated scheduling, + accurate queue measurements, and seamless mobility, every RRH must share + a precise understanding of "now" down to the microsecond level. +

+ +

+ To achieve this, Fi-Wi establishes a strict + Hierarchical Clock Tree over the PCIe fronthaul, + leveraging the native determinism of the bus rather than the best-effort + nature of packet switching. +

+ +

4.7.1 The Concentrator as Grandmaster (GM)

+ +

+ The Fi-Wi Concentrator acts as the + PTP Grandmaster (IEEE 1588v2 / 802.1AS) for the entire + building. It houses the primary reference oscillator (typically a + high-stability OCXO). +

+ + + +
+

Diagram 4-2: The Fi-Wi Clock Tree Topology

+ +
+          External Reference (Optional GPS/GNSS)
+                       │
+                       ▼
+    ┌──────────────────────────────────────────────┐
+    │            Fi-Wi Concentrator                │
+    │    [ High-Stability Oscillator (OCXO) ]     │ ◄── Grandmaster (GM)
+    │           (System Timebase t0)               │
+    └──────────────────┬───────────────────────────┘
+                       │ PCIe PTM / Hardware Sync
+                       │ (Compensates for fiber flight time)
+          ┌────────────┼─────────────┐
+          ▼            ▼             ▼
+    ┌───────────┐ ┌───────────┐ ┌───────────┐
+    │   RRH 1   │ │   RRH 2   │ │   RRH 3   │      ◄── Slaves
+    │ [LocalOsc]│ │ [LocalOsc]│ │ [LocalOsc]│
+    │  Locked   │ │  Locked   │ │  Locked   │
+    └─────┬─────┘ └─────┬─────┘ └─────┬─────┘
+          │             │             │
+          ▼             ▼             ▼
+     Frequency-Coordinated Operation
+    
+
+ +

4.7.2 What Clock Synchronization Actually Enables

+ +

+ A defining advantage of the Fi-Wi architecture is the use of "Hard + Synchronization" via PCIe, rather than "Soft Synchronization" via + Ethernet. While Ethernet-based APs rely on IEEE 1588 PTP, they are + subject to switch jitter and software stack latency. PCIe over fiber + eliminates these variables. +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
FeatureFi-Wi (PCIe over Fiber)Traditional APs (Ethernet)
Protocol + PCIe PTM (Precision Time Measurement)
+ Hardware-native, bus-level messages +
+ IEEE 1588 PTP
+ Packet-based, software/firmware stack +
Sync Accuracy + 20-50 nanoseconds
+ Bus cycle precision + fiber margin +
+ 100ns – 10µs
+ Highly dependent on network load +
Jitter Source + Minimal
+ Point-to-point hardware flow control +
+ High
+ Switch queuing & software interrupt latency +
CPU Overhead + Zero
+ Handled entirely by PCIe PHY/Controller +
+ Moderate to High
+ CPU must interrupt to process sync packets +
Primary Benefits + Accurate L4S timestamps, TSF synchronization, unified timeline for + clients + Basic time sync for logging and management
+ +

+ Important Note: While frequency-locked clocks provide + excellent timing consistency, they do not enable RF phase control or + coordinated simultaneous transmission. COTS Wi-Fi chips have independent + RF synthesizers with arbitrary phase offsets that cannot be controlled + externally. The value of clock synchronization lies in accurate + timestamping for L4S queue management and consistent TSF counters for + seamless client mobility, not in RF phase alignment. +

+ +

4.7.3 Operating Modes: GPS-Disciplined vs. Free-Wheeling

+ +

+ The Concentrator's clock behavior depends on the deployment environment + and regulatory requirements. There are two distinct modes of operation: +

+ +
Mode A: GPS-Disciplined (Absolute Synchronization)
+ +

+ In this mode, the Concentrator is connected to an external GNSS + (GPS/Galileo) receiver. The internal oscillator is disciplined to align + with UTC (Coordinated Universal Time). This connects + the internal timing of the Fi-Wi system to external absolute time. +

+ +
Mode B: Free-Wheeling (Relative Synchronization)
+ +

+ In deep indoor environments (basements, bunkers) where GPS is + unavailable, or cost-sensitive deployments where 6 GHz AFC is not + required, the Concentrator operates in + Free-Wheeling mode. +

+ +
+ The Engineering Reality: Timing Consistency vs. Absolute Time
+ For dynamic RRH selection and coordinated scheduling, what matters is + consistent timing across RRHs, not absolute UTC accuracy. As long as all + RRHs maintain synchronized TSF counters relative to the Concentrator, + the system can provide seamless mobility and accurate queue + measurements—even if the system's concept of "UTC" is drifting by + seconds per year relative to atomic time.
+
+ Because all RRHs are frequency-locked to the same Concentrator + oscillator, if the Concentrator drifts, the entire system drifts in + unison. This uniform time base enables coordinated operation without + requiring external time references for basic functionality. +
+ +

4.7.4 When Absolute Time Becomes Mandatory

+ +

+ While Free-Wheeling mode is sufficient for core system operation, + GPS-Disciplined (Absolute) mode becomes mandatory when + the Fi-Wi system interacts with external systems that require UTC + timestamps: +

+ +
    +
  1. + 6 GHz AFC (Automated Frequency Coordination): To + operate at Standard Power in the 6 GHz band + (essential for outdoor or large-venue coverage), the FCC requires the + system to check a central database for incumbent microwave links. The + database operates on UTC. The Concentrator must sign its request with + a precise, absolute timestamp and geolocation. A drifting clock will + cause the AFC request to be rejected, forcing the system into Low + Power Indoor (LPI) mode. +
  2. + +
  3. + Inter-Concentrator Handoffs (Multi-Building Roaming): + In a campus environment with two distinct Concentrators (e.g., + Building A and Building B), a client roaming between them may + experience time jumps. If Concentrator A and B are free-wheeling + independently, their timestamps may differ by seconds. This jump can + break high-level security protocols (like Kerberos or 802.1X + re-authentication) that reject "replay attacks" based on timestamp + windows. +
  4. + +
  5. + Correlated Debugging: If a user reports a + connectivity drop at 10:04 AM, but the Concentrator has drifted by 45 + seconds, the system logs will be stamped 10:04:45. Correlating Fi-Wi + logs with client-side logs (which are usually synced to NTP/Cellular + time) becomes operationally difficult, complicating root-cause + analysis. +
  6. +
+ +

4.7.5 RRH Clock Distribution Hardware

+ +

+ Standard enterprise APs utilize free-running crystal oscillators with + ~20 ppm frequency error. This causes TSF counters to drift relative to + each other, making seamless mobility difficult. To achieve the timing + consistency required for Fi-Wi's coordinated operation, the RRH hardware + architecture must be fundamentally different. +
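A back-of-envelope check of the drift claim above (a sketch; the ±20 ppm figure is from the text, the worst-case pairing of one fast and one slow oscillator is an assumption):

```python
# Worst-case relative drift between two free-running ±20 ppm oscillators:
# one runs 20 ppm fast, the other 20 ppm slow (assumed worst-case pairing).
ppm = 20e-6
relative_error = 2 * ppm                  # 40 ppm between the two TSF counters
drift_us_per_s = relative_error * 1e6     # microseconds of divergence per second
```

At 40 µs of divergence per second, two uncoordinated TSF counters fall outside tight scheduling tolerances within seconds, which is why the fronthaul-recovered clock is required.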

+ +

+ The Fi-Wi Solution: The RRH hardware uses + Mobile-Class Wi-Fi Silicon (which natively supports + external clock inputs) driven by a + Fronthaul-Recovered Precision Clock. +

+ +
+

Diagram 4-3: RRH Precision Clock Distribution Chain

+ +
+┌──────────────────────────────────────────────────────────────────────────────┐
+│                        RRH CLOCK DISTRIBUTION ARCHITECTURE                   │
+└──────────────────────────────────────────────────────────────────────────────┘
+
+        [ PCIe Over Fiber ]
+                 │
+                 │ (1) PTM Timestamps (Implicit Clock)
+                 ▼
+   ┌─────────────────────────────┐
+   │      RRH FPGA / Retimer     │
+   │   (Clock Recovery Circuit)  │
+   └─────────────┬───────────────┘
+                 │
+                 │ (2) "Dirty" Recovered Clock (High Jitter)
+                 ▼
+   ┌─────────────────────────────┐           ┌─────────────────────────────┐
+   │    JITTER ATTENUATOR IC     │           │    WI-FI 7 SOC (Client)     │
+   │    (e.g., Si5395 / LMK05)   │           │                             │
+   │                             │           │                             │
+   │   ┌─────────────────────┐   │           │    ┌───────────────────┐    │
+   │   │  Digital Servo Loop │   │ (3) Clean │    │   Internal PLL    │    │
+   │   │      (DSPLL)        │───┼───────────┼───►│ (RF Synthesizer)  │    │
+   │   └─────────────────────┘   │ 40 MHz    │    └─────────┬─────────┘    │
+   │                             │ Reference │              │              │
+   └─────────────────────────────┘           └──────────────┼──────────────┘
+                                                            │
+                                                            ▼
+                                                   [ 5 GHz / 6 GHz ]
+                                                   [ RF Carrier    ]
+                                                   (Independent phase per RRH)
+    
+

+ Signal Flow: The RRH recovers a noisy clock from the + PCIe fronthaul. A digital Jitter Attenuator cleans the signal using an + internal DSP servo loop. This provides the ultra-low phase noise + reference required for 4096-QAM while maintaining frequency lock to + the Concentrator's timebase. Note: The Wi-Fi chip's internal PLL + establishes its own RF carrier phase, which is independent across + RRHs. +

+
+ +

The clock distribution chain operates as follows:

+ +
    +
  1. + Concentrator (Grandmaster): Distributes the master + timebase via PTM packets over the PCIe-over-fiber link. +
  2. + +
  3. + RRH FPGA / Retimer: Recovers the implicit clock from + the PCIe bitstream or explicit PTM timestamps. +
  4. + +
  5. + Network Synchronizer (Jitter Attenuator): +
      +
    • + Component: e.g., Silicon Labs Si5395 or TI LMK05318. +
    • + +
    • + Function: Feeds the "dirty" recovered clock digitally + into this dedicated IC. +
    • + +
    • + Cleaning: The IC uses an internal, narrow-bandwidth DSP + servo loop to filter out PCIe transport jitter, synthesizing a + pristine 40 MHz reference. +
    • +
    +
  6. + +
  7. + Wi-Fi SoC (Client SKU): The cleaned signal is fed + directly into the chip's Ext_Ref / + XO_IN pin. The chip's internal PLLs lock to this external + frequency reference, ensuring consistent TSF counter operation across + all RRHs. +
  8. +
+ +
+ Architectural Decision: Digital Holdover vs. Voltage Control
+ Fi-Wi uses a Digital Network Synchronizer rather than a + traditional VCTCXO servo loop. In a VCTCXO design, any noise on the + analog control voltage line translates directly into phase noise, which + degrades 4096-QAM EVM. By using digital jitter attenuation, the control + loop remains in the digital domain until final synthesis, ensuring + ultra-low phase noise while providing superior holdover stability if the + fiber link flickers. +
+ +

4.7.6 Why Mobile Wi-Fi SKUs?

+ +

+ Fi-Wi explicitly selects + Mobile/Client Wi-Fi 7 chipsets (e.g., Qualcomm + FastConnect or Broadcom BCM43xx client series) rather than traditional + Enterprise AP SKUs. This choice is driven by specific architectural + needs: +

+ + + +

4.7.7 What Clock Synchronization Does NOT Enable

+ +

+ It is important to understand the limitations of frequency-locked clocks + with COTS Wi-Fi hardware: +

+ + + +
+ Key Insight: The frequency-locked clock discipline + ensures that TSF counters increment synchronously across all RRHs. This + enables consistent timing for seamless mobility and accurate + queue measurements—but does not enable RF phase control or coordinated + simultaneous transmission. Those capabilities would require custom ASIC + development with externally-controllable RF synthesizers, which is + beyond the scope of COTS Wi-Fi chipsets. +
+
+ +
+ +

5. Control Architecture: The Dual-Integrator System

+ +

+ A rigorous control-theoretic analysis of Wi-Fi reveals a fundamental + challenge: there are not one, but + two distinct integrators in the transmit path. In + traditional autonomous APs, these integrators are coupled in undefined + ways, leading to instability (bufferbloat) and poor interaction with TCP + congestion control. Fi-Wi explicitly separates these integrators, applies + distinct control laws to each, and enforces a strict + Time-Scale Separation to guarantee system stability. +

+ +

5.1 The Two Integrators

+ +

+ To achieve stability, we must model and control two distinct accumulation + processes: +

+ +
    +
  1. + The Outer Integrator (Group Queue): Located in the + Concentrator. This accumulates packets based on the mismatch between + arriving traffic (internet speed) and the wireless link capacity. It + operates on the RTT timescale (milliseconds). +
  2. + +
  3. + The Inner Integrator (Aggregation Buffer): Located + logically between the Concentrator and RRH. This accumulates packets to + build 802.11 A-MPDU aggregates for PHY efficiency. It operates on the + TXOP timescale (hundreds of microseconds). +
  4. +
+ +

5.2 The Outer Loop: L4S and Group Queue Dynamics

+ +

+ The primary bottleneck managed by the AQM (Active Queue Management) is the + Group Queue. This loop drives the end-to-end congestion + control (L4S/TCP). +

+ +

5.2.1 Queue Dynamics

+ +

+ The queue depth Q(t) evolves based on the mismatch between the arrival + rate λ(t) and the effective service rate μ(t): +

+ +
+dQ/dt = λ(t - τ_fwd) - μ(t)
+
+

5.2.2 The PI² Control Law

+ +

+ Fi-Wi uses a PI² controller to calculate a marking probability p(t),
+ targeting a shallow queue reference Q_ref (typically 200 µs). This
+ provides a coherent signal to L4S senders:

+ +
+p(t) = K_alpha * (Q(t) - Q_ref) + K_beta * ∫ (Q(t) - Q_ref) dt
+
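A minimal sketch of the control law above. The gains K_ALPHA and K_BETA and the clamping are illustrative placeholders, not Fi-Wi's tuned values; the 200 µs reference and 5 ms update interval are taken from the design parameters in this section:

```python
# Sketch of the outer-loop PI controller computing the marking probability.
Q_REF = 200e-6       # queue reference: 200 microseconds
DT = 5e-3            # update interval: ~1 RTT (5 ms)
K_ALPHA = 0.25       # proportional gain (illustrative)
K_BETA = 2.5         # integral gain (illustrative)

class PI2Controller:
    def __init__(self):
        self.integral = 0.0

    def update(self, q_delay: float) -> float:
        """Return the marking probability for the current interval."""
        error = q_delay - Q_REF
        self.integral += error * DT
        p = K_ALPHA * error + K_BETA * self.integral
        return min(max(p, 0.0), 1.0)   # clamp to a valid probability

ctl = PI2Controller()
# A queue sitting above target drives the marking probability upward.
p1 = ctl.update(400e-6)
p2 = ctl.update(400e-6)
```

A queue persistently above Q_ref accumulates integral action, so marking pressure keeps rising until senders back off.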
+
+ Concept Shift: AQM vs. Active Rate Management (ARM)
+ +

+ Traditional congestion control relies on + Active Queue Management (AQM): a queue must physically + build up before the network detects congestion and signals the sender to + slow down. The goal is to manage the queue size. +

+ +

+ L4S enables a new paradigm called + Active Rate Management (ARM). +

+ + + +

+ Reference: Koen De Schepper, "Understanding Latency 4.0", December + 2025.
+ Watch the explanation (19:15) +

+
+ +

5.3 The Inner Loop: MAC Aggregation and TXOPs

+ +

+ The Inner Loop manages the trade-off between PHY + efficiency (large aggregates) and latency (small aggregates). In + traditional APs, this integrator is effectively unbounded to maximize + benchmark scores, creating a "sawtooth" latency pattern that confuses TCP. +

+ +

Fi-Wi bounds this integrator via two mechanisms:

+ + + +

5.4 System Integration: Time-Scale Separation

+ +

+ For the nested loops to remain stable, the Inner Loop must look like + "constant service" to the Outer Loop. This requires the Inner Loop + bandwidth (ωmac) to be significantly higher than the Outer Loop bandwidth (ωtcp): +

+ +
+ω_mac >> ω_tcp   (typically > 20:1 ratio)
+
+

5.4.1 Frequency Domain Constraint

+ +

+ By forcing the MAC to operate at a frequency of 3–5 kHz (via 250 µs + TXOPs), the aggregation noise is pushed high enough that it is naturally + filtered out by the TCP loop (which operates at 10–20 Hz). +
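The separation criterion can be checked directly from the figures above (250 µs TXOP cadence against the upper end of the 10–20 Hz TCP loop):

```python
# Time-scale separation: MAC service frequency vs. TCP control frequency.
txop_s = 250e-6
f_mac = 1 / txop_s          # 4 kHz MAC service frequency
f_tcp = 20                  # upper end of the 10-20 Hz TCP loop
ratio = f_mac / f_tcp       # 200:1, far beyond the 20:1 criterion
```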

+ +

+ 5.4.2 A-MPDU Aggregation Coherence and ECN Marking Precision +

+ +

+ The 250 µs TXOP constraint serves a dual purpose: it maintains time-scale + separation and ensures L4S receives coherent ECN feedback. + Traditional Wi-Fi's massive A-MPDU aggregation creates a fundamental + mismatch between Layer 2 efficiency and Layer 3 control precision. +

+ +

The Aggregation-Feedback Mismatch

+ +

+ In wide-channel deployments (160 MHz), APs build large A-MPDU aggregates + containing dozens of IP packets to amortize MAC overhead. This creates + three control-loop pathologies: +

+ + + +

Fi-Wi's Coherence Strategy

+ +

Fi-Wi resolves this through coordinated design:

+ +
    +
  1. + 40 MHz Channel Width: Narrower channels require smaller + aggregates, naturally increasing MAC service frequency. More frequent + transmissions with smaller payloads ensure sojourn time measurement + occurs at packet granularity. +
  2. + +
  3. + Concentrator-Level ECN Marking: The Concentrator + performs sojourn time measurement and ECN marking + before handing packets to RRHs for PHY transmission, preserving + microsecond-level queueing visibility. +
  4. + +
  5. + Bounded TXOP Duration: The 250 µs maximum ensures MAC + service frequency remains >10× higher than L4S control frequency (~1 + RTT), enabling senders to interpret ECN marks as smooth probability + signals rather than discrete bursts. +
  6. +
+ +

+ This approach maintains the benefits of A-MPDU efficiency while preserving + the feedback coherence L4S requires. The result: DualQ can sustain its + ~1ms target drain time without artificial inflation from aggregate + assembly delays. For detailed analysis, see Appendix I.7. +

+ +

5.4.3 Design Parameters for Stability

+ +

+ Fi-Wi uses these parameters to ensure the system remains critically + damped: +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
LoopParameterTarget ValueRationale
OuterQueue Reference200 µsMaintains ultra-low queuing delay.
OuterUpdate Interval5 ms (~1 RTT)Matches typical control loop frequency.
InnerTarget TXOP250 µsEnsures ωmac >> ωtcp.
InnerMax Aggregate32 MSDUsLimits tail latency contribution.
+ +
+ +

6. Airtime Domains and Dynamic Queue Grouping

+ +

+ In Fi-Wi, the core rule is: + there is one deep queue per independent airtime resource. + The physical queue lives in concentrator memory, but it represents the + airtime of one RRH or a dynamic group of RRHs whose RF + signals are coupled strongly enough to behave like a single cell. +

+ +

+ If two RRHs can interfere, they cannot transmit simultaneously and + therefore must share a single logical queue. If RRHs are RF-isolated, each + receives its own queue. This preserves the “one bottleneck per control + loop” structure required by L4S. +

+ +

6.1 Why airtime determines queue structure

+ +

+ Service at each queue corresponds to over-the-air transmission. Any RRHs + that share RF space must share a service process and therefore share a + queue. RRHs that do not interfere have independent airtime and get + independent queues. +

+ +
+ Concentrator Queues (central DRAM, cellularized domains)
+ ────────────────────────────────────────────────────────
+ Queue A (airtime domain A)
+ ├── RRH1
+ └── RRH2
+ Queue B (airtime domain B)
+ └── RRH3
+ Queue C (airtime domain C)
+ ├── RRH4
+ ├── RRH5
+ ├── RRH6
+ └── RRH7
+ Queue D (airtime domain D)
+ └── RRH8
+ +

6.2 Forming airtime groups dynamically

+ +

+ Crucially, these RF groups and their queues are + not static. The concentrator forms and maintains airtime + domains dynamically using: +

+ + + +

+ Beyond simple interference, Fi-Wi’s groupings also consider the + spatial structure of the channels: +

+ + + +

Over time, the Fi-Wi system continuously adjusts:

+ + + +

+ Groups may merge if interference appears or split if RRHs become + effectively isolated (e.g., after a channel change or power adjustment, + including beacon power shaping). The AQM and ECN marking logic always runs + at the current group queue, so L4S always sees a single, + well-defined bottleneck per cellularized domain. +
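One way to realize the merge/split behavior described above is to recompute airtime domains as connected components of an interference graph. This is a sketch under a simplified assumption: RRH names and the pairwise "interferes" report format are illustrative, whereas a real concentrator would derive edges from CSI, RSSI, and beacon reports:

```python
from collections import defaultdict

def airtime_domains(rrhs, interference_pairs):
    """Group RRHs so that any pair that can interfere shares one queue.

    interference_pairs: iterable of frozensets {a, b} (hypothetical format,
    derived in practice from CSI/RSSI/beacon telemetry).
    """
    adj = defaultdict(set)
    for pair in interference_pairs:
        a, b = tuple(pair)
        adj[a].add(b)
        adj[b].add(a)
    seen, domains = set(), []
    for rrh in rrhs:
        if rrh in seen:
            continue
        group, stack = set(), [rrh]
        while stack:                      # depth-first flood fill
            node = stack.pop()
            if node in group:
                continue
            group.add(node)
            stack.extend(adj[node] - group)
        seen |= group
        domains.append(group)
    return domains

links = {frozenset({"RRH1", "RRH2"}),
         frozenset({"RRH4", "RRH5"}),
         frozenset({"RRH5", "RRH6"})}
doms = airtime_domains(["RRH1", "RRH2", "RRH3", "RRH4", "RRH5", "RRH6"], links)
# RRH3 is RF-isolated and gets its own queue; RRH4-6 couple into one domain.
```

Re-running this after each telemetry update naturally merges groups when interference appears and splits them when RRHs become isolated.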

+ +

+ Because all RRHs expose real-time CSI, queue metrics, retry statistics, + airtime usage, and beacon reports into the concentrator’s shared state, + Fi-Wi can form RF groups that are tuned not just for coverage but for: +

+ + + +

6.3 Room-Level RRH Density (FTTR-Class Deployment)

+ +

+ Fi-Wi is not designed around a small number of big AP cells per floor. The + architecture assumes something much closer to + Fiber-to-the-Room (FTTR): one cell per room, + with fiber or equivalent deterministic fronthaul feeding small RRHs in + each room. +

+ +

+ In higher-end deployments, each room can contain + multiple RRHs (e.g., 2–4 per room) to support: +

+ + + +
+ Room-level Fi-Wi layout (conceptual)
+
+        [Fi-Wi Concentrator]
+                  │  Fiber / fronthaul
+     ┌────────────┼────────────┬────────────┐
+     │            │            │            │
+  Room 1       Room 2       Room 3       Room 4
+     │            │            │            │
+  RRH1..4      RRH5..8      RRH9..12     RRH13..16
+ (2–4/rm)     (2–4/rm)     (2–4/rm)     (2–4/rm)
+ +

+ This density dramatically improves RF control. With RRHs separated by just + a few meters, the concentrator sees: +

+ + + +
+ Within a single room (example: 4 RRHs)
+
+ Ceiling plan (top view)
+ ───────────────────────
+
+   RRH-A       RRH-B
+     ●-----------●
+     |           |
+     |           |
+     ●-----------●
+   RRH-C       RRH-D
+
+ All four RRHs feed central queues with shared state and CSI.
+ +

+ Traditional AP-based architectures cannot achieve this cleanly because + they lack shared state and maintain separate, isolated + queues and PHY/MAC processes in each AP. Even with a central controller, + they are limited to heuristic steering and static power/channel tweaks. +

+ +

Fi-Wi, by contrast:

+ + + +

+ A cell-per-room architecture makes Fi-Wi fundamentally different from + controller-based Wi-Fi: it behaves more like + cellular small cells with centralized coordination than + like a set of autonomous APs. +

+ +
+ +

7. Queue Architecture for Fi-Wi

+ +

+ Fi-Wi centralizes packet memory, queueing, AQM, and TXOP scheduling inside + the concentrator. Because the concentrator is the true bottleneck for all + wireless transmissions, Fi-Wi can use a clean, minimal queue structure + that behaves predictably under load and exposes stable delay semantics to + L4S congestion controllers. This stands in contrast to traditional APs, + where dozens of hidden queues (per-station, per-TID, firmware rings, + retry/BA windows, PS-poll buffers, rate-control queues) produce variable + and unobservable queueing delay. +

+ +

+ This section describes Fi-Wi’s queue architecture, why WMM priority + becomes largely unnecessary, and how centralized TXOP scheduling + eliminates the stochastic contention that drives Wi-Fi collapse in legacy + systems. The goal is simple: a minimal number of queues, explicit queue + semantics, and predictable latency for all traffic classes. +

+ +

7.1 Why queue architecture matters

+ +

+ Because all packets live in the concentrator’s memory until the moment + they are transmitted over the air, Fi-Wi can explicitly control: +

+ + + +

+ This allows Fi-Wi to do what distributed APs cannot: construct a + consistent, visible bottleneck queue that L4S congestion controllers can + lock onto with stable behavior. +

+ +

+ 7.2 The theoretical case: L4S makes most priority obsolete +

+ +

+ If queue delay is capped around 500 µs, legacy WMM categories provide + little additional value. For example, consider a voice stream: +

+ +
+Voice codec:             80 bytes every 20 ms (32 kbps)
+Transmit time at 1 Gbps: ~0.64 µs
+L4S queue target:        500 µs
+Voice latency budget:    ~150,000 µs
+
+Queue share: 500 / 150,000 = 0.3%
+
+

+ If L4S keeps queueing delay under ~500 µs, then all traffic — + including voice — stays far inside its latency budget. WMM’s role in + combatting bufferbloat disappears when bufferbloat itself is removed. +
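The budget arithmetic can be recomputed directly, assuming the 80-byte/20 ms codec framing, the 1 Gbps link, and the budgets given in the example:

```python
# Voice-budget arithmetic from the example above.
frame_bytes = 80
frame_interval_s = 20e-3
bitrate_bps = frame_bytes * 8 / frame_interval_s   # 32 kbps payload rate
tx_time_s = frame_bytes * 8 / 1e9                  # serialization at 1 Gbps
queue_share = 500e-6 / 150e-3                      # L4S target / voice budget
```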

+ +

7.3 Practical complications

+ +

Three real-world issues motivate a cautious design:

+ +

• UDP does not respond to ECN

+ +

Voice and video often use UDP. They:

+ + + +

+ Fi-Wi can mitigate this using + per-flow fair queuing inside the L4S queue, keeping UDP + in check without needing a separate WMM hierarchy. +
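A sketch of such per-flow fairness using deficit round-robin inside the L4S queue; the 1500-byte quantum and the flow/packet representation are illustrative assumptions, not Fi-Wi's actual scheduler:

```python
from collections import deque, defaultdict

QUANTUM = 1500  # bytes of service credit per flow per round (illustrative)

class FairL4SQueue:
    def __init__(self):
        self.flows = defaultdict(deque)   # flow_id -> queued packet sizes
        self.deficit = defaultdict(int)

    def enqueue(self, flow_id, size):
        self.flows[flow_id].append(size)

    def dequeue_round(self):
        """Serve each backlogged flow up to its deficit; return (flow, size)s."""
        served = []
        for fid in list(self.flows):
            self.deficit[fid] += QUANTUM
            q = self.flows[fid]
            while q and q[0] <= self.deficit[fid]:
                size = q.popleft()
                self.deficit[fid] -= size
                served.append((fid, size))
            if not q:                     # flow drained: reset its credit
                del self.flows[fid]
                self.deficit[fid] = 0
        return served

q = FairL4SQueue()
for _ in range(4):
    q.enqueue("udp", 1500)                # unresponsive bulk UDP
q.enqueue("voice", 80)                    # small voice frame
served = q.dequeue_round()
```

Even with a backlogged unresponsive UDP flow, the small voice frame is served in the same round, so the non-responsive flow cannot monopolize the queue.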

+ +

• Airtime vs. queue time

+ +
+Total latency = Queue delay + Contention delay + TX delay + Retry delay
+                ^^^^^^^^^^^
+             L4S controls this
+
+

+ WMM historically manipulates AIFS, CW, and TXOP to reduce contention + delay. Fi-Wi eliminates contention entirely using + centralized TXOP scheduling, so WMM’s airtime hacks lose + relevance. +

+ +

• Failure modes and defense-in-depth

+ +

Even L4S can fail under:

+ + + +

+ Hence, Fi-Wi benefits from a small amount of priority separation, at least + in early deployments. +

+ +

7.4 Minimal 3-queue structure

+ +

+ The theoretically sufficient minimal queue architecture + for Fi-Wi is three queues: +

+ + + +
+

Figure 7-1: Minimal 3-Queue Fi-Wi Architecture

+ +
+                    ┌──────────────────────────────────────────┐
+                    │               Concentrator               │
+                    │ (Central Packet Memory • AQM • TXOP)     │
+                    └──────────────────────────────────────────┘
+                                   ▲
+                                   │
+                     ┌─────────────┼──────────────────┐
+                     │             │                  │
+                     │             │                  │
+            ┌────────┴───┐   ┌─────┴─────┐    ┌───────┴──────┐
+            │ Q_mgmt     │   │ Q_L4S     │    │ Q_classic    │
+            │ (Strict    │   │ (ECT(1),  │    │ (ECT(0),     │
+            │  priority) │   │  dual-Q)  │    │  classic)    │
+            └──────┬─────┘   └─────┬─────┘    └───────┬──────┘
+                   │               │                  │
+                   └───────────────┼──────────────────┘
+                                   │
+                          TXOP Scheduler
+                  (Build AMPDU • Select RRH • 200–250µs)
+                                   │
+         ┌─────────────────────────┼──────────────────────────┐
+         │                         │                          │
+     ┌───▼───┐               ┌─────▼─────┐              ┌─────▼─────┐
+     │  RRH1 │               │   RRH2    │              │   RRH3    │
+     │ (PHY) │               │  (PHY)    │              │  (PHY)    │
+     └───────┘               └───────────┘              └───────────┘
+
+

+ The minimal Fi-Wi queue architecture contains a strict-priority + management queue plus dual-queue L4S (L4S + Classic). All buffering + lives in the concentrator; RRHs keep no deep queues. L4S senders see a + clean single-bottleneck model, and all 802.11 management frames bypass + AQM entirely for correctness. +

+
+ +

+ In this design, WMM is unnecessary at the wireless + bottleneck. All data traffic benefits from the same controlled queue + delay, and fairness is enforced by per-flow scheduling rather than EDCA. +
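A schematic dequeue policy for the three queues. This is a sketch: the simple weighted alternation stands in for the coupled DualQ mechanism, and the 3:1 weighting is an illustrative placeholder:

```python
from collections import deque

class MinimalQueues:
    def __init__(self):
        self.q_mgmt = deque()      # 802.11 management frames, bypass AQM
        self.q_l4s = deque()       # ECT(1) traffic
        self.q_classic = deque()   # ECT(0) / classic traffic
        self._l4s_credit = 0

    def dequeue(self):
        if self.q_mgmt:                        # strict priority for management
            return self.q_mgmt.popleft()
        # Weighted alternation between L4S and Classic (illustrative stand-in
        # for the coupled DualQ scheduler, which couples via marking).
        if self.q_l4s and (self._l4s_credit < 3 or not self.q_classic):
            self._l4s_credit += 1
            return self.q_l4s.popleft()
        self._l4s_credit = 0
        if self.q_classic:
            return self.q_classic.popleft()
        return self.q_l4s.popleft() if self.q_l4s else None

q = MinimalQueues()
q.q_mgmt.append("mgmt-frame")
q.q_l4s.extend(["l4s-1", "l4s-2"])
q.q_classic.append("classic-1")
```

Management frames always drain first; data then alternates so neither the L4S nor the Classic queue can starve the other.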

+ +

7.5 Pragmatic 5-queue structure

+ +

+ A more conservative deployment uses five queues per + airtime domain: +

+ +
    +
  1. + Qmgmt — Management & control (strict + priority) +
  2. + +
  3. + QL4S-hi — High-priority L4S (voice, control) +
  4. + +
  5. + Qclassic-hi — High-priority classic (legacy + VoIP) +
  6. + +
  7. + QL4S-be — L4S best-effort (bulk QUIC/TCP) +
  8. + +
  9. + Qclassic-be — Classic best-effort (legacy + devices) +
  10. +
+ +
+

+ Figure 7-2: Pragmatic 5-Queue Fi-Wi Architecture (Defense-in-Depth) +

+ +
+                  ┌───────────────────────────────────────────┐
+                  │               Concentrator                │
+                  │   (Central Packet Memory • AQM • TXOP)    │
+                  └───────────────────────────────────────────┘
+                                        ▲
+                                        │   Five logical queues per airtime domain
+        ┌───────────────┬───────────────┼──────────────────┬───────────────────┐
+        │               │               │                  │                   │
+  ┌─────┴──────┐  ┌─────┴────┐  ┌───────┴──────┐  ┌────────┴────────┐  ┌───────┴──────┐
+  │ Q_mgmt     │  │ Q_L4S-hi │  │ Q_classic-hi │  │ Q_L4S-be        │  │ Q_classic-be │
+  │ (priority) │  │ (Voice)  │  │ (Legacy VoIP)│  │ (Bulk TCP/QUIC) │  │ (Legacy bulk)│
+  └─────┬──────┘  └─────┬────┘  └───────┬──────┘  └────────┬────────┘  └───────┬──────┘
+        │               │               │                  │                   │
+        └───────────────┴───────────────┼──────────────────┴───────────────────┘
+                                        │
+                                 TXOP Scheduler
+                   (Build AMPDU • Select RRH • Delay Targets)
+                                        │
+             ┌──────────────────────────┼──────────────────────────┐
+             │                          │                          │
+         ┌───▼───┐                ┌─────▼─────┐              ┌─────▼─────┐
+         │  RRH1 │                │   RRH2    │              │   RRH3    │
+         │ (PHY) │                │   (PHY)   │              │   (PHY)   │
+         └───────┘                └───────────┘              └───────────┘
+
+

+ The 5-queue design provides a two-tier priority system across L4S and + Classic traffic. This conservative architecture offers compatibility + with legacy UDP voice/video, while still keeping Fi-Wi’s centralized L4S + semantics intact. Over time, deployments can collapse from 5 queues to 3 + as performance data validates the simpler model. +

+
+ +

7.6 Numerical examples

+ +

+ Consider 10 simultaneous HD video calls (~20 Mbps total) plus a + saturating background TCP flow: +

+ +

Legacy WMM:

+ + + +

Fi-Wi with L4S + fair queuing:

+ + + +

+ This is roughly 1000× lower queueing latency than legacy + WMM systems, and it applies to all traffic, not only traffic in a + “priority” AC. +

+ +

7.7 Deployment strategy

+ +

Fi-Wi can phase its queue structure over time:

+ + + +

Metrics to monitor include:

+ + + +

7.8 WMM support in Fi-Wi

+ +

WMM exists to correct three historical problems in distributed Wi-Fi:

+ + + +

Fi-Wi removes the root causes of these behaviors:

+ + + +

+ Because of this, full WMM support at the air bottleneck is + not necessary. However, Fi-Wi does support WMM + semantics for: +

+ + + +

Fi-Wi handles WMM as an admission-time mapping:

+ + + +

+ This preserves compatibility while avoiding the complexity and + unpredictability of EDCA-based priority systems. Over time, Fi-Wi + deployments can rely on pure L4S semantics and collapse WMM to a + compatibility shim, not a required scheduling mechanism. +
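The admission-time mapping can be sketched as a simple classification table. The UP-to-queue table and queue names below are illustrative assumptions layered on the 5-queue model, not a normative mapping:

```python
# Hypothetical WMM User Priority -> internal queue mapping at admission time.
UP_TO_QUEUE = {
    7: "Q_mgmt",       6: "Q_mgmt",          # network control
    5: "Q_l4s_hi",     4: "Q_l4s_hi",        # voice / video
    3: "Q_l4s_be",     0: "Q_l4s_be",        # best effort
    2: "Q_classic_be", 1: "Q_classic_be",    # background
}

def classify(user_priority: int, ect1: bool) -> str:
    """Map (WMM UP, ECT(1) marking) to an internal Fi-Wi queue."""
    q = UP_TO_QUEUE.get(user_priority, "Q_classic_be")
    # Traffic not marked ECT(1) falls through to the Classic queues.
    if not ect1 and q.startswith("Q_l4s"):
        q = q.replace("l4s", "classic")
    return q
```

WMM thus survives only as a one-shot classification at the ingress, never as an EDCA contention mechanism at the air interface.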

+ +

7.9 Summary

+ +

Fi-Wi’s centralized queue architecture enables:

+ + + +

+ Traditional Wi-Fi uses WMM to work around bufferbloat and contention. + Fi-Wi removes those problems entirely through tight queue control, shared + state, and central scheduling. Priority becomes a policy choice — not a + crutch for an unstable MAC. +

+ +

+ In Fi-Wi, the Carve-Out ensures the voice packet (L4S) bypasses the + accumulated Classic bulk data completely. The file download continues to + saturate the link, but the + latency of the L4S flow is decoupled from the load of the Classic + flow. +

+ + + +

8. RRH-Level Active Redundancy

+ +

+ Fi-Wi’s centralized shared state across RRHs makes it natural to treat + multiple radios as an active redundant set for the same + STA or room. This is analogous in spirit to 802.11be’s + Multi-Link Operation (MLO), where a single multi-link + device (MLD) can use multiple links for reliability and capacity. In + Fi-Wi, the concentrator is the coordination point leveraging shared state, + and the RRHs are the distributed radios providing multiple RF paths. +

+ +

8.1 Uplink: Duplicate Reception & Diversity

+ +

+ In many deployments, a client STA will be audible at more than one RRH + (overlapping coverage). On the uplink, Fi-Wi exploits this spatial + diversity to improve reliability without requiring changes to the client. +

+ +
    +
  1. + Multi-Point Reception: Multiple RRHs may receive the + same MPDU from a transmitting STA, potentially at different SNR/MCS + levels. +
  2. + +
  3. + Forwarding: Each RRH decodes the frame locally. If the + Frame Check Sequence (FCS) passes, the RRH timestamps the frame (using + the shared global timebase), attaches metadata (RSSI, SNR, Channel State + Information), and forwards it to the Concentrator via the + PCIe-over-Fiber link. +
  4. + +
  5. + Post-Detection Selection: Effectively, the Concentrator + acts as a Post-Detection Selection Diversity combiner: + +
  6. +
+ +

+ This approach leverages the spatial diversity of distributed RRHs to + mitigate shadowing and multipath fading. Because the selection logic + operates on valid MAC frames (after FCS verification) rather than raw I/Q + samples, this architecture maintains compatibility with standard COTS + Wi-Fi silicon at the Radio Head. +
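The selection-diversity combiner can be sketched as deduplication keyed on the station and 802.11 sequence number, keeping the best-SNR copy. The field names and metadata format are illustrative:

```python
from dataclasses import dataclass

@dataclass
class RxFrame:
    sta: str          # transmitting station
    seq: int          # 802.11 sequence number
    snr_db: float     # per-RRH receive quality (from attached metadata)
    rrh: str          # which RRH forwarded this copy
    payload: bytes

class DiversityCombiner:
    """Post-detection selection diversity: one winner per (sta, seq)."""
    def __init__(self):
        self.best = {}          # (sta, seq) -> RxFrame

    def offer(self, f: RxFrame) -> None:
        key = (f.sta, f.seq)
        cur = self.best.get(key)
        if cur is None or f.snr_db > cur.snr_db:
            self.best[key] = f   # keep the higher-quality duplicate

c = DiversityCombiner()
c.offer(RxFrame("sta1", 42, 18.0, "RRH1", b"data"))
c.offer(RxFrame("sta1", 42, 25.5, "RRH2", b"data"))   # duplicate, better SNR
```

Only FCS-valid frames ever reach `offer()`, so the combiner works purely on decoded MAC frames, consistent with COTS silicon at the RRH.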

+ +
+ Uplink redundancy
+
+                            STA
+                          ╱     ╲  (same frame)
+              RRH1                       RRH2
+                │                          │
+                └──► Fi-Wi Concentrator ◄──┘
+                     (dedup + select)
+ +

8.2 Downlink: per-packet steering

+ +

+ On the downlink, the concentrator can treat multiple RRHs as candidate + transmitters for a given STA or room: +

+ + + +

This gives Fi-Wi:

+ + + +
+ Group Queue (airtime domain A)
+ ──────────────────────────────
+   │
+   ├─► RRH1 TXOPs to STA
+   └─► RRH2 TXOPs to STA (backup or parallel)
+
+ Concentrator chooses RRH per TXOP based on CSI + load + shared state.
+ +

+ 8.2.1 Listen-Before-Talk (LBT) and RRH Eligibility for Downlink Scheduling +

+ +

+ In a multi-RRH Fi-Wi deployment, each radio head operates on the same + BSSID and channel but sits in a different physical location with its own + RF conditions. While Fi-Wi centralizes all queueing and scheduling + decisions, every RRH must still obey the fundamental 802.11 rule: + listen-before-talk (LBT). +

+ +

+ This is where Fi-Wi diverges sharply from classical multi-AP systems. In + UniFi, Ruckus, Aruba, and all controller-based Wi-Fi architectures, each + AP queue is blind to the RF medium state until it attempts to transmit. + The AP commits a packet to the hardware queue, and if the medium is busy, + the packet waits (Head-of-Line blocking) while the AP performs backoff. +

+ +

+ Fi-Wi inverts this. RRHs continuously report their
+ LBT Eligibility Status (Clear/Busy) to the Concentrator
+ over the high-speed PCIe telemetry path, with update intervals of
+ 100–500 µs, well matched to inter-TXOP scheduling decisions. While the
+ Concentrator cannot react within a single 9 µs backoff slot, it operates
+ on the Inter-TXOP timescale (200–500 µs1).

+ +

+ Before posting a new DMA descriptor to an RRH, the Scheduler checks this + eligibility: +

+ + + +

+ This prevents Head-of-Line Blocking where a packet sits + in a hardware queue on a jammed radio. When multiple RRHs report clear + airtime, Fi-Wi selects among them based on link quality (CSI) and + predicted airtime efficiency. Conversely, if all RRHs report medium-busy, + no RRH is primed; the scheduler pauses the flow to prevent backpressure + from accumulating in the RRH hardware, keeping the queue depth visible in + the Concentrator where L4S can measure it. +
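The eligibility check before descriptor posting can be sketched as follows; the telemetry field names and the airtime-efficiency score are illustrative assumptions:

```python
# Inter-TXOP RRH selection gated on reported LBT eligibility.
def pick_rrh(candidates):
    """candidates: dicts with 'name', 'lbt_clear', 'airtime_score' (hypothetical
    telemetry fields). Returns the chosen RRH name, or None if all are busy."""
    eligible = [c for c in candidates if c["lbt_clear"]]
    if not eligible:
        # All radios report medium-busy: pause the flow so the backlog stays
        # in the Concentrator queue, where L4S can measure it.
        return None
    # Among clear radios, prefer the best predicted airtime efficiency (CSI).
    return max(eligible, key=lambda c: c["airtime_score"])["name"]

rrhs = [
    {"name": "RRH-A", "lbt_clear": True,  "airtime_score": 0.80},
    {"name": "RRH-B", "lbt_clear": False, "airtime_score": 0.95},
]
choice = pick_rrh(rrhs)   # RRH-B is jammed, so data is staged at RRH-A
```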

+ +

+ The result is a form of + Centralized Selection based on LBT Eligibility. Multi-AP + systems coordinate configuration (channels, power), but they cannot + coordinate transmit starts because they lack the real-time + feedback loop to steer packets away from busy radios before they are + queued. +

+ +

+ 1 Representative scheduling interval for mixed traffic + workloads; actual TXOP durations range from tens of microseconds (small + frames) to several milliseconds (large aggregates). +

+ +
+
+ Figure 8-3: Per-RRH LBT eligibility feeding the + centralized Fi-Wi scheduler. +
+ +
+                        (Shared RF / Airtime Domain)
+
+       +----------------------+                 +----------------------+
+       |      RRH-A           |                 |      RRH-B           |
+       |  (Room / Zone A)     |                 |  (Room / Zone B)     |
+       +----------------------+                 +----------------------+
+       |  LBT: Clear          |                 |  LBT: Busy (ED high) |
+       |  Eligible = YES      |                 |  Eligible = NO       |
+       +----------+-----------+                 +-----------+----------+
+                  |                                           |
+                  |  Fiber fronthaul (low latency)            |
+                  |                                           |
+                  v                                           v
+
+                     +-----------------------------------+
+                     |  Fi-Wi Concentrator / Scheduler   |
+                     +-----------------------------------+
+                     |  Centralized queue for building   |
+                     |  L4S feedback / congestion state  |
+                     |                                   |
+                     |  Decision: Post Descriptor to A   |
+                     |  (RRH-B flagged as ineligible;    |
+                     |   prevents HoL blocking)          |
+                     +----------------+------------------+
+                                      |
+                                      | Downlink frames / aggregates
+                                      v
+
+                               +--------------+
+                               |   Client(s)  |
+                               +--------------+
+  
+
+ +
+
+ Figure 8-4: Inter-TXOP Steering. The Scheduler uses LBT + state to decide where to stage the next packet. Note: The RRH + still performs local backoff; the Scheduler simply ensures data is + staged at the RRH that + currently reports clear channel conditions. +
+ +
+Time →
+------------------------------------------------------------------------------------------------->
+
+RRH-A (Room A):        [ Sense medium ]  [ Idle ]  [ Clear ]  [  Transmit TXOP  ]  [ Idle ... ]
+                       |<-- DIFS --->|   |<---- contention window (few slots) ---->|
+
+RRH-B (Room B):        [ Sense medium ]  [  ED high: medium busy  ]  [ Backoff ... ]
+                       |<---- busy ---->|
+
+RRH LBT → Scheduler:       A: "Clear"                  B: "Busy"
+
+Scheduler View:        [ Receive LBT states from A, B ]
+                       [ Mark A = eligible, B = ineligible ]
+                       [ Dequeue next packets from central queue ]
+                       [ Post descriptor to RRH-A only ]
+
+Downlink Action:       RRH-A receives descriptor, enters backoff, wins, transmits.
+                       RRH-B remains silent (no descriptor posted).
+
+Effect:                • No packet trapped in RRH-B's buffer
+                       • No exponential backoff storm
+                       • Deterministic selection of the RRH with clear airtime
+  
+
+ +

8.3 Analogy to Wi-Fi 7 MLO

+ +

+ 802.11be MLO allows a multi-link device (AP/STA) to use multiple links
  (e.g., channels in the 2.4 GHz, 5 GHz, and 6 GHz bands) under a single
  MAC entity. Features include:

+ + + +

+ Fi-Wi provides a similar effect at the building scale, + but with important differences: +

+ + + +

+ Because the RRHs are spatially distributed around rooms + and hallways, Fi-Wi gains advantages that co-located antennas cannot + provide: +

+ + + +

+ These advantages come from intelligent packet routing and + dynamic RRH selection, not from RF phase coordination or + simultaneous beamforming across RRHs. +

+ +

+ 8.3.1 Fi-Wi vs Wi-Fi 7 MLO: Compliance and Control +

+ +

+ Fi-Wi strictly adheres to local regulatory compliance. + The Concentrator manages the queue and the schedule, but + the RRH manages the compliance. +

+ +

+ When the Scheduler assigns a TXOP to an RRH, it posts a descriptor. The + RRH hardware then performs standard 802.11 EDCA: +

+ +
  1. It senses the medium.
  2. It draws a random backoff counter.
  3. It counts down only when the medium is idle.
  4. It transmits when the counter reaches zero.
+ +

The Architectural Difference:

+ +

+ In MLO or Mesh: If an AP commits a packet to a radio and + that radio hits congestion, the packet is trapped in the local buffer. The + backoff might take 50ms. During this time, the AP's other radios (or other + APs in the mesh) might be idle, but they cannot help because the packet is + already "owned" by the busy MAC. +

+ +

+ In Fi-Wi: The packet remains in the Concentrator's + central memory until the last possible moment (see Appendix F). If the + Concentrator sees an RRH entering deep backoff (via real-time telemetry) + or reporting "Busy," it stops posting new descriptors to that RRH and + steers subsequent traffic to a free RRH. The backoff engine remains local + (compliance), but the queue feeding it is steered globally (performance). +
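The late-binding behavior described here can be reduced to a minimal sketch, with all names illustrative. The essential property is that a packet carries no RRH assignment until the moment a descriptor is posted:

```python
import collections

class CentralQueue:
    """Late-binding queue: a packet is bound to a radio only at the
    moment a descriptor is posted, never at enqueue time."""

    def __init__(self):
        self.q = collections.deque()

    def enqueue(self, pkt):
        self.q.append(pkt)            # no RRH chosen yet

    def post_next(self, rrh_busy):
        """Bind the head packet to a currently-clear RRH, if any.

        rrh_busy maps RRH name -> True if its medium is busy."""
        clear = [name for name, busy in rrh_busy.items() if not busy]
        if not self.q or not clear:
            return None               # nothing to send, or all radios busy
        return (self.q.popleft(), clear[0])
```

With RRH-A in deep backoff, `post_next({"RRH-A": True, "RRH-B": False})` hands the packet to RRH-B; in the MLO or mesh case the same packet would already be trapped in RRH-A's local buffer.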

+ +

+ This allows Fi-Wi to scale airtime domains across an entire building while + preventing the multi-node contention collapse that plagues traditional + Wi-Fi networks. +

+ +
+
+ Figure 8-6: Per-airtime-domain queueing and scheduling + in MLO versus Fi-Wi. +
+ +
+Wi-Fi 7 MLO: per-radio queues and MAC logic           Fi-Wi: one centralized queue per airtime-domain
+================================================      ===============================================
+
+   Airtime-domain                                    Airtime-domain
+   --------------                                    --------------
+
+   +-------------+   +-------------+                +-------------------------+
+   |  Radio 1    |   |  Radio 2    |                |   Fi-Wi Concentrator    |
+   | MAC engine  |   | MAC engine  |                |  (per airtime-domain)   |
+   | Backoff     |   | Backoff     |                +-------------------------+
+   | DMA queues  |   | DMA queues  |                |  Centralized queue      |
+   +------+------+   +------+------+                |  AQM / L4S feedback     |
+          |                 |                       |  Scheduler              |
+          |                 |                       +-----------+-------------+
+          v                 v                                   |
+   Packet trapped          Packet trapped                       |
+   in local queue          in local queue                       |
+   during backoff          during backoff                       v
+
+                                                     +--------+-------+    +--------+-------+
+                                                     |   RRH A        |    |   RRH B        |
+                                                     | RF front-end   |    | RF front-end   |
+                                                     | LBT + backoff  |    | LBT + backoff  |
+                                                     +--------+-------+    +--------+-------+
+                                                              ^                    ^
+                                                              |                    |
+                                                   Scheduler posts descriptor only to
+                                                   the RRH that is clear and eligible.
+
+  
+
+ +

8.4 Preserving the "single bottleneck" L4S view

+ +

+ To keep the L4S control loop stable, Fi-Wi needs to preserve a
  single bottleneck queue per flow even while using
  multiple RRHs:

+ + + +

In other words:

+ + + +

+ 9. Dynamic Point Selection and Intelligent Frequency Reuse +

+ +

+ Traditional Wi-Fi deployments suffer from two fundamental problems in + high-density environments: (1) clients are statically associated to a + single AP based on initial connection, leading to suboptimal performance + as they move, and (2) autonomous APs compete for airtime through CSMA/CA + contention, creating interference. Fi-Wi inverts this paradigm through + Dynamic Point Selection—continuously choosing the optimal + RRH per packet—and Intelligent Frequency Reuse—leveraging + spatial isolation to maximize capacity. +

+ +

9.1 Dynamic Point Selection: The Core Capability

+ +

+ Unlike traditional Wi-Fi where clients are physically and logically tied + to a single Access Point (AP), Fi-Wi treats the entire building as a + single Virtual Cell. The Concentrator maintains real-time + Channel State Information (CSI) from all RRHs and dynamically selects the + optimal transmission point for each individual packet. +

+ +

9.1.1 The Roaming Paradigm Shift: Negotiation vs. Execution

+ +

+ To understand the magnitude of this shift, we must compare the standard + "Fast BSS Transition" (802.11r) with the Fi-Wi approach. In standard + Wi-Fi, mobility is a negotiation. In Fi-Wi, it is an execution. +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
StepStandard Wi-Fi (802.11r / Fast Roaming)Fi-Wi (Dynamic Point Selection)
1. TriggerClient detects low RSSI and decides to scan. + Concentrator detects better path via Uplink SNR. +
2. Action + Client tunes radio off-channel to scan for beacons (Latency spike: + 50–100ms). + Zero Action. Client stays on channel.
3. Handshake + Client sends Auth + Re-Assoc frames. AP validates + keys. + None. No Over-the-Air frames.
4. SwitchAP 1 tears down keys; AP 2 installs keys. + Concentrator updates the DL_RRH_ID pointer in memory. +
Total Time~50ms – 150ms (Best case)< 1ms (PCIe Write)
+ +

+ While 802.11r is sufficient for buffered video (Netflix), it typically + breaks real-time applications like Voice over Wi-Fi (VoWiFi) and VR/XR, + where a 50ms gap causes audio dropouts or visual artifacts. Fi-Wi's + sub-millisecond switching ensures true continuity. +
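Under the table's step 4, the entire Fi-Wi "roam" is a single field update in Concentrator memory. A minimal sketch follows; `ClientRecord` and `handover` are hypothetical names, but `DL_RRH_ID` is the pointer named in the table:

```python
class ClientRecord:
    """Hypothetical per-client downlink state held in Concentrator
    memory; dl_rrh_id stands in for the DL_RRH_ID pointer."""

    def __init__(self, mac, dl_rrh_id):
        self.mac = mac
        self.dl_rrh_id = dl_rrh_id    # RRH currently serving downlink

def handover(client, new_rrh_id):
    """A Fi-Wi 'roam': no off-channel scan, no Auth/Re-Assoc frames,
    no key teardown. The next dequeued packet simply DMAs to the
    newly selected RRH."""
    client.dl_rrh_id = new_rrh_id

alice = ClientRecord("aa:bb:cc:dd:ee:ff", "RRH-A")
handover(alice, "RRH-B")              # the entire "handshake"
```

Because the 802.11 security association lives at the Concentrator rather than per-AP, there is nothing else to move.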

+ +

9.1.2 How It Works

+ + + +

9.1.3 Example Scenario

+ +

Consider "Alice" on a VR headset walking down a hallway:

+ +
  1. Alice starts a session in Room 304 (near RRH-A: RSSI -40 dBm). The Concentrator routes packets via RRH-A.
  2. Alice walks toward the doorway. RRH-A degrades (-55 dBm) while the hallway unit, RRH-B, improves (-45 dBm).
  3. The Concentrator detects this crossing point in the CSI data.
  4. For the very next packet, the pointer switches to RRH-B.
  5. Result: Alice's VR stream continues without a single dropped frame or latency spike. She is unaware that the transmission point changed.
+ +
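Alice's hand-off can be expressed as a per-packet selection over the latest RSSI/CSI report. This is a deliberately stripped-down sketch; the real selector also weighs LBT eligibility and predicted airtime efficiency (Section 8):

```python
def best_rrh(rssi_dbm):
    """Per-packet point selection: the RRH with the strongest entry
    in the latest uplink RSSI/CSI report wins."""
    return max(rssi_dbm, key=rssi_dbm.get)

# Values from the walking scenario above:
assert best_rrh({"RRH-A": -40, "RRH-B": -60}) == "RRH-A"  # in Room 304
assert best_rrh({"RRH-A": -55, "RRH-B": -45}) == "RRH-B"  # at the doorway
```

Because the decision is re-evaluated for every packet, there is no "sticky client" problem and no hysteresis tuning: the crossing point is simply the packet at which the maximum changes.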

9.3 Intelligent Frequency Reuse

+ +

+ In traditional Wi-Fi, neighboring APs on the same channel create + co-channel interference. The standard solution is to assign different + channels (e.g., AP-A uses Channel 36, AP-B uses Channel 48), but this + wastes spectrum. Fi-Wi enables + intelligent frequency reuse—using the same channel across + multiple RRHs when spatial conditions allow. +

+ +

When Frequency Reuse Works

+ +

+ Frequency reuse is viable when clients are in
  spatially separated locations with significant isolation
  (typically >25–30 dB attenuation due to walls, floors, or distance).

+ +

Example: Adjacent Rooms

+ + + +

The Fi-Wi Decision:

+ +
  1. Concentrator detects >30 dB spatial isolation via CSI measurements
  2. Configures both RRH-A and RRH-B to operate on Channel 36
  3. Each RRH performs independent CSMA/CA in its local environment
  4. Cross-interference is minimal due to spatial isolation
  5. Result: Effective channel capacity is doubled without requiring additional spectrum
+ +
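The reuse decision above reduces to a threshold test on measured isolation. In this sketch the 30 dB threshold and the channel numbers follow the example in the text; a real planner would also track client positions and re-evaluate continuously as people move:

```python
def plan_channels(isolation_db, reuse_threshold_db=30.0,
                  channels=(36, 48)):
    """Decide whether two RRHs may share one channel.

    isolation_db: CSI-measured path loss between the two cells.
    Returns a (rrh_a_channel, rrh_b_channel) tuple."""
    if isolation_db > reuse_threshold_db:
        # Enough wall/floor attenuation: reuse roughly doubles capacity.
        return (channels[0], channels[0])
    # Poor isolation (e.g. an open doorway): fall back to a split plan.
    return (channels[0], channels[1])
```

For example, `plan_channels(35.0)` keeps both RRHs on Channel 36, while `plan_channels(18.0)` reverts to the static 36/48 split.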

Dynamic Adaptation

+ +

+ The key advantage over static channel planning is + real-time adaptation: +

+ + + +

Why Autonomous APs Cannot Do This

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
RequirementFi-Wi (C-RAN)Autonomous APs
Global CSI Visibility + Complete: Concentrator sees CSI from all RRHs to all clients in + real-time + + Fragmented: Each AP only knows its own channel. Must exchange info + over backhaul (high latency) +
Decision Latency + Microseconds: Concentrator makes decisions in software at µs + granularity + + Milliseconds to seconds: APs coordinate via slow management protocols +
Adaptation SpeedPer-packet: Can switch RRH or channel based on every CSI update + Minutes: Channel changes require beacon updates, client reassociation +
Client DisruptionNone: Decisions are transparent to clients + High: Channel changes or AP reassignment cause connectivity + interruptions +
+ +

9.4 Transparent Integration with L4S

+ +

+ The complexity of dynamic point selection and frequency reuse is hidden + from the L4S congestion control loop. Traffic still lives in + per-airtime-domain group queues. When the Concentrator enables frequency + reuse or optimizes RRH selection, it simply affects the effective service + rate μ(t) of the queue. +

+ +

+ The PI² controller in the outer loop (see + Section 5) sees the queue draining faster and + naturally reduces ECN marking. This allows L4S senders (TCP Prague) to + ramp up their congestion windows to fill the expanded capacity. The system + automatically discovers and exploits available spatial capacity without + requiring changes to congestion control algorithms or application + awareness. +
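The coupling between service rate and marking can be sketched in the style of the PI² AQM (RFC 9332). The gains, the 15 ms target, and the linear/squared coupling below are illustrative defaults borrowed from that style, not values specified by this document:

```python
def pi2_update(p, qdelay_s, qdelay_prev_s, target_s=0.015,
               alpha=0.16, beta=3.2):
    """One controller step in the style of PI2: adjust the base
    probability p from queue delay error and its rate of change.
    Delays are in seconds; gains are illustrative defaults."""
    p += alpha * (qdelay_s - target_s) + beta * (qdelay_s - qdelay_prev_s)
    return min(max(p, 0.0), 1.0)

def coupled_probabilities(p, k=2.0):
    """Couple the base probability to the two traffic classes:
    L4S ECN-mark probability ~ k*p, Classic drop ~ p**2."""
    return min(k * p, 1.0), p ** 2

# Frequency reuse doubles the service rate mu(t): queue delay falls,
# p falls, L4S marking eases, and Prague senders ramp up.
p = pi2_update(0.10, qdelay_s=0.005, qdelay_prev_s=0.015)
```

The point for Fi-Wi is that the controller needs no knowledge of RRH selection or reuse decisions; it only sees the queue drain faster.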

+ +
+

+ 9.5 Governing Station Media Access: The Control Hierarchy +

+ +

+ A common critique of centralized wireless architectures is the + "autonomous client problem": while the infrastructure can be + coordinated, the stations (STAs) are independent entities that contend + for the medium using their own logic. +

+ +

+ Fi-Wi addresses this by enforcing a + Control Hierarchy that governs client behavior from the + physical layer up to the transport layer. Instead of passively hoping + for "good client behavior," Fi-Wi uses four distinct mechanisms to + throttle, steer, or schedule station media access. +

+ +
+

Figure 9-3: The Four Tiers of Client Governance

+ +
+Level 1: Deterministic (Hard)
+   [ 802.11ax Trigger Frames ] ──▶ STA must wait for Schedule
+                                    (Zero contention)
+
+Level 2: Transport (Adaptive)
+   [ L4S / ECN Marking ] ────────▶ OS Kernel throttles pacing
+                                    (Reduces MAC load before enqueue)
+
+Level 3: RF Physics (Steering)
+   [ Beacon Power Shaping ] ─────▶ STA firmware seeks new cell
+                                    (Moves demand to different domain)
+
+Level 4: Statistical (Soft)
+   [ WMM / AIFS Parameters ] ────▶ STA adjusts backoff aggression
+                                    (Statistical deprioritization)
+    
+
+ +

1. Deterministic Scheduling (802.11ax/be)

+ +

+ For modern clients (Wi-Fi 6/7), Fi-Wi removes autonomy entirely for + uplink traffic. The Concentrator generates + Trigger Frames via the RRH. +

+ + + +

2. Transport-Layer Pacing (L4S)

+ +

+ For the growing ecosystem of L4S-capable clients (iOS, macOS, Linux, + Windows), control is applied at the Operating System kernel. +

+ + + +

3. RF Footprint Shaping (Beacon Power)

+ +

+ Fi-Wi manipulates the physical environment to restrict which RRHs a + client perceives as viable, effectively "shoving" media access demand to + specific airtime domains. +

+ + + +

4. Statistical Parameter Biasing (WMM/AIFS)

+ +

+ As a defense-in-depth measure for legacy clients, Fi-Wi advertises tuned + WMM EDCA parameters. +

+ + + +
+ Summary: Fi-Wi does not rely on a single method to + control clients. It uses Triggers for precision, + L4S for flow-rate discipline, + RF Shaping for load balancing, and + WMM as a statistical safety net. +
+
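One way to picture the hierarchy in Figure 9-3 is as a capability-driven policy lookup. The capability flag names here are invented for illustration; the tier descriptions follow the figure:

```python
def governance_tiers(client):
    """Return the control tiers governing a client (Figure 9-3).

    `client` is a dict of capability flags. Tiers are cumulative:
    a Wi-Fi 7 L4S client is governed by all four, a legacy client
    only by tiers 3 and 4."""
    tiers = []
    if client.get("he_trigger"):       # 802.11ax/be uplink scheduling
        tiers.append("1: Trigger-frame scheduling (deterministic)")
    if client.get("l4s_ecn"):          # ECN-capable transport stack
        tiers.append("2: L4S/ECN pacing (transport)")
    tiers.append("3: Beacon power shaping (RF steering)")   # everyone
    tiers.append("4: WMM/AIFS biasing (statistical)")       # everyone
    return tiers
```

Even a client with no modern capabilities still lands in tiers 3 and 4, which is what makes WMM the statistical safety net rather than the primary mechanism.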
+ +

9.6 What Dynamic Point Selection Does NOT Enable

+ +

+ To maintain technical accuracy, it is important to clarify what Fi-Wi's + dynamic point selection does not provide: +

+ + + +

These capabilities would require either:

+ + + +

+ Fi-Wi's architecture deliberately focuses on capabilities achievable with
  COTS Wi-Fi chips, providing a 2–3x capacity improvement through
  intelligent management rather than pursuing the 4–6x gains that would
  require custom silicon development.

+ +

9.7 Performance Expectations

+ +

+ Based on the capabilities described above, Fi-Wi provides the following + performance improvements over traditional autonomous AP deployments: +

+ + + +

+ These gains are achieved through + centralized intelligence and microsecond-latency fronthaul, not through RF phase control or coordinated transmission. The + architecture remains fully compliant with unlicensed spectrum regulations + and works with commodity Wi-Fi chipsets. +

+ +

9.8 Summary

+ +

+ Fi-Wi transforms the problem of wireless density by treating it as a + routing and scheduling problem rather than an RF + coordination problem. By centralizing packet memory and MAC scheduling, + Fi-Wi converts adjacent radios from interferers into + dynamically selected access points, allowing the network + to scale capacity through intelligent management rather than collapsing + under interference. +

+ +

+ The key insight is that most Wi-Fi performance problems stem from poor + decisions (wrong AP, wrong channel, wrong timing) rather than fundamental + RF limitations. Fi-Wi solves this by providing the Concentrator with + complete visibility and control, enabling microsecond-granularity + optimization that autonomous APs cannot match. +

+ +
+ +

10. Fi-Wi value vs. Traditional Distributed APs

+ +

+ Modern enterprise Wi-Fi deployments use centralized controllers (Cisco + WLC, Aruba Mobility Controller, Ubiquiti UniFi, Ruckus SmartZone, etc.) to + manage multiple APs. These controllers coordinate the + control plane: channel assignment, transmit power, client + association hints, roaming policies, and security. However, these remain + loosely-coupled systems where the data plane — + queueing, MAC scheduling, aggregation, and packet memory — remains + distributed inside individual APs. +

+ +

+ A traditional AP is not just “running EDCA.” It is running EDCA + after juggling dozens or hundreds of logical MAC queues and state + machines: +

+ + + +

+ With N stations, an AP can easily have on the order of + N × (4–8) logical queues behind a single RF channel. + Every AP in the same RF domain runs this large, isolated, queue-filled + state machine independently. No AP has a global view; controllers see only + coarse statistics. +

+ +

The result:

+ + + +

+ Fi-Wi is fundamentally different: it centralizes both control plane + and data plane with shared state across all RRHs. The + concentrator does not just configure RRHs; it directly manages their + queues, schedules their TXOPs, maintains unified CSI and airtime state, + and applies coordinated ECN marking for each airtime domain. This + architectural difference — not just improved control-plane coordination — + is what enables Fi-Wi’s latency, L4S, and spatial multiplexing advantages. +

+ +
+

Diagram 10-1: Queue Explosion Inside a Traditional AP

+ +
+┌──────────────────────────── Traditional Distributed AP ───────────────────────────┐
+│                                                                                   │
+│  Many MAC queues hidden inside each AP:                                           │
+│                                                                                   │
+│    ┌─────────────┐  ┌─────────────┐  ┌─────────────┐                              │
+│    │ STA 1 TID   │  │ STA 2 TID   │  │ STA N TID   │   ... (N stations × 4–8 TIDs)│
+│    │ Queues      │  │ Queues      │  │ Queues      │                              │
+│    └─────┬───────┘  └─────┬───────┘  └─────┬───────┘                              │
+│          │                │                │                                      │
+│   ┌──────▼────────────────▼────────────────▼──────────┐                           │
+│   │   Firmware Queues (Aggregation, Reorder, BAR/BA)  │                           │
+│   └───────────┬───────────────────────────────────────┘                           │
+│               │                                                                   │
+│   ┌───────────▼──────────────┐                                                    │
+│   │ Hardware MAC Ring Buffers│   (TX/RX DMA)                                      │
+│   └───────────┬──────────────┘                                                    │
+│               │                                                                   │
+│   ┌───────────▼──────────────┐                                                    │
+│   │ EDCA / CSMA-CA Contention│   (Per-AP, no coordination)                        │
+│   └───────────┬──────────────┘                                                    │
+│               │                                                                   │
+│        Long, multi-ms TXOP bursts, inconsistent ECN, early collapse               │
+│                                                                                   │
+└───────────────────────────────────────────────────────────────────────────────────┘
+  
+

+ See also: + Section 2.1 — Why L4S + Legacy Wi-Fi Struggle, + Appendix A — 802.11 Backoff & Collapse Dynamics. +

+
+ +

+ The following subsections detail specific benefits of Fi-Wi’s + cellularized, tightly-coupled architecture compared to controller-managed, + loosely-coupled AP systems. +

+ +

10.1 Deterministic low latency

+ +

Traditional APs:

+ +

+ Each AP builds its own local queues. Under load, large aggregates, + retries, and hidden buffering produce multi-millisecond queueing and + service delays. Tail latency is largely uncontrolled, and varies across + APs sharing the same channel. +

+ +

Fi-Wi (cellularized Wi-Fi, cell-per-room):

+ + + +

10.2 Stable L4S behavior

+ +

Traditional APs:

+ +

+ L4S flows traverse multiple hidden queues: wired bottlenecks, AP-local + queues, firmware queues, and EDCA contention. ECN marking (if it exists at + all) is inconsistent and not tied to a single bottleneck. Collapse + produces noisy, bursty marking or loss, and the L4S control loop becomes + oscillatory or falls back toward classic congestion behavior, especially + in the tails that matter to users. +

+ +

Fi-Wi:

+ + + +

10.3 Aggregation without losing visibility

+ +

Traditional APs:

+ +

+ Aggregation improves PHY efficiency but hides individual packet timing + from the congestion controller. The controller does not know which MSDUs + were grouped into a TXOP, what the queue state was when the TXOP started, + or how long each device has been waiting. +

+ +

Fi-Wi:

+ + + +

+ This combination yields high PHY efficiency and transport-layer + visibility into congestion, instead of having to choose one or the other. +

+ +

10.4 Building-scale coordination

+ +

Controller-managed loosely-coupled APs:

+ +

+ The controller can adjust channels, power, and send steering hints (e.g., + 802.11v), but it cannot see or control: +

+ + + +

+ As a result, these systems rely on heuristic, reactive policies: channel + reassignment after interference is observed, power adjustments based on + neighbor reports, and client steering using RSSI or airtime snapshots. + These help, but they operate on coarse time scales (seconds to minutes) + and cannot fix the fundamental data-plane issues of distributed queues, + MAC contention, and tail latency under load. +

+ +

Fi-Wi cellularized architecture:

+ +

+ The concentrator maintains true + shared state across all RRHs in the building: +

+ + + +

+ Because RRHs are distributed in space (often 2–4 per room in high-density + deployments), Fi-Wi can leverage spatial separation for intelligent + frequency reuse. The concentrator sees CSI from all RRHs and can make + microsecond-granularity decisions about which RRH should transmit each + packet — all while preserving the "single bottleneck queue per airtime + domain" discipline required for stable L4S behavior. +

+ +
+

+ Diagram 10-2: Fi-Wi Centralized Queueing, Scheduling, and Shared State +

+ +
+┌─────────────────────────── Fi-Wi Cellularized Architecture ────────────────────────────┐
+│                                                                                        │
+│     One deep queue per airtime domain                     Shared CSI + µs timestamps   │
+│                                                                                        │
+│          ┌───────────────────────────────────────────┐                                 │
+│          │ Centralized Airtime-Domain Queue (ECN AQM)│◄──────────┐                     │
+│          └───────────────────┬──────────────────────┘            │                     │
+│                              │                                   │                     │
+│   ┌──────────────────────────▼──────────────────────────┐        │                     │
+│   │   Concentrator Scheduler (L4S, TXOP, RF Grouping)   │◄───────┘                     │
+│   │        Dynamic Point Selection per Packet           │                              │
+│   └───────────────┬─────────────────────────┬───────────┘                              │
+│                   │                         │                                          │
+│       PCIe/Fiber  │                         │   PCIe/Fiber                             │
+│                   │                         │                                          │
+│   ┌───────────────▼─────────────┐  ┌────────▼──────────────┐  ...                      │
+│   │    RRH 1 (Thin MAC/PHY)     │  │   RRH 2 (Thin MAC/PHY)│                           │
+│   └───────────────┬─────────────┘  └────────┬──────────────┘                           │
+│                   │                         │                                          │
+│             Selected RRH transmits; others silent in this TXOP                         │
+│                                                                                        │
+└────────────────────────────────────────────────────────────────────────────────────────┘
+  
+

+ See also: + Section 4 — Key Fi-Wi Mechanisms, + Section 5 — Control Architecture, + Section 9 — Dynamic Point Selection. +

+
+ +

10.5 Control Plane vs. Data Plane

+ +

+ The table below summarizes the architectural differences between + controller-managed, loosely-coupled APs and Fi-Wi's cellularized, + tightly-coupled architecture: +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
CapabilityController-Managed Loosely-Coupled APsFi-Wi Cellularized Tightly-Coupled
Control Plane
Channel assignment✓ Centralized✓ Centralized
Transmit power control✓ Centralized✓ Centralized + dynamic beacon shaping
Client steering hints✓ Centralized (802.11v/k)✓ Centralized
Data Plane
Packet queues + ✗ Distributed per-AP; many hidden per-STA/per-TID/firmware queues + ✓ Exactly one deep queue per airtime domain in the concentrator
MAC scheduling & aggregation✗ Autonomous per-AP; long TXOPs under load✓ Coordinated across RRH groups; TXOP length explicitly bounded
Timestamp synchronization✗ Not available at packet level✓ µs-accurate (PTM/PTP) shared across RRHs
Shared CSI state✗ Per-AP only; summarized to controller✓ Building-wide CSI aggregation at the concentrator
Queue visibility & AQM✗ Hidden in each AP; no global AQM + ✓ Fully visible per domain; explicit L4S/AQM on the true bottleneck +
L4S/ECN marking point✗ Inconsistent or absent; multiple uncontrolled bottlenecks✓ Single, well-defined marking point per airtime domain
Dynamic point selection✗ Clients statically associated to one AP✓ Per-packet RRH selection based on real-time CSI (Section 9)
Selection diversity✗ Single AP receives uplink✓ Multiple RRHs receive; best copy selected (Section 9)
Intelligent frequency reuse✗ Static channel plan✓ Dynamic adaptation based on spatial isolation (Section 9)
Per-packet steering between radios✗ Not available✓ Active redundancy and fast failover (Section 8)
Dynamic RF grouping✗ Static AP boundaries✓ Adaptive airtime domains based on CSI and load (Section 6)
+ +
+ Key insight: controller-managed systems coordinate + configuration but leave data-plane behavior distributed and autonomous. + Fi-Wi unifies the data plane with shared state and explicit control of + queues and TXOPs, enabling fundamentally different behavior for latency + control, dynamic point selection, and building-scale coordination. All + capabilities are achieved with COTS Wi-Fi chipsets and comply with + unlicensed spectrum regulations. +
+ +

10.6 Operational and lifecycle advantages

+ +

Controller-managed loosely-coupled APs:

+ + + +

Fi-Wi cellularized architecture:

+ + + +
+ +
+

+ 11. RRH Physical Envelope: Power, Thermals, and Size +

+ +

+ The economic viability of a "Cell-Per-Room" architecture hinges on the + Remote Radio Head (RRH) being fundamentally simpler, cooler, and cheaper + than a traditional Enterprise Access Point. By offloading complex logic + to the Concentrator (Section 13) and precision timing to the Fronthaul + (Section 4.7), the RRH becomes a lean physical device. +

+ +

+ 11.1 The Silicon Strategy: Mobile vs. Enterprise SKUs +

+ +

+ Fi-Wi explicitly selects + Mobile/Client Wi-Fi 7 chipsets (e.g., Qualcomm + FastConnect or Broadcom BCM43xx client series) rather than traditional + Enterprise AP/Networking SKUs. While Section 4.7 detailed how this + enables external clocking, this choice is equally critical for the + physical envelope: +

+ + + +

11.2 Power Budget Composition

+ +

+ We set a hard budget of 3.5–4 W total per RRH, enabling + Power over Ethernet (PoE) Class 1 or 2 operation, or simple remote + powering over hybrid fiber/copper cables. +

+ + + +

11.3 Thermal and Mechanical Implications

+ +

+ A sub-4W envelope fundamentally changes the industrial design + possibilities for the RRH: +

+ + + +

11.4 Concentrator-Side Considerations

+ +

+ Fi-Wi relies on a "Split Thermal" architecture. We deliberately shift + the power density from the edge (the ceiling) to the core (the wiring + closet). +

+ + +
+ +
+ +

12. PCIe Fronthaul (Gen3 x1 over Fiber)

+ +

12.1 Why PCIe as the RRH interface

+ +

+ A central hardware design choice is to make the RRH look like a + PCIe endpoint to the Fi-Wi concentrator. This leverages + the fact that: +

+ + + +

Benefits of this choice:

+ + + +

+ We start with PCIe Gen3, one lane (x1), carried over + fiber via a retimer + optical interface. Higher generations or widths + (Gen4, x2/x4) are possible later but not required for the initial Fi-Wi + performance targets. +

+ +

12.2 Gen3 x1 throughput

+ +

PCIe Gen3 provides:

+ + + +

+ After protocol overhead (TLP headers, DLLPs, flow control), the + sustained payload throughput for Gen3 x1 is in the rough + range of 6–7 Gb/s for large transfers. This is more + than sufficient for: +

+ + + +

+ If a future RRH design must exceed this, the same architecture scales to: +

+ + + +

+ For our initial Fi-Wi deployment assumptions, + Gen3 x1 over fiber is a sensible and sufficient starting point. +
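The 6–7 Gb/s figure follows from simple accounting: 8 GT/s line rate, 128b/130b encoding, per-TLP framing/header/LCRC overhead, and a small share of the link spent on DLLPs. A back-of-envelope sketch, where the overhead values are typical numbers rather than measurements from this system:

```python
def gen3_x1_payload_gbps(mps_bytes=256, tlp_overhead_bytes=24,
                         dllp_share=0.05):
    """Back-of-envelope PCIe Gen3 x1 payload throughput.

    8 GT/s line rate with 128b/130b encoding gives ~7.88 Gb/s of
    data bits; each max-payload TLP carries ~24 B of framing,
    header, and LCRC; a few percent of the link goes to DLLPs
    (ACKs, flow-control credits)."""
    raw_gbps = 8.0 * 128 / 130
    tlp_efficiency = mps_bytes / (mps_bytes + tlp_overhead_bytes)
    return raw_gbps * tlp_efficiency * (1.0 - dllp_share)

print(round(gen3_x1_payload_gbps(), 2))   # ~6.8, within the 6-7 Gb/s range
```

Varying the max payload size or overhead assumptions moves the result within roughly 6–7 Gb/s, which is why the text quotes a range rather than a single number.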

+ +

12.3 Latency characteristics and budget

+ +

PCIe Gen3 latency has several components:

+ + + +

Order-of-magnitude:

+ + + +

Compared to:

+ + + +

+ the PCIe-over-fiber latency is effectively negligible. It + comfortably fits within the microsecond-level time base used for: +

+ + + +

12.4 Mapping queues and metadata

+ +

+ The PCIe model fits naturally with the Fi-Wi queueing and metadata scheme. + Each RRH behaves like a PCIe endpoint with: +

+ + + +

+ The FiWiMeta header lives in host memory adjacent to packet + payloads and is referenced by these descriptors. +

+ +

Downlink flow:

+ +
  1. Concentrator enqueues IP/Ethernet packets into a group queue in DRAM, allocates or updates FiWiMeta (including t_ingress_us and queue snapshot).
  2. Scheduler posts PCIe descriptors to the RRH for the next TXOP, selecting which MSDUs and which RF group/airtime domain.
  3. RRH DMA-fetches the MSDUs via Gen3 x1, builds an aggregate (A-MPDU), transmits over the air, and reports:
+ +
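The downlink steps above can be sketched in Python. Apart from `FiWiMeta` and `t_ingress_us`, which the text defines, all names and fields here are hypothetical illustrations, not the production driver API:

```python
# Minimal sketch of the downlink enqueue -> descriptor-post path.
# Only FiWiMeta / t_ingress_us come from the spec; the rest is assumed.
import time
from dataclasses import dataclass, field

@dataclass
class FiWiMeta:
    t_ingress_us: int   # ingress timestamp on the microsecond time base
    queue_depth: int    # queue snapshot taken at enqueue

@dataclass
class GroupQueue:
    packets: list = field(default_factory=list)

    def enqueue(self, payload: bytes) -> FiWiMeta:
        meta = FiWiMeta(t_ingress_us=int(time.monotonic_ns() // 1000),
                        queue_depth=len(self.packets))
        self.packets.append((payload, meta))
        return meta

def post_txop_descriptors(q: GroupQueue, budget: int) -> list:
    """Select the MSDUs for the next TXOP; the RRH DMA-fetches these."""
    batch, q.packets = q.packets[:budget], q.packets[budget:]
    return batch
```

The key property the sketch shows: metadata lives beside the payload in host memory, and the RRH only ever sees the descriptors the scheduler chooses to post.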

Uplink flow:

+ +
    +
  1. + RRH receives 802.11 frames from STAs, decodes them, and attaches CSI and + MAC status. +
  2. + +
  3. + RRH DMA-writes the frames + metadata into concentrator DRAM via PCIe. +
  4. + +
  5. + Concentrator: + +
  6. +
+ +

In both directions, the PCIe fronthaul:

+ + + +

12.5 PCIe Hot Swap

+ +

+ A critical operational requirement for Fi-Wi is the ability to service, + replace, or add RRHs without bringing down the entire building's wireless + network. PCIe provides native support for this through + hot-plug capability, which is standard in enterprise + server platforms and can be leveraged for Fi-Wi deployments. +

+ +

12.5.1 Hot-plug fundamentals

+ +

+ PCIe hot-plug allows physical insertion and removal of endpoint devices + (RRHs) while the system is running: +

+ + + +

12.5.2 RRH insertion flow

+ +

When a new RRH is connected or powered on:

+ +
    +
  1. + Physical detection: PCIe hot-plug controller detects + the new device via link training. +
  2. + +
  3. + Enumeration: Concentrator OS (Linux) enumerates the new + PCIe endpoint: + +
  4. + +
  5. + Driver initialization: Fi-Wi driver: + +
  6. + +
  7. + RF group integration: Concentrator control plane: + +
  8. +
+ +

+ Time from physical insertion to active traffic forwarding: typically + 1–5 seconds, depending on link training, driver + initialization, and RF group discovery. +

+ +

12.5.3 RRH removal flow

+ +

+ When an RRH is removed (planned maintenance, failure, or surprise + disconnection): +

+ +
    +
  1. + Detection: PCIe hot-plug event or surprise removal + detected: + +
  2. + +
  3. + Traffic rerouting: Concentrator immediately: + +
  4. + +
  5. + Queue cleanup: Driver: + +
  6. + +
  7. + RF group adjustment: Control plane: + +
  8. +
+ +

+ Impact on active connections: minimal to none for STAs + served by multi-RRH domains. Traffic seamlessly fails over to remaining + RRHs within the same RF group. For isolated single-RRH cells, removal + causes brief disconnection until STAs reassociate with neighboring cells. +
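The failover behaviour described above can be sketched as a small control-plane routine. The data structures and the `await_reassociation` flag are illustrative assumptions; the real control plane is more involved:

```python
# Sketch of rerouting when an RRH disappears (surprise removal).
# Group/RRH naming and the orphan flag are hypothetical illustrations.

def fail_over(rf_groups: dict, removed_rrh: str) -> dict:
    """Drop the removed RRH from each RF group's eligible-transmitter set.

    Groups with surviving members keep serving traffic from the same
    centralized queues; groups left empty are flagged so their STAs can
    reassociate with neighbouring cells.
    """
    orphaned = {}
    for group, rrhs in rf_groups.items():
        rrhs.discard(removed_rrh)
        if not rrhs:
            orphaned[group] = "await_reassociation"
    return orphaned

groups = {"floor1": {"rrh-a", "rrh-b"}, "closet": {"rrh-b"}}
orphans = fail_over(groups, "rrh-b")
```

Because the queues never lived on the removed RRH, "rerouting" is just membership editing, which is what makes sub-second failover plausible.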

+ +

12.5.4 Operational advantages

+ +

Hot-plug capability provides critical operational benefits:

+ + + +

12.5.5 Design considerations

+ +

To fully support hot-swap in production deployments:

+ + + +

12.5.6 Contrast with traditional APs

+ +

Traditional distributed APs handle failures differently:

+ + + +

+ Fi-Wi's PCIe hot-plug, combined with multi-RRH airtime domains and + centralized queues, enables + sub-second failover with minimal packet loss—a + qualitative improvement over traditional Wi-Fi high-availability + approaches. +

+ +

12.5.7 Integration with L4S and queue management

+ +

+ Hot-swap events interact cleanly with Fi-Wi's L4S and queueing + architecture: +

+ + + +

+ This separation—queues and control in the concentrator, timing-critical + MAC in hot-swappable RRHs—is precisely what enables graceful hardware + lifecycle management while maintaining the control-theoretic cleanliness + that L4S requires (Appendix A). +

+ +
+ +

+ 13. Hardware Architecture: The Workstation Concentrator vs. The Legacy AP +

+ +

+ To understand why Fi-Wi achieves deterministic latency where traditional + Wi-Fi fails, we must look beyond the protocol and into the physical + architecture of the devices. The feasibility of the "Cut-Through" RRH + design relies on the upstream link being non-blocking. Fi-Wi achieves this + by replacing the internal switching fabric of legacy APs with the massive + PCIe lane overprovisioning of a workstation-class Concentrator. +

+ +

+ 13.1 The Legacy Bottleneck: Anatomy of a Traditional AP +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Component       | Traditional AP (The Appliance)           | Fi-Wi RRH (The Peripheral)
Core Silicon    | Complex SoC (Quad-core CPU, NPU, Switch) | Thin PHY/MAC + PCIe Retimer
Data Path       | Store-and-Forward (Switch → CPU → DMA)   | Cut-Through (Fiber → PCIe → Air)
Queues          | 1000s of opaque hardware queues          | Zero deep queues (FIFO only)
Decision Making | Autonomous (Local Scheduler)             | None (Slave to Concentrator)
+ +

+ A traditional Enterprise Access Point is functionally a + "Router-on-a-Stick." It forces high-speed wireless traffic through a + series of internal serialization bottlenecks before the software ever sees + the packet. +

+ +
TRADITIONAL AP ARCHITECTURE (The Traffic Jam)

        [ Cat6 Cable ]
              |
   +----------v-----------+
   |    RJ45 Magnetics    |
   +----------+-----------+
              |
   +----------v-----------+
   |   Ethernet Switch    |  <--- Queuing Point A: Switch Buffer
   |       (or PHY)       |       (Head-of-Line Blocking / Opaque)
   +----------+-----------+
              |
              |  GMII / RGMII / SGMII Link
              |  (Fixed 1G or 2.5G Pipe)
              |
   +----------v-----------+
   |        AP SoC        |
   |                      |
   |   [ CPU / OS ]       |  <--- Queuing Point B: Kernel/Driver
   |        |             |       (Software Bridging Latency)
   |        v             |
   |   [ HW DMA Rings ]   |  <--- Queuing Point C: Hardware Queues
   |   (Per Station/AC)   |       (The "Blind" Enqueue Point)
   |        |             |
   |   [ Wi-Fi MAC/BB ]   |
   +--------+-------------+
            |
        [ Radios ]
+ +

Architectural Flaws in Legacy APs:

+ +
    +
  1. + The GMII Choke: The interface between the internal + Switch and the CPU is a serialized bottleneck (typically GMII/SGMII). + High-speed bursts from Wi-Fi 6E/7 radios can saturate this single link, + causing invisible backpressure inside the SoC. +
  2. + +
  3. + Triple Buffering: A packet is buffered at the Switch + (Point A), then in system RAM (Point B), and finally in the Hardware DMA + Ring (Point C). This "Store-and-Forward" chain destroys the precise + timing required for L4S. +
  4. + +
  5. + Opaque Switching: The internal switch operates + autonomously. The CPU has no visibility into the depth of the switch's + internal buffers, meaning latency accumulates invisibly before the OS + can measure it. +
  6. +
+ +

13.2 The Fi-Wi Solution: The 92-Lane Fabric

+ +

+ Fi-Wi eliminates the internal switch, the GMII link, and the autonomous + CPU. By utilizing high-end workstation silicon (e.g., AMD Threadripper Pro + or Intel Xeon W-3400 series), the Concentrator provides + 92 to 128 native PCIe lanes directly from a CPU with + 24 to 96 high-performance cores. +

+ +

The 92+ lanes of PCIe eliminate the need for an internal Ethernet switch anywhere in the datapath.

+ +
TOPOLOGY COMPARISON

 Standard Server + Switch        Fi-Wi Workstation Concentrator
 ┌─────────────┐                 ┌──────────────────────────┐
 │  Dual CPU   │ 20 Lanes        │     Workstation CPU      │
 │ (High Core) │ per CPU         │ (24-96 Cores, High Freq) │
 └──────┬──────┘                 └────────────┬─────────────┘
        │                        ||||||||||||||| (92 Native Lanes)
 ┌──────▼──────┐                 ↓↓↓↓↓↓↓↓↓↓↓↓↓↓↓
 │ PCIe Switch │ (Congestion     RRH RRH RRH RRH (Direct Attach)
 └─┬─┬─┬─┬─┬─┬─┘  Point)         ... ... ... ...
   ↓ ↓ ↓ ↓ ↓ ↓
 RRH Connections
+ +

13.3 Dedicated Resources and Determinism

+ +

+ By mapping each RRH (or small groups of RRHs) to dedicated root ports on + the CPU, Fi-Wi achieves a Non-Blocking Architecture: +

+ + + +

+ This guarantees that the host DRAM behaves like + Deterministic Ultra-Low Latency Memory rather than a + shared network resource. This stability is the physical foundation that + allows the software-defined queues (Section 14) to operate with + microsecond precision. +

+ +
+ Historical Analogy: How the Cisco 7500 Removed the "Global + Lock"
+ +

+ Just as Fi-Wi removes blocking via massive PCIe lane availability, the + CyBus ASIC in the Cisco 7500 (1990s) solved a similar + bottleneck in routing. +

+ + + +

+ Fi-Wi applies this same "Non-Blocking" philosophy to the wireless + stack, utilizing 92+ lanes of PCIe to ensure that RRH memory access is + never gated by a shared internal switch or software mutex. +

+
+ +

14. Hardware Queues and the Software Advantage

+ +

14.1 The Hardware Queue Problem

+ +

+ Traditional Wi-Fi APs use hardware DMA (Direct Memory Access) rings to + meet strict 802.11 MAC timing requirements—SIFS and DIFS deadlines + measured in microseconds. While this solves the timing problem, it creates + a cascade of architectural constraints that Fi-Wi explicitly avoids. +

+ +

Hardware queues are expensive to implement in silicon. Each queue requires dedicated SRAM for descriptor storage, control logic for pointer management and overflow handling, and power even when idle. Current chip design limits traditional APs to hardware queues at L2 or MAC—typically the four WMM access categories (AC_VO, AC_VI, AC_BE, AC_BK) per radio, multiplied across the N associated stations.

+ +

This handful of queues is sufficient for basic priority handling, but it prevents the sophisticated per-flow scheduling that modern high-density networks require:

+ +
What AP hardware queues prevent:
 ✗ Per-flow fair queuing (would require 100+ queues)
 ✗ DualQ L4S per flow
 ✗ Dynamic queue allocation based on traffic patterns
+ +

14.2 The DMA Ownership Constraint

+ +

+ An equally significant problem is that once packets are enqueued to + hardware DMA rings, the CPU cannot access them without causing + race conditions. This "ownership transfer" creates fundamental + limitations: +

+ +
+ Critical constraint: All packet inspection, + classification, ECN marking, and policy decisions must occur + before handing packets to hardware. After DMA enqueue, software + is blind until transmission completes. +
+ +

This prevents:

+ + + +

14.3 Compensating Hardware

+ +

+ Because hardware queues are limited and packets become inaccessible after + DMA, traditional AP vendors must add compensating hardware functionality + to address these fundamental architectural limitations: +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Fundamental Limitation                   | Hardware Workaround Required             | Complexity Added
Only 4-8 queues → no per-flow fairness   | Airtime fairness tracking engine         | Significant additional logic
Only 4-8 queues → no per-STA queuing     | MU-MIMO grouping and coordination        | Complex scheduling algorithms
Can't inspect after enqueue              | Hardware deep packet inspection engine   | Pattern matching, state tracking
Can't mark ECN in real-time              | Hardware ECN marker with threshold logic | Queue monitoring, marking logic
Can't reclassify flows dynamically       | Flow classification accelerator (TCAM)   | Fixed rules; high-priority only; cannot update easily
+ +

+ This compensating hardware represents substantial additional silicon area, + design complexity, and verification effort. More critically, + hardware-based solutions are fundamentally limited to fixed thresholds and + simple policies that were designed into the chip. They cannot implement + sophisticated algorithms like CoDel, PIE, or adaptive per-flow policies + that require complex state and frequent updates. +

+ +

14.4 Fi-Wi's Architectural Solution

+ +

Fi-Wi escapes these constraints through architectural separation:

+ +

RRH: Timing without queuing

+ +

+ RRH silicon implements only timing-critical functions (MAC/PHY, + synchronization) with zero hardware queues. Packets arrive from the + concentrator milliseconds before transmission, stay in simple descriptor + rings briefly, then transmit. No autonomous queuing or scheduling logic. +

+ +

Concentrator: Unlimited software queues

+ +

All queues live in concentrator DRAM. Because the concentrator operates at TXOP granularity (~600 µs) rather than SIFS granularity (16 µs), it has time for software scheduling. Queue structures are simple data structures in memory—vastly cheaper than dedicated silicon:

+ +
Concentrator per RF group:
 - 1000+ per-flow queues implemented as hash tables in DRAM
 - Each queue is a simple software structure (linked list or array)
 - Memory cost is negligible compared to 8+ GB server DRAM in concentrator
 - No power consumption when idle
 - Can be allocated/deallocated dynamically as needed

Enables what traditional APs cannot do:
 ✓ Per-flow fair queuing (stochastic fairness)
 ✓ DualQ L4S with separate queues per flow class
 ✓ Real-time ECN marking (actual sojourn time at TX)
 ✓ Sophisticated AQM (CoDel, PIE, custom algorithms)
 ✓ Deep packet inspection any time before TX
 ✓ Dynamic flow reclassification
 ✓ Full queue visibility for debugging
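A minimal sketch of such software queues, using plain Python containers keyed by the flow 5-tuple (the service discipline shown is a simplified round-robin stand-in for the real scheduler):

```python
# Per-flow queues as dictionary-backed software structures: allocation
# and deallocation are effectively free compared to fixed silicon queues.
from collections import defaultdict, deque

class RFGroupQueues:
    def __init__(self):
        self.flows = defaultdict(deque)   # 5-tuple -> FIFO of packets

    def enqueue(self, five_tuple, pkt):
        self.flows[five_tuple].append(pkt)   # queue created on demand

    def dequeue_round_robin(self):
        """One service pass across active flows (stochastic fairness)."""
        served = []
        for key in list(self.flows):
            served.append(self.flows[key].popleft())
            if not self.flows[key]:          # reclaim idle flow queues
                del self.flows[key]
        return served
```

The point is not the scheduling algorithm but the cost model: a new per-flow queue is a dictionary entry, so "1000+ queues" is unremarkable in DRAM.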
+ +

Packet ownership until last moment

+ +

+ The critical difference: packets remain in concentrator DRAM + (software-accessible) until milliseconds before transmission. The + scheduler can: +

+ + + +

The RRH owns packets only for the ~1 ms it takes to transmit a TXOP—too brief to constrain the system.
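Late ownership is what makes marking-at-dequeue possible: the sojourn time is computed at the moment the packet is handed to the RRH, not at enqueue. A minimal sketch, with an illustrative threshold in place of the real L4S ramp:

```python
# Sketch of ECN marking at dequeue time. The fixed threshold is an
# illustrative stand-in for a real L4S marking function.
L4S_MARK_THRESHOLD_US = 1000   # assumed 1 ms sojourn target

def dequeue_for_txop(pkt_meta: dict, now_us: int) -> dict:
    sojourn_us = now_us - pkt_meta["t_ingress_us"]
    return {
        "ecn_ce": sojourn_us > L4S_MARK_THRESHOLD_US,  # mark, don't drop
        "sojourn_us": sojourn_us,
    }

fresh = dequeue_for_txop({"t_ingress_us": 0}, now_us=400)
stale = dequeue_for_txop({"t_ingress_us": 0}, now_us=2500)
```

A hardware DMA ring cannot do this: once enqueued, the packet's headers are out of reach, so any mark reflects queue state at enqueue time, not at transmission.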

+ +

14.5 Economic and Strategic Impact

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Aspect                    | Traditional AP                          | Fi-Wi
Queue count               | N stations × 4-8 (at MAC or L2 level)   | 1000+ (dynamically allocated, per-flow 5-tuple)
Queue implementation      | Dedicated silicon (expensive)           | Software data structures (negligible cost)
Compensating logic        | Substantial silicon for workarounds     | None needed
Per-flow fairness         | Impossible (insufficient queues)        | Standard capability
Sophisticated AQM         | Simple thresholds only (hardware fixed) | Any algorithm (CoDel, PIE, ML-based)
Policy updates            | Requires new silicon design             | Software configuration or code update
Operational visibility    | Aggregate counters only                 | Full per-flow statistics and queue contents
Algorithm experimentation | Impossible in production                | A/B testing, gradual rollout possible
+ +

+ Beyond the direct silicon cost advantages, Fi-Wi gains strategic + advantages that compound over time: +

+ + + +

14.6 Architectural Principle

+ +

Fi-Wi's approach follows a clear design principle:

+ +
+ RRH (hardware): Only latency-critical functions requiring + microsecond determinism (MAC timing, PHY processing, synchronization).
+
+ Concentrator (software): All scheduling, queuing, + inspection, marking, policy, and adaptation—anything that benefits from + flexibility, visibility, or frequent updates. +
+ +

+ This separation is not arbitrary. It's driven by fundamental constraints: + hardware is expensive, inflexible, and opaque; software is cheap, + updatable, and inspectable. By placing intelligence in software and only + timing-critical functions in hardware, Fi-Wi achieves both the performance + of hardware-accelerated systems and the flexibility of software-defined + networking—advantages that traditional distributed-AP architectures cannot + replicate due to their need for autonomous per-AP decision-making at + microsecond timescales. +

+ +

15. Adaptive Control via Machine Learning

+ +

+ The Fi-Wi architecture's centralized observability enables machine + learning to optimize MCS transition dynamics on a per-site basis. Unlike + autonomous APs that operate on partial, local state, the Concentrator + observes the complete state-transition graph for all RRHs under a single + clock. This section describes how Fi-Wi combines physics-based models with + adaptive learning to optimize performance. +

+ +

+ 15.1 The MCS State Graph as a Probability Current Network +

+ +

+ The MCS state graph from Section 2.7 can be formalized as a probability + current network, where each node represents a PHY configuration state (MCS + index, spatial stream count) and edges represent transitions between + states. The system's behavior follows probability flow dynamics: +

+ +
+

+ Figure 15-1: Interactive Animation: MCS and Spatial Stream Performance + (with Eigen Space) +

+ +
+

+ Interactive Animation: MCS and Spatial Stream Performance (with Eigen + Space) +

+ +
+
+

Autonomous AP(s)

+ + + +
+ PER: 0.0% +
+ +
+ Eigen Vectors: 2 +
+ +
+ WLAN Util: + 0.0% (0 Mbps) +
+ +
+ P99.9 Latency: + 0 ms +
+ (802.11: 0 ms + 802.3: + 0 ms) +
+
+
+ +
+

Centralized Concentrator

+ + + +
+ PER: 0.0% +
+ +
+ Eigen Vectors: 16 +
+ +
+ WLAN Util: + 0.0% (0 Mbps) +
+ +
+ P99.9 Latency: + 0 ms +
+ (802.11: 0 ms + 802.3: + 0 ms) +
+
+
+
+ + +
+

+ Flow Field Visualization +

+ +
+
+

Autonomous AP(s) - Flow Field

+ + + +
+ Turbulent Flow (High Entropy) +
+
+ +
+

Centralized Concentrator - Flow Field

+ + + +
+ Laminar Flow (Low Entropy) +
+
+
+
+ + +
+ + +
+

+ Probability Current (J) - Flow Field Visualization +

+ +
+

+ What you're seeing: The vector field (arrows) + shows the "flow" of PPDUs through the MCS/Spatial Stream + space—the "river" of probability current that drives system + behavior. +

+ +

+ Autonomous AP (Left): Turbulent flow with + chaotic arrow directions, sometimes pointing backward when + collisions occur. Multiple shallow potential wells create + competing forces. This represents + High Entropy—the system doesn't know which way + is optimal. +

+ +

+ Centralized Concentrator (Right): Laminar flow + with smooth, coherent streamlines pointing toward the optimum. + Steeper gradients and deeper potential wells create strong + convergence. This represents + Low Entropy (Determinism)—the system has clear + direction toward the optimal state. +

+
+
+ + +
+
+ +
+ +
+ +
+ When enabled, only one device is visualized. All devices still + run in the background to drive system dynamics, but you can see + the turbulence affecting a single device more clearly. +
+
+ +
+ +
+ L4S ON: Optimizes both PHY rates and latency + (conservative, stable MCS)
+ Note: L4S ECN signaling only works with + Centralized Concentrator architecture.
+ Autonomous APs can't coordinate aggregate WAN state, so queue + delay reduction doesn't apply.
+ L4S OFF (Greedy): Maximizes PHY rates + (aggressive, higher MCS targets) +
+
+ +
+ +
+ Unchecked (Phase 1): Software MAC Coordination + only. Eliminates collisions, but Eigenvectors capped at 4 + (Hardware Limit).
+ Checked (Phase 2): FPGA-based Coherency. + Unlocks Distributed MIMO (Rank Expansion). Eigenvectors scale to + 16+. +
+
+ +
+
+ + +
+ 15 devices +
+
+ +
+ +
+
+
+ Autonomous AP(s): +
+ +
+ 1 domain +
+
+ +
+
+ Fi-Wi RRH per room: +
+ +
+ 400 sq ft/room (25 + domains) +
+
+
+
+ +
+ + +
+ 10,000 sq ft (1.5 + devices per 1,000 sq ft) +
+
+
+
+
+
+ + +
+
+ Technical Justification for FPGA (Phase 2) +
+ +
+
+ Phase 1: Coordinated Scheduling (Software/MAC) +

+ In Phase 1, the Central Concentrator uses standard MAC-level + timing to prevent APs from transmitting simultaneously on the same + frequency.
+
+ Result: This successfully eliminates the "Red" + (collisions) seen in the Autonomous model. However, because the + Radio Heads (RRHs) are not phase-aligned, they cannot perform + Joint Transmission. The channel rank is limited to the physical + antennas of a single RRH (Rank 4). Throughput hits a "Glass + Ceiling." +

+
+ +
+ Phase 2: Distributed MIMO (FPGA/PHY) +

+ In Phase 2, we introduce an FPGA to achieve sub-nanosecond + synchronization between RRHs. This allows multiple RRHs to act as + a single, distributed antenna array.
+
+ Result: This unlocks Rank Expansion. The + system can resolve 16+ spatial streams (Eigenvectors) + simultaneously. The "Glass Ceiling" is removed, and throughput + scales linearly with the number of RRHs deployed. +

+ +
+ Implementation Mechanism: To achieve <1ns + precision over fiber, the system utilizes the + White Rabbit (IEEE 1588 HA) protocol. An FPGA on + the RRH compensates for variable PCIe bus latency (using + PCIe PTM) and fiber propagation delay, ensuring + the RRH clock is phase-locked to the Central Concentrator. +
+
+
+
+ +
+ +

+ 15.2 What Gets Learned: The Transition Rate Matrix +

+ +

+ Machine learning in Fi-Wi optimizes the transition rate matrix + W based on telemetry that is only observable in a + centralized architecture. For each potential transition from state + i (MCSi, SSi) to state + j (MCSj, SSj), the learned rate depends on: +

+ +
+ Per-Transition Learning Inputs: + +
+ +

The learned transition rate function takes the form:

+ +
+
+ W[i→j] = f(CSI, PER, queue_depth, interference, density, time, + site_params) +
+
+ +

+ This learned function answers: + "Given the current state and observed conditions, what is the optimal + next MCS/SS configuration to meet the L4S latency target while + maximizing achievable throughput?" +

+ +
+ Slow Learning, Fast Execution: +

+ The ML engine operates on the control plane timescale with adaptive + update rates: milliseconds for sudden events (interference spike + detection requiring rapid response), seconds for typical rate adaptation + (matching the timescales demonstrated by minstrel/minstrel_ht + schedulers), and minutes for long-term pattern learning (daily traffic + patterns, where slower updates are sufficient). This decouples the + computational cost of learning from the latency constraints of packet + transmission. The scheduler does not run neural network inference per + packet—it uses a pre-computed policy matrix updated at rates appropriate + to the dynamics being observed. +

+
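The split between slow policy updates and fast per-TXOP lookups can be sketched as follows. The state discretization (MCS, SS, load bucket) and the table contents are assumptions for illustration, not the production data structure:

```python
# "Slow learning, fast execution": the ML engine refreshes a policy
# table asynchronously; the scheduler only performs an O(1) lookup.

policy = {}   # (mcs, ss, load_bucket) -> recommended next (mcs, ss)

def refresh_policy(learned_rates: dict) -> None:
    """Control-plane path (ms to minutes): rebuild the lookup table."""
    policy.update(learned_rates)

def next_config(mcs: int, ss: int, queue_depth: int):
    """Data-plane path (per TXOP): pure table lookup, no inference."""
    load_bucket = min(queue_depth // 32, 3)   # assumed coarse bucketing
    return policy.get((mcs, ss, load_bucket), (mcs, ss))  # default: hold

refresh_policy({(9, 2, 3): (7, 2)})   # e.g. back off under heavy load
```

The design choice this illustrates: inference cost is paid on the control-plane timescale, so the per-packet path never waits on the model.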
+ +

15.3 Physics-Informed Learning

+ +

+ Fi-Wi uses physics-informed machine learning that + combines Shannon capacity theory with learned corrections. This hybrid + approach provides explainability, sample efficiency, and principled + generalization. +

+ +

The transition rate decomposes into two components:

+ +
+
W[i→j] = Wphysics(SNR, BW) · Wlearned(site, time, load)
               ↑                        ↑
       Shannon-theoretic          Site-specific
            baseline               corrections
+
+ +

+ Wphysics: The physics baseline uses Shannon + capacity to establish theoretical bounds. For each MCS index, the required + SNR is known from 802.11 specifications (e.g., MCS 11 requires ~30 dB). + The base transition rate is the probability that current SNR exceeds the + threshold given measured CSI. +

+ +

+ Wlearned: The learned correction factor + captures deviations from ideal conditions on a per-station basis, as + different spatial stream capabilities and local RF environments require + station-specific adaptation: +

+ + + +

+ This approach uses residual learning: the physics model + Wphysics provides the coarse steering (the "prior"), while the + ML model learns the residual error Δ specific to the site. This guarantees + the system never performs worse than a standard physics-based model, even + before site-specific training converges. The ML correction is additive (or + multiplicative) to a known-good baseline. +
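The decomposition can be sketched numerically. The SNR thresholds and the soft-threshold shape are illustrative approximations, not values from the 802.11 specifications:

```python
# Sketch of the residual decomposition W = Wphysics * Wlearned.
# MCS SNR thresholds and the sigmoid soft-threshold are assumed values.
import math

MCS_SNR_DB = {7: 25.0, 9: 28.0, 11: 30.0}   # approx required SNR per MCS

def w_physics(target_mcs: int, snr_db: float) -> float:
    """Physics prior: probability the measured SNR clears the threshold."""
    margin = snr_db - MCS_SNR_DB[target_mcs]
    return 1.0 / (1.0 + math.exp(-margin))   # soft threshold on margin

def w_total(target_mcs: int, snr_db: float, w_learned: float = 1.0) -> float:
    # w_learned stays ~1.0 before site training converges, so the system
    # never does worse than the physics-based baseline.
    return w_physics(target_mcs, snr_db) * w_learned
```

With `w_learned = 1.0` the system behaves exactly like the prior; site training only bends the rates where measured outcomes diverge from theory.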

+ +

This decomposition provides three advantages:

+ +
    +
  1. + Explainability: When Wlearned deviates + significantly from 1.0, the system can flag anomalies and explain why + performance differs from theory. +
  2. + +
  3. + Sample Efficiency: The physics prior means the ML model + only needs to learn corrections rather than the full mapping + from scratch. +
  4. + +
  5. + Generalization: The base model Wphysics is + universal. Site-specific Wlearned factors can be initialized + from similar deployments and fine-tuned with site-specific data. +
  6. +
+ +

15.4 Training Data from Centralized Observability

+ +

+ The Concentrator's complete state visibility provides labeled training + examples that are impossible to obtain in distributed AP systems. Each + scheduling decision creates a training tuple: +

+ +
+ Training Example Structure: +
State_t:
 • MCS = 9, SS = 2 (current PHY configuration)
 • Queue depth = 50 packets
 • Sojourn time = 800 µs
 • CSI = [λ₁=0.92, λ₂=0.58, κ=8.2 dB] (from RRH-A)
 • PER_recent = 0.02 (last 100 packets)
 • Client density = 12 stations
 • Interference = -75 dBm

Action:
 • Transition to MCS = 7, SS = 2

Outcome_t+1:
 • PER = 0.01 (improved)
 • Throughput = 380 Mbps
 • Latency = 450 µs (met L4S target)
 • Queue drain rate = increased

Label: ✓ GOOD TRANSITION
+
+ +

+ Over time, the Concentrator accumulates thousands of these labeled + examples across varying conditions. The ML model learns patterns such as: +

+ + + +

+ This supervised learning is + only possible with centralized observability. As detailed + in Appendix H, autonomous APs lack: +

+ + + +

+ It's worth noting that supervised learning doesn't require perfect ground + truth labels to be effective—even relative quality assessments ("better" + vs "worse") can drive learning. However, Fi-Wi's complete observability + provides significantly richer training signals: precise measurements of + queue impact, throughput changes, and latency effects that enable more + efficient learning compared to the partial observability available to + autonomous systems. +
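How a scheduling decision becomes a labeled example can be sketched as below. The labeling rule shown (latency target met and PER not worse) is a simplified stand-in for the real outcome-quality function:

```python
# Sketch: turning one (state, action, outcome) tuple into a label.
# The GOOD/BAD rule is an assumed simplification of the real criterion.

L4S_TARGET_US = 500   # assumed latency target for the example

def label_transition(per_before: float, per_after: float,
                     latency_us: float) -> str:
    good = latency_us <= L4S_TARGET_US and per_after <= per_before
    return "GOOD" if good else "BAD"

example = {
    "state":   {"mcs": 9, "ss": 2, "per": 0.02},
    "action":  {"mcs": 7, "ss": 2},
    "outcome": {"per": 0.01, "latency_us": 450},
}
example["label"] = label_transition(0.02, 0.01, 450)
```

The crucial precondition is that every field in the tuple is measured under one clock; without centralized queues, `latency_us` and the queue impact simply are not observable.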

+ +

15.5 Transfer Learning Across Sites

+ +

+ Fi-Wi's ML strategy uses transfer learning to balance generalization + across sites with site-specific optimization: +

+ +

Base Model (Cross-Site Training):

+ +

+ A foundational model is trained across multiple deployment sites to learn + universal patterns: +

+ +
+
Wbase[i→j] = funiversal(CSI, PER, queue_depth, density)

Learns: general relationships between SNR, MCS, PER, and density
+
+ +

Site-Specific Adaptation:

+ +

+ When deployed to a new site, the base model is augmented with learned + corrections: +

+ +
+
Wsite[i→j] = Wbase[i→j] + Δbuilding + Δtemporal

Δbuilding: Building-specific RF corrections
 • Material attenuation (concrete vs drywall)
 • Room geometry (open-plan vs cubicles)
 • Persistent interference sources

Δtemporal: Time-varying patterns
 • Rush hour density
 • Weekend vs weekday usage
 • Seasonal variations
+
+ +

Continuous Adaptation:

+ +

+ The system continues to adapt using online learning with safety + constraints: +

+ + + +

15.6 The Learning Feedback Loop

+ +

+ Fi-Wi's ML capability creates a feedback loop that improves system + performance over time: +

+ +
+
1. Centralized Observability → Complete visibility of state, actions, outcomes
2. Supervised Learning → Labeled examples: (State, Action) → Outcome quality
3. Improved Transition Rates → Wlearned optimizes MCS selection per-site
4. Better User Experience → Higher throughput, lower latency, fewer errors
5. More Training Data → New conditions explored → model improves

[Cycle repeats continuously]
+
+ +

+ This loop is unique to centralized architectures. + Autonomous APs cannot generate ground truth labels without queue + observability. Coordinated AP systems (where APs share summaries via a + controller) see effects (latency, ECN) but not causes (queue growth, retry + timing, aggregation depth) due to high inference distance. +

+ +

+ Fi-Wi's centralized state graph provides the + causal observability that machine learning requires. The + probability current framework gives this learning a rigorous mathematical + foundation: we are learning the transition rate matrix of a physical + system governed by conservation laws. +

+ +
+ Summary: Centralization Enables Learning +

+ Machine learning requires complete, structured training examples where + actions, states, and outcomes are observable under consistent + measurement. Fi-Wi's centralized architecture provides this by design: + all state transitions occur under a single clock, all queue dynamics are + visible, and all RF outcomes are measurable. This makes the MCS + probability current learnable—something that is architecturally + impossible in distributed, autonomous systems. +

+
+ +

+ 15.7 The Multi-RRH Advantage: Learning the Spatial Network +

+ +

+ The presence of multiple concurrent Radio Heads (RRHs) serves as the + primary multiplier for the Fi-Wi machine learning capability. It + transforms the learning problem from optimizing a single isolated link + into optimizing a spatially coupled network. While a + traditional AP optimizes a local objective function (its own throughput), + the Fi-Wi Concentrator utilizes concurrent RRHs to construct a global view + of the RF environment. +

+ +

+ This multi-RRH architecture impacts the learning model in three critical + ways: +

+ +

1. Global RF State Visibility ("The Super-Eye")

+ +

+ In traditional systems, an AP is blind to the interference seen by its + neighbors. In Fi-Wi, the Concentrator aggregates real-time telemetry from + all RRHs simultaneously. +

+ +

This creates a Global RF State Matrix composed of:

+ + + +

+ This state matrix is sparse, time-aliased, and derived from + standards-compliant telemetry rather than continuous per-packet baseband + capture. +

+ +

+ The model learns not just that "Client A has a weak signal," but + specifically that "Client A is weak on RRH 1, strong on RRH 2, and creates + -80 dBm interference on RRH 3." This global observability enables the + prediction of building-wide interference patterns invisible to single-cell + learners. +

+ +

2. Expanded Action Space (Selection & Redundancy)

+ +

+ Because Fi-Wi treats multiple RRHs as an active redundant set, the ML + engine has a broader action space than a standard rate-control algorithm. + It learns not only how to transmit (MCS and scheduling decisions) + but which RRHs are eligible transmitters for a given packet. +

+ + + +

3. Phase 2 Capability: Eigenstructure & Rank Expansion

+ +

+ Note: This capability requires the hardware-synchronized FPGA + architecture (Phase 2). +

+ +

+ With sub-nanosecond synchronization, the ML engine will be able to resolve + the true distributed Eigenstructure of the + environment—the "shape" of available RF paths across distributed radios. + This allows for Rank Expansion, where the system resolves + more spatial streams (Eigenvectors) than a single physical AP could + support, scaling capacity approximately with the number of RRHs, subject + to channel rank and geometry. +

+ +

+ 15.8 Operational Calibration: Zero-Occupancy Sounding +

+ +

+ To ensure the physics-informed model converges accurately, Fi-Wi employs a + specific operational strategy: Zero-Occupancy Sounding. +

+ +

+ As described in Section 15.5, the site-specific transfer function is + composed of static building characteristics (Hstatic) and + dynamic temporal variations (Δtemporal). To disentangle these + variables, the system schedules automated channel sounding during hours of + minimum occupancy. +

+ + + +
+ The "Tare" Operation: +

+ In metrology, "tare" refers to zeroing a scale by removing known + weights to isolate what you want to measure. Similarly, Fi-Wi "tares" + the RF environment by measuring when human activity (the known + variable) is absent. +

+ +

+ Hmeasured(empty) ≈ Hstatic + Δbuilding +

+ +

+ By sounding when the building is empty, the system effectively removes + the noise of human movement and dynamic scatterers. This allows the + Concentrator to: +

+ +
    +
  1. + Isolate Hstatic: Establish a high-fidelity + ground truth of the static RF environment (walls, glass, steel). +
  2. + +
  3. + Calibrate the Physics Prior: Fine-tune the Shannon + capacity baseline (CShannon) against the specific physical + constraints of the deployment. +
  4. +
+
+ +

+ This establishes a stable baseline "Zero State" for the learning model, + ensuring that subsequent online learning is optimizing for dynamic changes + rather than relearning the static environment. + This separation dramatically improves offline RL dataset conditioning + by preventing the model from relearning static structure while adapting + to temporal dynamics. +

+ +

15.9 Bounded Model Validation During Idle Periods

+ +

+ While the primary learning mode is offline (using historical data), the + centralized Concentrator architecture enables a hybrid approach: + opportunistic, bounded model validation during predicted idle + periods. +

+ +

Idle Period Detection

+ +

+ Because the Concentrator has global visibility of queue states across all + RRHs in an Airtime Domain, it can predict when the RF channel will be + underutilized—a capability fundamentally unavailable to autonomous APs + that see only their local queues. +

+ + + +
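A minimal sketch of domain-wide idle detection, which only the concentrator can compute because only it sees every RRH's backlog (the threshold is an illustrative assumption):

```python
# Sketch of idle-window detection from global queue state: sum backlog
# across every RRH in an airtime domain. Threshold is illustrative.

IDLE_BACKLOG_PKTS = 4   # assumed "near-empty" threshold for the domain

def domain_is_idle(queue_depths_by_rrh: dict) -> bool:
    return sum(queue_depths_by_rrh.values()) <= IDLE_BACKLOG_PKTS

busy = domain_is_idle({"rrh-a": 40, "rrh-b": 3})
quiet = domain_is_idle({"rrh-a": 1, "rrh-b": 0, "rrh-c": 2})
```

An autonomous AP evaluating only its local queue would call the first case idle from `rrh-b`'s perspective; the domain-wide sum shows it is not.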

Safe Validation Protocol

+ +

+ During high-confidence idle predictions, the system can perform controlled + validation and calibration—not arbitrary exploration: +

+ + + +

+ These activities refine the offline model without introducing risk to + production traffic. +

+ +

Production Traffic Protection

+ +

+ Validation is strictly bounded to prevent interference with real traffic: +

+ + + +

+ This hybrid approach provides the safety of offline learning with the + adaptability of continuous refinement, exploiting natural traffic lulls + that autonomous APs cannot collectively identify. +

+ +

+ 15.10 Architectural Comparison: Why Autonomous APs Cannot Learn +

+ +

+ Machine learning for MCS optimization is fundamentally enabled by Fi-Wi's + centralized architecture and impossible in distributed AP systems: +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Requirement for MLAutonomous APFi-Wi Concentrator
Global CSI visibility + ❌ Each AP sees only local channel; no cross-AP interference data + + ✅ Concentrator receives CSI from all RRHs; computes spatial + correlation matrix +
Cross-AP coordination state + ❌ Cannot observe other APs' band selection, power levels, or + scheduling decisions + + ✅ Centralized scheduler has complete visibility of all RRH + configurations and decisions +
Queue observability❌ Queue depth hidden in firmware; sojourn time not exposed✅ Centralized queuing with microsecond-resolution timestamps
Deterministic replay + ❌ Cannot reproduce exact RF conditions; firmware decisions opaque + + ✅ Complete event log enables replay of scheduling decisions and + outcomes +
Inference distance❌ High (5-10 steps from cause to transport-layer effect) + ✅ Low (1-2 steps; queue → schedule → TX outcome directly linked) +
+ +

+ This observability gap is not a vendor implementation issue—it is an + architectural limitation. Autonomous APs cannot generate + high-quality training labels without queue observability. +

+ +

+ 16. Concentrator Fast Path: DPDK, DMA, and Queue Determinism +

+ +

+ The preceding sections established the architecture of the Fi-Wi + concentrator: centralized packet memory (Section 4.4), group queues as the + sole AQM bottleneck (Section 4.3), microsecond timestamps written into the + Fi-Wi shim header (Section 4.2), and ML-driven MCS selection running + continuously against that centralized data (Section 15). This section + explains how the concentrator executes that pipeline with the determinism + the architecture requires — maintaining a single observable bottleneck per + airtime domain, applying ECN marks at the right moment, and keeping the + RRH free of scheduling logic. +

+ +

16.1 Why a Kernel-Bypass Data Plane

+ +

+ The Fi-Wi concentrator's latency and determinism targets strongly favor a + kernel-bypass data plane. A conventional interrupt-driven kernel path + would reintroduce jitter at exactly the point where the architecture is + trying to remove it. +

+ +

+ L4S requires ECN marks to be applied at the group queue on the same time
+ scale as a single 802.11 TXOP. The Linux kernel's
+ softirq-based packet path is subject to interrupt coalescing
+ and scheduler contention, and the resulting jitter accumulates across
+ bursts. More fundamentally: every packet that transits the kernel stack
+ competes with arbitrary OS activity for CPU time, queue depth is visible
+ to userspace only via a syscall, and the marking decision cannot be
+ co-located with the queue measurement in the same cache line.

+ +

+ Fi-Wi's concentrator data plane therefore runs via + DPDK (Data Plane Development Kit): tight busy-poll loops + on dedicated cores, with no interrupt-driven jitter. All packet operations + — receive, classify, AQM mark, forward — execute in a cache-resident loop + that preserves the single-bottleneck, fully-observable queue structure + that the rest of the architecture depends on. +

+ +

16.2 The Memory Model: IOMMU, VFIO, and Hugepages

+ +

+ DPDK allocates all packet buffers (mbufs) from hugepages, + eliminating TLB misses during packet processing. Each airtime domain's + group queue is a logically contiguous region within this space. The pool + is allocated once at startup; no per-packet memory allocation occurs on + the fast path. +

+ +

+ Each SFP+ NIC is bound to the vfio-pci driver. The system + IOMMU enforces DMA isolation: a card can only reach the memory regions + explicitly registered with it at startup. This gives the concentrator two + properties simultaneously: +

+ + + +
+
+Startup (once):
+  rte_pktmbuf_pool_create()
+    └─ VFIO registers hugepages with IOMMU
+    └─ NIC DMA engine can now reach mbuf pool directly
+
+Per-burst (dedicated lcore, busy-poll):
+  rte_eth_rx_burst(rrh_port, queue, pkts[], N)   ← NIC DMA → mbuf, no interrupt
+    └─ classify_airtime_domain(pkt)              ← (port, queue_id) → group queue index
+    └─ aqm_mark_l4s(pkt, queue_depth)            ← ECN CE if sojourn > threshold
+    └─ rte_eth_tx_burst(out_port, ...)           ← mbuf → NIC DMA, zero copy
+
+ Figure 16-1: Concentrator polling loop. No interrupts, no kernel + crossings, no per-packet allocation after startup. Queue depth and + sojourn time are visible in the same execution context as the ECN + marking decision. +
+
+ +

16.3 Airtime Domains as Hardware Queue Partitions

+ +

+ DPDK exposes each NIC's hardware receive queues independently. Fi-Wi uses + this to achieve a direct, lockless mapping from PCIe port and queue index + to airtime domain — the same logical grouping described in Section 6. Each + lcore owns a fixed set of (port, queue) pairs. Because ownership is + exclusive, there are no locks on the fast path and no shared state between + lcores during steady-state forwarding. +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
Fast-Path PropertyKernel StackFi-Wi DPDK Pipeline
Receive and Queue Observability
Interrupt model + Hardware IRQ → softirq → NAPI poll; coalescing adds + jitter + + No interrupts. Dedicated lcore polls hardware queue register directly. +
Queue depth visibilityVisible inside kernel only; userspace access requires syscall + Directly readable by AQM loop in same CPU cache line as packet pointer +
Buffer allocationPer-packet skb allocation from kernel slabPre-allocated mbuf pool; zero allocation on fast path
AQM and Forwarding
ECN marking timingMarked in kernel qdisc; subject to scheduling lagMarked in polling loop body; co-located with queue measurement
Forwarding lookupRouting table + netfilter traversal(port, queue_id) → group queue index; O(1), cache-hot
Packet copyTypically 1–2 copies through socket buffer chainZero copies; mbuf pointer passed through the pipeline
Transmit
IOMMU interactionKernel maps and unmaps DMA regions per packet + IOMMU mapping established once at pool creation; static thereafter +
+ +

16.4 The L4S Marking Loop

+ +

+ The AQM marking step is deliberately minimal. The DPDK data plane does not + run a full queue scheduler — that is the outer control loop's + responsibility (Section 5). The inner loop does one thing: read sojourn + time from the shim header (Section 4.2) and set the ECN CE codepoint if + the threshold is exceeded. +

+ +
// Per-packet in the rx → tx burst loop:
+uint64_t sojourn_ns = now_tsc() - pkt->t_ingress;
+if (sojourn_ns > THRESHOLD_NS) {
+    rte_ipv4_l4s_mark(pkt);                       // in-place, no copy
+    fiwi_meta(pkt)->ecn_flags |= ECN_CE_APPLIED;
+}
+rte_eth_tx_burst(out_port, queue_id, &pkt, 1);
+
+ +

+ Because t_ingress is written by the same lcore at enqueue, no + cross-core communication is needed to compute sojourn time at dequeue. The + marking decision is local to the polling thread. This is what Section 4.3 + means when it says AQM runs "exactly where the integrator lives": the + integrator is the group queue, the group queue is an mbuf ring in hugepage + memory, and the marking loop touches that ring on every poll cadence with + no additional indirection. +

+ +

16.5 Fault Isolation via IOMMU Groups

+ +

+ In a multi-card concentrator, each SFP+ card appears in its own IOMMU
+ group, so each card can be bound to VFIO independently and the IOMMU
+ enforces that one card's DMA cannot reach another card's memory regions.
+ The IOMMU topology therefore provides natural fault isolation at the
+ card boundary: a PCIe error or runaway DMA event from one RRH is
+ contained within its card's group and cannot corrupt the packet memory
+ of an adjacent airtime domain. This is a hardware guarantee, not a
+ software policy.

+ +

16.6 What DPDK Does and Does Not Solve

+ +

+ The kernel-bypass data plane is not a complexity cost — it is the + mechanism that justifies the RRH's simplicity. Because the + concentrator runs a deterministic, observable pipeline that applies AQM, + tracks sojourn time, and manages all descriptor posting without OS + intervention, the RRH never needs to make a queuing or scheduling + decision. It remains a pure DMA client, exactly as the silicon cost + argument in Section 4.4 requires. +

+ +

+ Incumbent distributed APs have no equivalent. Because each AP operates + autonomously, it must run its own Linux network stack, its own + qdisc, and its own firmware scheduler. The CPU carrying that + stack is the dominant gate cost per RRH (Section 4.4, silicon cost table). + A centralized DPDK pipeline eliminates that requirement across every RRH + simultaneously — not by optimizing the AP implementation, but by removing + the architectural condition that forces the CPU to exist there in the + first place. +

+ +

+ That said, DPDK solves a specific problem: it gives the concentrator a + deterministic, observable, zero-copy execution path in which queue state, + ECN marking, and packet steering remain under unified software control. It + does not solve the radio-side interface. Per-packet MCS selection, EDCA + parameter control, and TX-outcome metadata from the Wi-Fi silicon remain + the next required interface boundary — the point at which concentrator + intelligence must reach into the RRH to close the control loop. DPDK is + the precondition; radio-side per-packet programmability is what completes + it. +

+ +

16.7 DualPI2 Baseline: Control Law and Queue Structure

+ +

+ Section 16.4 described the minimal ECN marking step — reading queue state and + applying a CE mark in the fast path. That sketch is sufficient to illustrate + where marking occurs, but it elides the control structure that makes + L4S coexistence with legacy traffic work: the + dual-queue coupled AQM defined in RFC 9332. +

+ +

+ This section defines the baseline DualPI2 control law as it + would be realized inside the DPDK polling loop. Fi-Wi preserves this + dual-queue topology, coupling mechanism, and PI-based control structure, but + Section 17 replaces the underlying congestion signal with + Airtime Debt (Di), grounding the controller in + predicted wireless service time rather than raw queue occupancy. +

+ +

16.7.1 The Two Queues

+ +

+ Each airtime domain maintains two logically independent mbuf rings in the + concentrator's hugepage pool: an L4S queue for scalable + congestion-control flows (senders marking with ECT(1)), and a + Classic queue for legacy RFC 3168 flows and unmarked + traffic. Classification happens at ingress on the fast path, before the + packet is enqueued, and costs a single bitfield check on the IP ECN field: +

+ +
// Ingress classification — per-packet, inline in the rx burst loop
+uint8_t ecn = (pkt_ip->type_of_service & 0x03);
+bool is_l4s = (ecn == 0x01 || ecn == 0x03);   // ECT(1) or CE — scalable sender
+
+fiwi_meta(pkt)->queue_class = is_l4s ? QUEUE_L4S : QUEUE_CLASSIC;
+enqueue_to_domain(pkt, domain_id, fiwi_meta(pkt)->queue_class);
+
+ +

+ Both queues drain toward the same transmit burst for that airtime domain. + The scheduler services the L4S queue with a strict low-latency budget and the + Classic queue at a rate that saturates the domain's aggregate share, matching + the DualPI2 service model from RFC 9332. +

+ +

16.7.2 The Coupling Mechanism

+ +

+ The key property of DualPI2 is that the two queues are not independent. + The Classic queue's drop probability pc — computed by + a PI controller from a congestion signal representing pressure at the shared + bottleneck — also governs the L4S queue's ECN marking probability via a + coupling factor k (default 2 in the Linux + sch_dualpi2 reference implementation). +

+ +
// Outer control loop — runs on a slow timer cadence (~16 ms), same lcore,
+// non-preemptive. Not per-packet.
+double signal_classic = ewma_update(&domain->classic_signal,
+                                    ring_depth(QUEUE_CLASSIC));
+domain->pi_integral += signal_classic - TARGET_CLASSIC;
+double p_c = fmax(0.0, K_P * (signal_classic - TARGET_CLASSIC)
+                     + K_I * domain->pi_integral);        // PI controller
+
+double p_l = COUPLING_K * p_c;   // Coupled L4S marking probability
+
+// Applied per-packet in the L4S dequeue path:
+double p_l_step = (sojourn_L4S_ns > THRESHOLD_L4S_NS) ? 1.0 : p_l;
+if (rte_rand() < (uint64_t)(p_l_step * (double)UINT64_MAX))
+    rte_ipv4_l4s_mark(pkt);      // Set ECN CE in-place, no copy
+
+ +

+ In a conventional queue-based implementation, signal_classic + would be an EWMA of Classic queue depth. In Fi-Wi, that queue-derived signal + is replaced as the PI controller input by + Airtime Debt (Di), a forward estimate of wireless + service time. The DualPI2 control law, coupling mechanism, and + dual-queue topology remain unchanged; only the input signal changes. +

+ +

+ Queue depth is a lagging indicator in Wi-Fi because contention, retries, and + variable PHY rates consume airtime without necessarily appearing in buffer + occupancy. Airtime Debt provides a forward-looking signal that better matches + the true wireless bottleneck while preserving the DualPI2 coexistence + structure required for L4S and Classic traffic to share the medium. +
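+ The control law can be exercised outside the fast path. The sketch below
+ (illustrative Python; the gains, target, and class name are assumed for
+ the example, not shipped constants) keeps the DualPI2 coupling
+ p_l = k · p_c while feeding the PI controller an Airtime Debt sample
+ instead of queue depth:

```python
COUPLING_K = 2.0           # k, matching the sch_dualpi2 default of 2
K_P, K_I = 0.0005, 0.0001  # illustrative PI gains (assumed values)
TARGET_US = 2000.0         # illustrative Airtime Debt target, microseconds

class CoupledPi:
    """Per-domain controller: PI on the congestion signal, with the
    L4S marking probability derived by coupling from p_c."""
    def __init__(self):
        self.integral = 0.0

    def update(self, airtime_debt_us):
        err = airtime_debt_us - TARGET_US
        self.integral += err
        p_c = min(1.0, max(0.0, K_P * err + K_I * self.integral))
        p_l = min(1.0, COUPLING_K * p_c)  # coupled L4S mark probability
        return p_c, p_l

ctl = CoupledPi()
p_c, p_l = ctl.update(2500.0)  # debt 500 us above target: nonzero pressure
```

+ Note that only the input changes: a debt sample above target raises
+ p_c, and the coupling lifts the L4S marking probability with it.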

+ +

16.7.3 Per-Domain State and the fiwi_update Interface

+ +

+ Each airtime domain carries its own DualPI2 state alongside the + fiwi_rrh_state struct (Section 17.5). Because each lcore + owns a fixed set of domains exclusively (Section 16.8), this state is + never shared across cores — no locks, no atomics, no cache-line bouncing on + the fast path. +

+ +

+ The telemetry path (Section 17.8) delivers ground-truth airtime + measurements back to the lcore via a lockless ring carrying + fiwi_update objects. The struct is defined here because it + originates in the DPDK fast-path layer and is consumed by it; + Section 17.8 populates it from Netlink/vendor telemetry events: +

+ +
/**
+ * fiwi_update — telemetry record posted by the Netlink callback,
+ * consumed by the DPDK lcore during its scheduling loop.
+ * Allocated from fiwi_update_pool (rte_mempool); returned after use.
+ */
+struct fiwi_update {
+    uint8_t  type;          /* AIRTIME_RECONCILE (only type currently defined) */
+    uint32_t rrh_id;        /* RRH index, validated < FIWI_MAX_RRHS before enqueue */
+    uint64_t actual_us;     /* Hardware-path-to-status interval (ground truth) */
+    uint64_t expected_us;   /* Forward estimate: T_phy + T_agg at enqueue time */
+    uint32_t retry_us;      /* Observed retry airtime from telemetry metadata */
+};
+
+ +
+
+Per-domain fast-path structure (allocated in hugepages, lcore-local): + + domain[d] + ├── l4s_ring mbuf ring, N_L4S slots (RING_F_SP_ENQ | RING_F_SC_DEQ) + ├── classic_ring mbuf ring, N_CLASSIC slots (RING_F_SP_ENQ | RING_F_SC_DEQ) + ├── classic_signal EWMA accumulator for controller input + ├── pi_integral PI controller integral term + ├── p_c current Classic drop probability + ├── p_l coupled L4S mark probability (= COUPLING_K * p_c) + └── port_queue_map (PCIe port, hw queue_id) → this domain + + rrh_update_rings[d] per-RRH lockless ring (RING_F_MP_HTS_ENQ | RING_F_SC_DEQ) + fiwi_update_pool shared rte_mempool; safe to get() from non-EAL threads + +Slow-path timer (~16 ms, same lcore, non-preemptive): + ewma_update → pi_update → refresh p_c, p_l + +Fast-path (every poll cadence): + rx_burst → classify ECN → enqueue l4s / classic + dequeue l4s (strict sojourn threshold) → mark CE → tx_burst + dequeue classic (weighted, drop at p_c) → tx_burst + drain rrh_update_rings → apply fiwi_apply_updates() +
+
+ Figure 16-2: Per-domain DualPI2 state layout. All per-domain state is + lcore-local and single-writer. The update ring uses + RING_F_MP_HTS_ENQ because the Netlink callback runs on a + non-EAL thread; the lcore-side dequeue uses + RING_F_SC_DEQ (single consumer). +
+
+ +

16.8 Multi-RRH lcore Topology and Control Ownership

+ +

+ The Umber concentrator runs on a workstation-class host with a Threadripper PRO + processor and multiple PCIe-connected RRHs. This section describes how DPDK + lcore assignments map onto that hardware topology to preserve cache locality, + single-writer semantics, and deterministic fast-path execution. +

+ +

+ Each lcore owns both the DualPI2 control state (Section 16.7) and the Airtime + Debt estimator (Section 17) for its assigned RRHs. This ensures that congestion + estimation, scheduling, and ECN marking operate within a single execution context. +

+ +

16.8.1 RRH Assignment

+ + + + + + + + + + + + + + + + + +
RRH RangeAssigned lcoreAirtime Domains
0–3lcore 2domains 0–3
4–7lcore 4domains 4–7
8–11lcore 6domains 8–11
12–15lcore 8domains 12–15
16–19lcore 10domains 16–19
20–23lcore 12domains 20–23
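+ The assignment above follows a fixed stride (blocks of four RRHs on
+ even-numbered lcores starting at lcore 2), so the owning lcore can be
+ computed rather than looked up; a minimal sketch:

```python
def rrh_to_lcore(rrh_id: int) -> int:
    """Map an RRH index to its owning lcore: RRHs are assigned in
    blocks of four to even lcores starting at lcore 2."""
    return (rrh_id // 4) * 2 + 2
```

+ Because the mapping is static and collision-free, per-RRH state never
+ migrates between cores, preserving the single-writer property of
+ Section 16.7.3.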
+ +

16.8.2 Control and Data Flow

+ +

+ Each RRH lcore applies its per-domain DualPI2 loop as described in Section 16.7, + with Airtime Debt (Di) serving as the PI controller + input in place of queue depth. This presents a single, airtime-grounded congestion + signal per domain to the L4S control loop. +

+ +

+ Downlink traffic is classified at ingress and directed to the appropriate + airtime domain. The owning lcore performs scheduling, ECN marking, and transmission. + Uplink traffic follows the reverse path toward the WAN interface. +

+ +

+ Because each lcore exclusively owns its RRHs and associated Airtime Debt state, + congestion estimation, scheduling, and ECN marking operate without cross-core + coordination. This preserves deterministic fast-path behavior. +

+ +
+
+Ingress → classify → assign domain → lcore owns RRH + → compute D_i → schedule → transmit + → measure → update C_i/R_i → recompute D_i +
+
+ Figure 16-3: lcore ownership of RRHs and control loop execution. +
+
+ +
+

17. Airtime-Assisted ECN: Airtime Debt as the Congestion Signal

+ +

+ Fi-Wi does not infer congestion from queue depth alone. The bottleneck + is the wireless medium, and the relevant state variable is the time + required to successfully transmit packets over that medium. The system + replaces the queue sojourn-time inputs of traditional PI2 + controllers with Airtime Debt (Di), + converting a stochastic medium into a controlled service process. +

+ +

17.1 The Bottleneck is Airtime, Not a Queue

+

+ In traditional L4S systems, ECN marking is derived from queue sojourn
+ time, which assumes a stationary service rate. That assumption fails in
+ Wi-Fi because service time varies per client based on PHY rates,
+ contention, and retries. Fi-Wi replaces backward-looking buffer metrics
+ with a forward model of wireless service time. The
+ Concentrator maintains this model continuously and makes scheduling
+ decisions on predicted service outcomes, not observed
+ queue growth. This approach provides the AQM with a signal that has a
+ more stationary distribution than raw queue depth over a variable-rate
+ medium, improving marking coherence and L4S stability.

+ +

17.2 Airtime Debt Model (Per RRH)

+

+ For each RRH (i), the Concentrator maintains a real-time + Airtime Debt (Di): +

+
+ Di = Ai + Ci + Ri +
+ + +

+ 17.3 Measuring Ground Truth (Hardware-Path-to-Status) +

+

+ The "Ground Truth" for airtime consumption is measured as the interval + from + descriptor posting into the hardware transmit path to + TX Status (hardware completion signal via + driver/vendor-specific telemetry events such as mt76 TX + status reports). This interval captures the full service duration, + including the full wait for TXOP eligibility (AIFS + backoff), + aggregation delay, and all hardware-level retransmission attempts. +

+ +

17.4 Predicted Sojourn Time (Si)

+

+ For any packet, the + Predicted Sojourn Time (Si) is a forward + estimate of delivery time: +

+
+ Si(packet) = Di + Tservice(packet) +
+

+ The Tservice calculation is decomposed into: + Tagg (aggregation hold time) + + Tphy (modulation time at current MCS) + + Tretry (statistical retry overhead). This + estimate is packet- and client-specific; it is not a constant service + quantum. +
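+ A worked instance of this decomposition (illustrative Python; the
+ microsecond figures are invented for the example, not measurements):

```python
def t_service_us(t_agg_us, t_phy_us, t_retry_us):
    """Per-packet service estimate: aggregation hold time plus PHY
    modulation time at the current MCS plus statistical retry overhead."""
    return t_agg_us + t_phy_us + t_retry_us

def predicted_sojourn_us(d_i_us, t_agg_us, t_phy_us, t_retry_us):
    """S_i(packet) = D_i + T_service(packet)."""
    return d_i_us + t_service_us(t_agg_us, t_phy_us, t_retry_us)

# An RRH carrying 1800 us of Airtime Debt; the next packet's estimate is
# 120 us aggregation hold + 85 us at the current MCS + 30 us expected retries.
s_i = predicted_sojourn_us(1800, 120, 85, 30)
```

+ Because T_service is recomputed per packet and per client, two packets
+ enqueued back-to-back for different STAs can carry very different
+ S_i values against the same debt.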

+ +

17.5 Implementation: DPDK Fast Path State

+

+ The Concentrator tracks RRH state in hugepage-backed memory. The DPDK + lcore is the sole writer of fiwi_rrh_state; telemetry + updates are applied via per-RRH lockless ring buffers to preserve + single-writer semantics and microsecond-level determinism. +

+
+
+struct __rte_cache_aligned fiwi_rrh_state {
+    uint32_t rrh_id;
+    uint64_t D_i;            /* Total airtime debt (A+C+R) */
+    
+    /* Component Estimates (microseconds) */
+    uint64_t A_i;            /* Total scheduled airtime (queued + in-flight) */
+    uint32_t C_i;            /* Estimated contention delay */
+    uint32_t R_i;            /* Estimated retry penalty */
+
+    /* Feedback & Synchronization */
+    uint64_t last_update_us;     /* Timestamp of last lcore application */
+    uint64_t last_tx_status_us;  /* TSC of last hardware completion */
+    uint32_t moving_avg_per;     /* Recent PER (Section 15.4) */
+};
+    
+
+

+ Di is recomputed in the DPDK fast path after + each update to Ai, Ci, or Ri. The loop updates Ai when packets are assigned + to an RRH and decrements it upon TX completion using telemetry feedback. +

+ +

17.6 Authoritative Congestion Signaling

+

+ Airtime Debt replaces physical queue depth as the authoritative input
+ for the Dual-Queue AQM, providing a single, consistent congestion
+ signal across all RRHs without relying on a shared physical buffer.

+ + +

17.7 Slow-Path Observability

+

+ While Di provides fast-path control, the system + monitors + Airtime Utilization (Uair = ΔTX_DURATION / + Δt) + as a slow-path observability metric. This metric is used to identify + external interference patterns and long-term capacity shifts in the + airtime domain, calibrating the confidence weights applied to the + Ci and Ri estimators. +
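+ As a sketch (illustrative Python; names are assumptions), Uair
+ is a plain ratio of counter deltas over the observation window:

```python
def airtime_utilization(tx_dur_prev_us, tx_dur_now_us, t_prev_us, t_now_us):
    """U_air = delta TX_DURATION / delta t over the observation window."""
    return (tx_dur_now_us - tx_dur_prev_us) / (t_now_us - t_prev_us)

# Over a one-second window the domain accumulated 420 ms of TX airtime.
u_air = airtime_utilization(0, 420_000, 0, 1_000_000)
```

+ A rising Uair while Di stays flat points at airtime
+ consumed by something the debt model does not schedule, i.e. external
+ interference.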

+ +

17.8 Telemetry Feedback: Netlink Calibration

+

+ The following logic processes TX_STATUS events from the + mt76 driver. Completion data is retrieved from a + pre-allocated mempool and posted to a per-RRH lockless ring to reconcile + state without lcore contention. +

+
+
+/* Telemetry Path (Netlink Callback) */
+static int fiwi_handle_mt76_telemetry(struct nl_msg *msg, void *arg) {
+    struct nlattr *attrs[MT76_ATTR_MAX + 1];
+    nla_parse(attrs, MT76_ATTR_MAX, genlmsg_attrdata(nlmsg_data(nlmsg_hdr(msg)), 0),
+              genlmsg_attrlen(nlmsg_data(nlmsg_hdr(msg)), 0), NULL);
+
+    if (!attrs[MT76_ATTR_TX_DURATION] || !attrs[MT76_ATTR_RRH_ID])
+        return NL_SKIP;
+
+    uint32_t rrh_id = nla_get_u32(attrs[MT76_ATTR_RRH_ID]);
+    if (rrh_id >= FIWI_MAX_RRHS) return NL_SKIP;
+
+    struct fiwi_update *update;
+    if (rte_mempool_get(fiwi_update_pool, (void**)&update) < 0) return NL_SKIP;
+
+    update->type = AIRTIME_RECONCILE;
+    update->rrh_id = rrh_id;
+    update->actual_us = nla_get_u64(attrs[MT76_ATTR_TX_DURATION]);
+    update->retry_us = nla_get_u32(attrs[MT76_ATTR_RETRY_DURATION]);
+    update->expected_us = estimate_service_time(msg); 
+
+    rte_ring_enqueue(rrh_update_rings[rrh_id], update);
+    return NL_OK;
+}
+    
+
+ +

17.8.1 Telemetry Application (DPDK lcore)

+

+ The DPDK lcore closes the control loop by draining the update ring. It + decrements the backlog and calibrates penalties to ensure the + Airtime Debt remains an accurate representation of + physical medium pressure. +

+ +
+
+/* DPDK lcore: apply telemetry updates */
+static inline void
+fiwi_apply_updates(struct fiwi_rrh_state *rrh, struct rte_ring *ring)
+{
+    struct fiwi_update *upd;
+    while (rte_ring_dequeue(ring, (void**)&upd) == 0) {
+        /* 1. Discharge processed backlog */
+        rrh->A_i = (rrh->A_i > upd->actual_us) ? (rrh->A_i - upd->actual_us) : 0;
+
+        /* 2. Update contention estimate (drift from expected modulation time) */
+        uint32_t drift = (upd->actual_us > (upd->expected_us + upd->retry_us)) ? 
+                         (upd->actual_us - upd->expected_us - upd->retry_us) : 0;
+        rrh->C_i = (rrh->C_i * 7 + drift) >> 3;
+
+        /* 3. Update retry penalty */
+        rrh->R_i = (rrh->R_i * 7 + upd->retry_us) >> 3;
+
+        /* 4. Recompute total Airtime Debt (D_i) */
+        rrh->D_i = rrh->A_i + rrh->C_i + rrh->R_i;
+
+        rrh->last_tx_status_us = rte_get_tsc_cycles();
+        rte_mempool_put(fiwi_update_pool, upd);
+    }
+}
+    
+
+
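+ The `(x * 7 + sample) >> 3` updates in steps 2 and 3 implement an
+ integer EWMA with α = 1/8. A small model (illustrative Python) shows the
+ filter converging toward a steady input, settling a few microseconds low
+ because the right shift truncates:

```python
def ewma8(avg: int, sample: int) -> int:
    """Integer EWMA with alpha = 1/8, as used for C_i and R_i."""
    return (avg * 7 + sample) >> 3

avg = 0
for _ in range(100):
    avg = ewma8(avg, 800)  # feed a constant 800 us retry penalty
# avg settles at 793: the truncating shift stops it just under the input
```

+ The truncation bias is bounded by the EWMA divisor, so for
+ microsecond-scale penalties it is negligible against the quantities
+ being estimated.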

17.9 Visualization: The Airtime Debt Control Loop

+ +
+
+ Figure 17-1: Airtime Debt Control Loop showing Forward Service Model and Ground Truth Calibration + + + +

Figure 17-1: The Fi-Wi recursive control loop for stabilizing stochastic wireless service.

+
+ +

Diagram Overview: Closing the Feedback Loop

+

+ Figure 17-1 synthesizes the technical components of the Airtime Debt model into a continuous functional loop. The architecture separates the Speculative Forward Path (Fast Path) from the Calibrated Feedback Path (Telemetry Path). +

+ +
+ 1. Forward Service Model (Prediction): + Every ingress packet triggers a per-STA calculation of Tservice. This is not a global constant; it is a client-specific sum of aggregation hold time (Tagg), PHY modulation time (Tphy), and predicted retry overhead (Tretry) based on that STA's specific RF context. +
+ +
+ 2. Debt Update & Marking Decision: + The predicted Tservice is added to the RRH's Ai (Backlog). If the resulting Predicted Sojourn Time (Si) exceeds Tlow, an ECN CE mark is applied immediately in the DPDK fast path. This provides the "Virtual Backpressure" that stabilizes L4S senders. +
+ +
+ 3. Ground Truth Calibration (Correction): + As the packet is dispatched via DMA, the hardware records the precise interval from descriptor posting into the hardware transmit path to TX Status completion. The Telemetry Path calculates the Drift—the delta between the forward prediction and physical reality. +
+ +
+ 4. Estimator Refinement: + This drift is fed back into the EWMA filters for Ci (Contention) and Ri (Retries). This ensures that subsequent predictions for the same STA or RRH domain are corrected for changing medium pressure, effectively regularizing the stochastic nature of the 802.11 medium. +
+
+ + +

18. Summary

+ +

+ The core idea of Umber’s Fi-Wi architecture is to make a building full of + Wi-Fi radios behave like a + large number of predictable, low-latency, cellularized + bottlenecks + (often cell-per-room) that integrate cleanly with L4S, and to avoid Wi-Fi + collapse in the regime that matters most for users: + tail latency. +

+ +

We do that by:

+ + + +

Compared to a building filled with independent APs, Fi-Wi provides:

+ + + +
+ +

+ Appendix A: 802.11 Backoff Timing & Collapse Dynamics +

+ +

+ This appendix explains the precise behavior of the 802.11 CSMA/CA backoff + algorithm, why the freeze/resume mechanics create strong nonlinearities + under load, and how this drives the collapse behavior discussed in + Sections 2 and 6. We also include reference diagrams, accurate pseudocode, + and probability scaling that shows why birthday-paradox collisions appear + long before PHY saturation. +

+ +

A.1 Overview

+ +

The 802.11 MAC is built around two core mechanisms:

+ + + These mechanisms interact in a way that works beautifully for light to + moderate station counts, but begins to break down sharply once multiple + stations become backlogged. Collapse is not a "bug"; it is the + mathematically expected outcome under high concurrency. +

A.2 Backoff Decrements Only During Idle SlotTime

+ +

When a station has a frame to send, it chooses a random integer:

+ +
B ← Uniform[0, CW]
+ where CW is the contention window. The counter + decrements only when: + + +

+ If any of these conditions break during a SlotTime boundary, backoff does + not decrement. +

+ +

Diagram A-A — Backoff Countdown with Idle Slots and Freezes

+ +
+Time →  ───────────────────────────────────────────────────────────────────────→
+
+Channel:    Busy TXOP      Idle slot     Idle slot     Busy TXOP      Idle ...
+           ────────────┐  ┌─────────┐   ┌─────────┐  ┌───────────┐
+                       │  │ slot OK │   │ slot OK │  │collision  │
+                       └──┘         └───┘         └──────────────┘
+
+Backoff B:   [frozen]        B:=B-1       B:=B-2        [frozen]       B:=B-3
+
+

+ This "idle-slot-only" decrement rule is the source of nonlinear timing + behavior. +

+ +

A.3 Freeze Conditions: Physical Busy + NAV Busy

+ +

+ The backoff counter freezes immediately under either + condition: +

+ + + +

+ NAV counts down in microseconds, not slot units, so a NAV may span dozens + or hundreds of SlotTimes, creating long frozen periods. +

+ +

Diagram A-B — NAV Freezes Backoff for Entire Duration

+ +
+Frame overheard with Duration=480µs
+     NAV := 480 µs  ─────────────────────────────────────────────▶ 0 µs
+
+Backoff:
+   Frozen until NAV==0
+   Then: AIFS idle interval → first idle SlotTime → resume B countdown
+
+

A.4 Full Backoff State Machine

+ +

+ The following pseudocode describes the real 802.11 backoff and retry + machine: +

+ +
+# Variables
+B   = random integer in [0, CW]
+CW  = CWmin initially, doubled on failures
+NAV = virtual carrier sense (µs timer)
+Slot = 9 microseconds (typical)
+AIFS = access category-specific inter-frame space
+
+while True:
+
+    wait_until( medium_idle() and NAV == 0 )
+    wait(AIFS)  # must see idle for entire AIFS
+
+    # Backoff countdown
+    while B > 0:
+
+        if medium_idle() and NAV == 0:
+            wait(Slot)
+            if medium_idle() and NAV == 0:
+                B -= 1      # decrement only if entire slot was idle
+        else:
+            # Freeze B until another idle AIFS appears
+            wait_until( medium_idle() and NAV == 0 )
+            wait(AIFS)
+
+    # Backoff fully expired, attempt TX
+    transmit()
+
+    if ack_received():
+        CW = CWmin
+        B = random(0, CW)
+    else:
+        CW = min(2 * CW, CWmax)
+        B = random(0, CW)
+
+

+ The critical detail: + multiple stations freeze and resume their counters in lock-step + after every long TXOP or NAV, making collisions statistically inevitable + as station count grows. +

+ +

+ A.5 Collision Probability and the Birthday Paradox +

+ +

+ Each station independently picks a backoff slot in [0, CW]. + The probability that no two stations choose the same slot is: +

+ +
+P(no collision) = (CW+1)! / [(CW+1 - n)! · (CW+1)^n]
+
+

where n = number of active contenders. Therefore:

+ + + +

Diagram A-C — Collision Probability vs. Number of Stations

+ +
+Stations (n) →   4     6      8      10     12     16
+--------------------------------------------------------
+P(collision)   ~33%   66%    88%    97%    99.7%  >99.9%
+
+(CWmin = 15, i.e. 16 backoff slots; values from the formula in A.5)
+
+

+ This is the MAC-level reason collapse begins long before PHY capacity is + reached. +

+ +

A.6 Why Collapse Appears as 2–3 ms TXOP Tails

+ +

Once collisions become frequent:

+ + + +

Diagram A-D — TXOP Length as Collapse Indicator

+ +
+Healthy:    T50 ≈ 200–500 µs,   T95 < 0.8 ms,    T99 < 1.2 ms
+Degraded:   T95 = 1–2 ms,       T99 = 2–3 ms
+Collapsed:  T95 > 2 ms AND      T99 ≥ 3 ms (dominant channel monopolization)
+
+

+ A single 3 ms TXOP already violates the bottleneck-delay budget required + by L4S (≈250–300 µs). With multiple stations taking such TXOPs, service + gaps can reach 10–50 ms for unlucky flows. +

+ +

A.7 Multi-Station Synchronization Example

+ +

+ The following diagram illustrates how multiple stations become + phase-aligned: +

+ +
+Time →  ────────────────────────────────────────────────────────────────→
+
+TXOP1 by STA-A:   ────────────────
+NAV for others:   ──────────────── (all B frozen)
+
+After NAV expires:
+All stations wait AIFS → begin countdown
+Slot 1:  B_A=2, B_B=4, B_C=2
+Slot 2:  B_A=1, B_B=3, B_C=1
+Slot 3:  B_A=0, B_B=2, B_C=0   → STA-A and STA-C transmit simultaneously → collision
+
+

+ This synchronization is why the birthday paradox applies so strongly in + Wi-Fi. +

+ +

A.8 Why Fi-Wi Breaks the Cycle

+ +

Fi-Wi removes the “every station fends for itself” randomness by:

+ + + Thus Fi-Wi converts Wi-Fi from a chaotic CSMA/CA system into a + scheduled, low-latency cellular MAC. + +
+ +

+ Appendix B: Channel State Information (CSI) and Learning-Enhanced Fi-Wi +

+ +

+ This appendix describes how Fi-Wi can use Channel State Information (CSI) + from each RRH, together with learning models (e.g. LSTM or TCN), to + improve grouping, scheduling, redundancy, and control beyond what is + possible with queue-based feedback alone. +

+ +

B.1 What CSI provides in a Fi-Wi context

+ +
+ Concept: What is CSI?
+ Imagine shouting in a complex room. You hear echoes bouncing off walls, + furniture, and people. If you analyze those echoes, you can map the + environment.
+
+ In Wi-Fi, Channel State Information (CSI) is that map. It + describes exactly how the radio wave traveled from the transmitter to the + receiver—including all the bounces (multipath), fading, and phase shifts + caused by the physical environment. + + Traditional APs throw this data away after decoding the packet. Fi-Wi + sends it to the Concentrator, allowing the system to "see" the RF + environment and mathematically calculate how to steer beams or combine + signals.
+
+ Wi-Fi Sensing: Because physical objects reflect radio + waves, any movement in the room changes the CSI pattern. By monitoring + these changes over time, Fi-Wi can detect presence—such as a person + walking or a pet breathing—turning the network into a ubiquitous sensor + without cameras. +
+ +

+ Modern 802.11 chipsets can export CSI per subcarrier or + per resource unit: complex-valued estimates of the channel between an RRH + and a station (STA). In a Fi-Wi deployment, each RRH periodically reports: +

+ + + +

+ Thanks to centralized time synchronization and packet memory, the + concentrator can align CSI reports with: +

+ + + +

This gives Fi-Wi a rich per-domain, per-STA time series:

+ + + +

B.2 What we want to predict

+ +

+ Using this data, Fi-Wi can learn models to help answer questions such as: +

+ + + +

These predictions can feed directly into:

+ + + +

B.3 Example model: LSTM / TCN

+ +

+ One reasonable approach is to use a sequence model such as an LSTM or + Temporal Convolutional Network (TCN) per airtime domain: +

+ +
Input features (per timestep):
+  - queue depth q_k
+  - marking probability p_k
+  - throughput, PER, retries
+  - per-RRH CSI summary (e.g. dominant eigenvalues/eigenvectors)
+  - beacon power settings, channel, bandwidth
+
+Outputs:
+  - predicted effective capacity C_eff,k+1
+  - predicted collapse risk score
+  - recommended group reconfiguration / beacon adjustments (optional)
+
+

A higher-level policy layer then uses these predictions to:

+ + + +

+ The key point is that Fi-Wi has access to the + joint state across all RRHs—queues, CSI, MAC outcomes, + and beacon configuration—so learning can be done on a true building-scale + view rather than a per-AP snippet. +
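As a deliberately simplified illustration of the input side, the per-domain time series can be assembled into fixed-length windows for any sequence model. The class and field names below are illustrative, and the LSTM/TCN itself is left as a pluggable component:

```python
from collections import deque

class DomainFeatureWindow:
    """Rolling window of per-airtime-domain observations, shaped for a
    sequence model (LSTM/TCN). Field names are illustrative."""

    def __init__(self, window=32):
        self.window = deque(maxlen=window)

    def push(self, q_depth, p_mark, throughput, per, retries, csi_eig):
        # One timestep: queue depth, marking probability, MAC outcomes,
        # and a per-RRH CSI summary (list of dominant eigenvalues).
        self.window.append([q_depth, p_mark, throughput, per, retries, *csi_eig])

    def as_matrix(self):
        # (timesteps x features) input for the sequence model
        return [list(row) for row in self.window]
```

A trained model would consume `as_matrix()` per domain and emit the predicted effective capacity and collapse-risk score described above.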

+ +

+ B.4 The Non-Linear Control Policy (Feature Vectors) +

+ +

+ While the PI² controller (Section 5.2) provides a robust baseline using + linear control theory, the wireless medium is inherently non-linear. A + small drop in SNR can cause a discrete, non-linear step-down in MCS, + cutting capacity by half in microseconds. A linear controller often reacts + too slowly to these step-changes. +

+ +

+ Because the Concentrator terminates both the MAC (Inner Loop) and L4S + (Outer Loop), it possesses a complete, global view of the system state. + This allows Fi-Wi to implement a + Non-Linear Marking Signal derived from a rich real-time + feature vector: +

+ +
+
+Feature Vector x(t) = [
+   MCS_t,          // Current Modulation (Capacity potential)
+   PHY_Rate_t,     // Raw drain rate
+   RTT_outer,      // End-to-end latency (Sojourn + Flight)
+   Q_depth_t,      // Current backlog
+   d_arrival/dt    // Arrival rate gradient (ARM Policer)
+]
+  
+
+ +

+ Optimization Objective: Efficiency vs. Latency
+ The system uses this vector to solve the fundamental Wi-Fi trade-off: + Aggregation Efficiency vs. Serialized Latency. +

+ + + +

+ This creates a Non-Linear Marking Signal that optimizes + Throughput per Microsecond of Latency, rather than simply + targeting a fixed queue depth. +
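One way to picture such a policy is a hand-written heuristic standing in for the learned non-linear function — all coefficients and thresholds below are illustrative, not tuned values from the system:

```python
def marking_probability(mcs, prev_mcs, q_depth, q_target, arrival_gradient):
    """Non-linear marking sketch: a PI-style base term on queue depth,
    plus step responses to MCS downgrades and arrival-rate surges."""
    # Base term: linear in queue excess, like a simplified PI2 signal
    p = max(0.0, min(1.0, (q_depth - q_target) / (4.0 * q_target)))
    if mcs < prev_mcs:
        # Discrete capacity step-down: react immediately, not gradually
        p = min(1.0, p + 0.25 * (prev_mcs - mcs))
    if arrival_gradient > 0:
        # ARM policer input: pre-empt queue growth before it materializes
        p = min(1.0, p + 0.1 * arrival_gradient)
    return p
```

The point of the sketch is the shape, not the numbers: the MCS term injects a step change into the marking signal the instant capacity halves, which a purely queue-driven linear controller cannot do.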

+ +
+ +

Appendix C: Latency Hiding via Scatter-Gather DMA

+ +

+ Early architectural models of C-RAN often assumed a "Store-and-Forward" + approach, where full packets must be buffered at the edge to meet timing. + Fi-Wi eliminates this inefficiency by leveraging the natural physics of + the 802.11 air interface. We utilize a + Scatter-Gather DMA engine with + Preamble Hiding to enable a "Thin RRH" design with + minimal local SRAM. +

+ +

C.1 The "Preamble Shield" Physics

+ +

+ The critical timing constraint in Wi-Fi is the transition from "Decision + to Transmit" to "Energy on Air." However, the 802.11 PHY does not transmit + user data immediately. Every transmission begins with a PHY Preamble + (PLCP) and MAC Headers. +

+ +
+ Time-Domain View of a Transmission Start:
+
+ T=0 µs         T=5 µs                        T=24 µs (approx)
+ |              |                             |
+ | TX Trigger   | Preamble & Headers          | Payload Data Starts...
+ [ MAC Logic ]->[/////////////////////////][......................]
+                 ^                            ^
+                 |                            |
+      Source: Local RRH SRAM       Source: Host Concentrator DRAM
+      (Instant Access)             (Fetched via Fiber)
+ +

+ The Insight: The transmission of the Preamble and Headers + takes roughly 20–40 µs (depending on PHY generation). The + round-trip time to fetch payload data over 100m of PCIe-over-Fiber is + roughly 2–5 µs. +

+ +

+ Consequently, the fetch latency is completely "hidden" behind the + transmission of the headers. The payload data arrives at the RRH's small + FIFO well before the PHY is ready to modulate it. +
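The timing claim reduces to simple arithmetic. A sketch using the figures from the text (20–40 µs of preamble vs. 2–5 µs of fiber round trip; the function name is ours):

```python
def fetch_is_hidden(preamble_us, fiber_rtt_us, fabric_us=0.0):
    """True if the remote payload fetch completes before the locally
    stored preamble/headers finish transmitting on the air."""
    return fiber_rtt_us + fabric_us < preamble_us

# Worst case from the text: shortest preamble, longest fetch
print(fetch_is_hidden(preamble_us=20.0, fiber_rtt_us=5.0))  # still hidden
```

Even in the worst case the fetch has a 4x margin, which is what makes the "Thin RRH" with a few kilobytes of FIFO viable.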

+ +

C.2 Scatter-Gather Architecture

+ +

+ Instead of a large packet buffer, the Fi-Wi RRH implements a + Scatter-Gather DMA engine that composes frames on the fly from two + distinct memory regions: +

+ +
    +
  1. + Template RAM (Local RRH SRAM): Stores 802.11 MAC + headers, PLCP headers, and delimiter signatures. This memory is small + (< 16 KB), fast, and populated by the Concentrator during the + descriptor posting phase. +
  2. + +
  3. + Payload Buffer (Remote Concentrator DRAM): Stores the + actual 802.3 Ethernet payloads. These remain in the host server's memory + until the exact moment of transmission. +
  4. +
+ +

C.3 The Transmit Sequence

+ +
    +
  1. + Descriptor Posting: The Concentrator posts a descriptor + to the RRH. This descriptor points to the header in Local RAM and the + payload in Remote DRAM. +
  2. + +
  3. + Contention: The RRH MAC performs EDCA backoff. No data + is moved during this phase. +
  4. + +
  5. + TX Trigger: When backoff reaches zero, the MAC + immediately begins transmitting the Preamble from Local RAM. +
  6. + +
  7. + Just-in-Time Fetch: Simultaneously with the Preamble + start, the DMA engine issues a read request to the Concentrator for the + payload data. +
  8. + +
  9. + Cut-Through: Data returns from the fiber, flows into a + small speed-matching FIFO (e.g., 4 KB), and flows directly into the PHY + serialization path immediately following the header. +
  10. +
+ +

C.4 Solving the Retry Timing (SIFS)

+ +

+ A common objection to C-RAN is the SIFS deadline (16 µs) required for + retries. If a transmission fails, the station must retransmit immediately. +

+ +

+ With Scatter-Gather, the RRH does not need to buffer the
+ packet for retries. If the expected ACK does not arrive (an implicit
+ NACK), the MAC simply resets the Scatter-Gather engine. It re-transmits
+ the Preamble (from Local RAM) while re-issuing the DMA fetch (from Remote
+ RAM). Because the fiber latency (~5 µs) is significantly shorter than the
+ SIFS + Preamble duration, the data again arrives in time.

+ +

C.5 Architectural Benefits

+ + + +
+ +

+ Appendix D: 802.11ax/be Features and Fi-Wi Integration +

+ +

+ Modern Wi-Fi standards — particularly 802.11ax (Wi-Fi 6/6E) and 802.11be + (Wi-Fi 7) — introduce features that appear to address some of the same + problems as Fi-Wi: uplink scheduling, spatial reuse, and multi-AP + coordination. This appendix clarifies how these features relate to Fi-Wi's + architecture, where they're complementary, and why they don't eliminate + the need for Fi-Wi's centralized data-plane approach. +

+ +

+ Key takeaway: 802.11ax/be features like trigger frames + and multi-AP coordination are valuable enhancements that Fi-Wi can + leverage when client support is available, but they operate at a different + architectural level (per-AP MAC features vs. building-scale data-plane + unification) and cannot replace Fi-Wi's core innovations: centralized + queues, shared state, L4S marking coordination, and dynamic RF grouping + across the entire building. +

+ +

D.1 Trigger Frames and Uplink Scheduling

+ +

+ 802.11ax introduced trigger frames (TF) to enable + centralized uplink scheduling. Instead of clients contending for the + channel using stochastic EDCA backoff, the AP sends a trigger frame that + grants specific clients permission to transmit on specific OFDMA resource + units (RUs) or spatial streams at a specific time. +

+ +

What trigger frames provide:

+ + + +

How trigger frames align with Fi-Wi:

+ +

+ Trigger frames match Fi-Wi's philosophy of centralized scheduling rather + than distributed contention. In a Fi-Wi deployment where RRHs support + 802.11ax and clients support uplink OFDMA/MU-MIMO, the concentrator can: +

+ + + +

Reality check — client support in 2025:

+ +

+ While 802.11ax products have been certifying since 2019, uplink OFDMA
+ support remains inconsistent.
+ Crucially, trigger frames only control 802.11ax/be clients; legacy
+ pre-Wi-Fi 6 devices (802.11ac handsets, older IoT) are invisible to this
+ schedule.
+ These legacy clients cannot parse the trigger, so they continue to contend
+ via random EDCA, acting as unmanaged interference sources. In contrast,
+ Fi-Wi's reception diversity (Section 8.1) enhances uplink reliability for
+ all clients, regardless of generation, by combining signals from
+ multiple RRHs.

+ +

+ D.2 Why Trigger Frames Don't Eliminate the Need for Fi-Wi +

+ +

+ A natural question: "If 802.11ax APs can use trigger frames for uplink + scheduling, why do we need Fi-Wi's centralized architecture?" +

+ +

+ Answer: Trigger frames address only a + small subset of the problems Fi-Wi solves, and even for + uplink scheduling, they provide per-AP control, not building-scale + coordination. +

+ +

What trigger frames do NOT provide:

+ +
    +
  1. + Centralized queues across APs: Even with trigger + frames, each AP maintains its own independent downlink and uplink + queues. There's no shared queue state, no unified bottleneck, and no + coordinated ECN marking across APs. +
  2. + +
  3. + Shared state: Trigger-capable APs still operate + autonomously. They don't share CSI, retry statistics, airtime usage, or + queue metrics. Each AP makes trigger scheduling decisions based only on + its local view. +
  4. + +
  5. + Coordinated L4S marking: There's no mechanism in + 802.11ax for multiple APs to coordinate ECN marking or present a single + logical bottleneck to L4S. Each AP marks (or doesn't mark) + independently. +
  6. + +
  7. + Dynamic RF grouping: 802.11ax APs don't dynamically + reconfigure which radios share airtime resources based on interference, + CSI structure, or collapse risk. They're fixed islands. +
  8. + +
  9. + Tail latency control: Trigger frames help with uplink + efficiency, but they don't address the fundamental problem of hidden + queues, uncontrolled aggregation, and tail latency blowup under load + across a multi-AP building. +
  10. +
+ +

D.3 OFDMA Resource Units and Airtime Domains

+ +

+ 802.11ax OFDMA subdivides a channel into resource units (RUs). In Fi-Wi, + an airtime domain is a logical entity representing a + shared RF resource. OFDMA RUs provide + finer-grained subdivision of that airtime resource. +

+ +

Conceptually:

+ + + +

+ This does not change the fact that all RRHs in that + airtime domain share a single group queue and marking point. It simply + allows the service process to be more efficient. +

+ +

D.4 BSS Coloring and Spatial Reuse

+ +

+ 802.11ax BSS coloring allows STAs to distinguish between intra-BSS frames + (same color) and inter-BSS frames (different color), enabling more + aggressive spatial reuse. +

+ +

+ Relationship to Fi-Wi RF grouping: Fi-Wi's dynamic RF + grouping (Section 6) serves a similar but more sophisticated purpose. + Fi-Wi uses richer information (CSI, retry statistics, airtime) to decide + grouping, not just RSSI thresholds. In a Fi-Wi deployment, the + concentrator can assign BSS colors to RRHs strategically: RRHs in the same + airtime domain get the same color, while isolated domains get different + colors. +

+ +

D.5 802.11be (Wi-Fi 7) Multi-AP Coordination

+ +

+ 802.11be (Wi-Fi 7) introduces + multi-AP coordination features that appear to move in + Fi-Wi's direction: +

+ + + +

+ How these relate to Fi-Wi: These features acknowledge the + problem of autonomous APs but approach it incrementally. 802.11be uses + distributed AP-to-AP messaging, which limits scale and speed. Fi-Wi + centralizes the data plane, enabling deeper coordination than distributed + messaging can achieve. +

+ +

D.6 Deployment Strategy: Mixed Client Populations

+ +

+ A key advantage of Fi-Wi's architecture is that it + degrades gracefully with mixed client populations and + doesn't require forklift client upgrades. +

+ +

Client capability tiers in a 2025 deployment:

+ +
    +
  1. + Legacy 802.11ac and earlier: No trigger frame support, + no OFDMA, no BSS coloring. +
      +
    • + Fi-Wi provides: centralized downlink queuing, L4S marking, reception + diversity on uplink, beacon shaping to reduce contention. +
    • + +
    • + Result: Significantly better latency and stability than traditional + multi-AP, even without 802.11ax features. +
    • +
    +
  2. + +
  3. + 802.11ax with partial features: May support downlink + OFDMA, BSS coloring, some power save enhancements, but not uplink OFDMA + or uplink MU-MIMO. +
      +
    • + Fi-Wi provides: All of the above, plus downlink MU-OFDMA where + beneficial, coordinated BSS coloring across RRH groups. +
    • + +
    • + Result: Better spatial reuse and efficiency, still robust to clients + that don't support full 802.11ax. +
    • +
    +
  4. + +
  5. + 802.11ax with full features: Supports uplink OFDMA and + uplink MU-MIMO via trigger frames. +
      +
    • + Fi-Wi provides: All of the above, plus trigger-based uplink + scheduling, uplink MU-OFDMA for small packets, coordinated + uplink/downlink airtime management. +
    • + +
    • + Result: Bidirectional sub-millisecond latency control, maximum + airtime efficiency. +
    • +
    +
  6. + +
  7. + 802.11be (Wi-Fi 7): Adds MLO, 320 MHz channels, + 4096-QAM, possibly multi-AP coordination support. +
      +
    • + Fi-Wi provides: Can leverage MLO via concentrator coordination + (Section 13.3), wider channels for capacity, and potentially + integrate with 802.11be multi-AP features while maintaining superior + shared-state coordination. +
    • + +
    • + Result: Cutting-edge performance while maintaining backward + compatibility. +
    • +
    +
  8. +
+ +

Deployment strategy:

+ + + +

+ D.7 Summary: 802.11ax/be as Enhancements, Not Replacements +

+ +

+ 802.11ax and 802.11be introduce valuable features — trigger frames, OFDMA, + BSS coloring, multi-AP coordination — that align with Fi-Wi's centralized + control philosophy and can enhance Fi-Wi deployments when clients support + them. However: +

+ +
    +
  1. + These features do not eliminate the need for Fi-Wi's + architecture. + They provide per-AP enhancements and limited inter-AP coordination, but + they cannot create the unified data plane, shared state, and + building-scale control that Fi-Wi provides. +
  2. + +
  3. + Fi-Wi is designed to work with or without them. Core + benefits (centralized queues, L4S marking, tail latency control) are + independent of client 802.11ax/be support. +
  4. + +
  5. + Fi-Wi leverages them when available. As client + capabilities improve, Fi-Wi automatically benefits from trigger-based + uplink scheduling, OFDMA efficiency, and other enhancements without + requiring architectural changes. +
  6. +
+ +

+ In short: + 802.11ax/be features make Fi-Wi better, but Fi-Wi solves problems these + standards cannot address within the constraints of the distributed-AP + model. + Fi-Wi is not "better APs" — it's a different architecture that happens to + integrate well with modern Wi-Fi standards as they evolve. +

+ +
+ +

Appendix E: ASIC Evolution to Complexity

+ +

E.1 Why ASICs accumulate legacy complexity

+ +

+ Unlike software, ASICs cannot easily “refactor away” unused features. + Removing blocks typically requires re-verifying entire subsystems, while + adding blocks often requires verifying only the new logic. This asymmetry + encourages accumulation: +

+ + + +

+ Over many product generations, this leads to RTL codebases that only grow. + Legacy modulation modes, preambles, power-save FSMs, calibration paths, + and debug hooks persist long after their practical value has disappeared. +

+ +

E.2 Real costs of legacy bloat

+ +

This accumulated complexity has tangible costs:

+ + + +

E.3 How Fi-Wi changes the design equation

+ +

Fi-Wi’s architecture separates the system into:

+ + + +

+ This separation dictates where complexity must live. RRHs implement only + what must be fast and deterministic: RF front end, PHY processing, minimal + MAC TX/RX, DMA, PTP synchronization, and PCIe-over-fiber transport. All + high-level behavior (queueing, L4S policy, aggregation strategy) lives in + the concentrator. +

+ +

E.4 Economic and engineering leverage

+ +

+ For a modern Wi-Fi chip at an advanced node, even a modest reduction in + unnecessary logic can translate into significant savings: smaller die, + lower power, simpler verification, and faster time to market. +

+ +

E.5 Design principle for Fi-Wi RRH silicon

+ +

The guiding principle for Fi-Wi RRH design is:

+ +
+ Complexity belongs in the concentrator; only latency-critical functions + belong in RRH silicon. +
+ +

+ Concretely, this means: no autonomous AP queueing/scheduling logic, no + legacy PHY/MAC support beyond what Fi-Wi needs, and no embedded firmware + CPU managing per-station behavior at the edge. +

+ +
+ +

+ Appendix F: A Day in the Life of a Packet (The "Preamble Shield" in + Action) +

+ +

+ To truly understand Fi-Wi, we must follow a single packet through the + system at the microsecond scale. This narrative illustrates how the + Workstation Concentrator (Section 13) and the + Scatter-Gather RRH (Appendix C) collaborate to trick the + physics of latency. +

+ +

F.1 The Scenario

+ +
+ The Setting: Room 304 (served by RRH-A and RRH-B).
+ The Flow: a 4K video frame (downlink) destined for "Alice's Laptop."
+ The Constraint: L4S requires <1 ms tail latency.
+ The Challenge: the packet is currently 200 meters away in the Concentrator's DRAM.
+ +

F.2 The Downlink Race (The "Preamble Shield")

+ +

+ T = 0 µs (Arrival): The video packet arrives at the + Concentrator's NIC. The CPU timestamps it immediately. +

+ +

+ T = 2 µs (The Decision): The Concentrator's software + scheduler inspects the packet. +

+ + + +

+ T = 10 µs (The Setup): The scheduler posts a + DMA Descriptor to RRH-A via PCIe.
+ Note: The payload data (1500 bytes) stays in the Concentrator. Only a + 16-byte pointer moves to the edge. +

+ +

+ T = 50 µs (The Trigger): RRH-A's LBT logic sees the + airtime is clear. It begins the transmission sequence. + This is where the magic happens: +

+ +
+ The Race Against the PHY:
+ Action 1: RRH-A starts transmitting the 802.11 Preamble + (PLCP) from its local SRAM. This takes 20 µs of + airtime.
+ Action 2: Simultaneously, RRH-A issues a PCIe + Read Request to fetch the payload from the Concentrator.
+
+ The payload must travel 200m up the fiber and back before the + Preamble finishes transmitting. +
+ +

+ T = 52 µs (The Fetch): The Read Request hits the + Concentrator's PCIe controller. Because of the 92-lane non-blocking fabric + (Section 13), there is zero switching delay. +

+ +

+ T = 55 µs (The Return): The payload data flies back down + the fiber. +

+ +

+ T = 58 µs (The Handover): The payload data arrives at
+ RRH-A's FIFO while the PHY is still serializing the Preamble, with
+ roughly 12 µs of headers left to transmit.
+

+

+ T = 70 µs (Seamless Serialization): The PHY seamlessly
+ switches from transmitting the Preamble to transmitting the payload. To
+ the air, it looks like one continuous stream. The 200-meter fiber latency
+ effectively vanished because it was hidden behind the mandatory PHY
+ training sequence.

+ +

F.3 The Uplink Journey (Diversity & Sensing)

+ +

T = 200 µs: Alice sends a TCP ACK.

+ +

+ T = 204 µs (The Multi-Stat): Both RRH-A and RRH-B hear + the ACK. +

+ + + +

+ T = 210 µs (The Race Up): Both RRHs push the packet + CSI + metadata to the Concentrator. +

+ +

+ T = 215 µs (The Deduplication): The Concentrator sees two
+ copies of Sequence #104. It discards the weaker copy from RRH-B but keeps
+ its CSI data to update the "Sensing Model" (detecting that someone is
+ standing near RRH-B, blocking the line of sight).

+ +

F.4 Contrast with Legacy Wi-Fi

+ +

If this were a traditional AP:

+ + + +

F.5 Edge Cases and Advanced Scenarios

+ +

+ RRH Failure: If RRH-A fails during the prefetch (e.g., + power loss), the concentrator detects the link loss immediately via PCIe + link state. Because the packet payload never left Concentrator DRAM, the + scheduler simply re-posts the descriptor to RRH-B. No packet is lost, and + TCP does not see a drop. +

+ +

+ Congestion: The scatter-gather pipeline depth allows the + Concentrator to queue up the next descriptor while the current + one is transmitting. This allows back-to-back TXOPs (SIFS spacing) without + idle gaps on the air, even with the fiber latency. +

+ +

+ Coordinated Transmission: The Concentrator can schedule + RRH-A and RRH-B to transmit concurrently to spatially separated clients. + It analyzes the CSI matrix to determine if spatial isolation is sufficient + (>25 dB cross-coupling attenuation). If yes, both RRHs transmit + simultaneously using standard 802.11 frames. If interference is detected, + the Concentrator schedules sequential TXOPs. This dynamic decision happens + per-packet based on real-time CSI. +

+ +

F.6 Summary: The Packet's Perspective

+ +

+ From the packet's view, Fi-Wi provides uplink diversity, per-flow fair + queuing, accurate ECN marking, and speculative DMA that hides PCIe + latency. The packet experiences the network as a transparent, zero-wait + pipe. +

+ +

F.7 The Critical Insight: Timing vs. Intelligence

+ +

+ Fi-Wi separates timing (RRH hardware) from + intelligence (Concentrator software), bridged by the + speculative DMA prefetch pipeline. This allows the hardware to meet strict + microsecond deadlines while the software retains the flexibility to run + complex scheduling, L4S, and spatial multiplexing logic. +

+ +
+ +

+ Appendix G: The Strategic Case for Fiber Infrastructure +

+ +

+ The upfront cost of installing fiber is often the primary friction point + for C-RAN adoption ("The Fiber Tax"). However, this framing ignores the + physics of modern signaling and the macroeconomics of construction. + Fi-Wi's reliance on fiber is not a tax; it is a strategic asset + conversion. +

+ +

G.1 The Physics of 100G (The Copper Wall)

+ +

+ We are hitting a hard physical limit with copper cabling. At modern data + center speeds (100Gb/s), signal loss in copper is so high it is + characterized in dB per inch. +

+ + + +

G.2 Labor Rate Hedging (Inflation Proofing)

+ +

+ In low-voltage construction, the cost of cabling is dominated by + labor (often 70-80%), not material. +

+ + + +

G.3 Asset vs. Consumable

+ +

+ Unlike HDMI or Copper Ethernet—which are + purpose-built cables engineered for a single generation—fiber is a raw + transport medium. It is a "pipe for light" that supports Ethernet, DWDM, + and PCIe-over-Fiber simultaneously. +

+ +

+ While cable standards have cycled (Cat5e → Cat6 → Cat6A), they remain + tethered to the legacy RJ45 connector. This physical + interface is rapidly becoming obsolete. Fi-Wi recognizes that + the connection is what matters, not the physical port. In + this architecture, the + 802.11 wireless interface becomes the new connector. By + installing fiber once as a permanent asset and treating Wi-Fi as the + universal 'plug' inside the room, the building infrastructure is 'one and + done'. This finally breaks the cycle of physical obsolescence. +

+ +

+ Appendix H: Centralized Observability and the ML Advantage +

+ +

+ Fi-Wi's centralized architecture provides observability that is difficult + or impractical to achieve in distributed AP systems. This appendix + presents the Observability Matrix—a systematic comparison + of what telemetry is directly observable, partially observable, or hidden + across different measurement approaches. This complete visibility is the + prerequisite for effective machine learning (Section 15) and deterministic + L4S control. +

+ +

The Observability Gap

+ +

+ Traditional Wi-Fi deployments rely on tools that provide only partial + visibility into system state. Operators attempt to infer problems from + symptoms (latency spikes, ECN marks, throughput degradation) without + directly observing root causes (queue growth, retry timing, MCS selection + under interference). This inference distance—the number + of steps between observable effects and hidden causes—makes control + systems less stable and limits the effectiveness of machine learning. +

+ +

+ The table below compares observability across six measurement approaches. + The legend indicates: +

+ +
+
+ Direct: Directly measurable with + microsecond-resolution timestamps +
+ +
+ Partial: Partially observable or requires + inference +
+ +
+ Not Observable: Hidden or cannot be reliably + measured +
+
+ +

Observability Matrix

+ +
+ Telemetry / Metric         ESP32-C5   RPi 5     RPi 5     tcpdump   iperf2    Fi-Wi
+                            RF sensor  Monitor   L4S node  Capture   L4S       Concentrator
+ ------------------------------------------------------------------------------------------
+ Energy detect / CCA        Direct     Partial   -         -         -         Direct
+ Channel busy time          Direct     Partial   -         -         -         Direct
+ NAV / medium reservation   -          Partial   -         -         -         Direct
+ CSI / channel matrix       Partial    -         -         -         -         Direct
+ MCS / GI / NSS             -          Partial   -         -         -         Direct
+ PER / retry counts         -          Partial   Partial   -         -         Direct
+ RSSI / SNR                 Direct     Direct    Partial   -         -         Direct
+ Queue depth                -          -         Partial   -         -         Direct
+ Sojourn time               -          -         Partial   -         -         Direct
+ ECN marks                  -          -         Direct    Direct    Direct    Direct
+ One-way delay (OWD)        -          -         Partial   Partial   Partial   Direct
+ Responsiveness             -          -         Partial   -         Direct    Direct
+ Throughput / goodput       -          -         Direct    Direct    Direct    Direct
+ Deterministic playback     -          -         -         -         -         Direct
+
+ ("-" = Not Observable)
+ +

Critical Observations

+ +

Queue Depth and Sojourn Time (highlighted rows):

+ +

+ These metrics are essential for L4S congestion control and machine + learning. Traditional tools (tcpdump, Wi-Fi packet capture) cannot + directly observe queue state because it exists inside firmware or kernel + layers. While synchronized ingress and egress packet captures could + theoretically infer queue depth through timing correlation, this approach + requires nanosecond-precise time synchronization across physically + separated capture points, perfect packet correlation despite potential + losses, and still cannot observe firmware-internal retry queues, + aggregation buffer states, or PHY scheduling decisions. External sniffers + see the explosion (the packet hitting the air), but they cannot see the + fuse burning (the packet sitting in the driver queue). Only centralized + queueing architectures expose these values with direct + microsecond-resolution timestamps. +

+ +

MCS / GI / NSS (PHY Configuration):

+ +

+ Monitor-mode packet capture can partially infer MCS from radiotap headers, + but this only shows what was transmitted—not the decision process, CSI + data, or PER history that informed the choice. The Fi-Wi Concentrator has + direct access to the complete decision state. +

+ +

Deterministic Playback (bottom row):

+ +

+ This capability enables machine learning. Deterministic playback means the + Concentrator can reproduce its own decision sequence from a log file: + packet arrivals, queue transitions, scheduling decisions, MCS selections, + and RRH transmission commands. While actual RF outcomes depend on station + behavior and channel conditions that may vary, the Concentrator can replay + its control decisions under the logged RF environment to evaluate + alternative strategies offline and verify whether different MCS/scheduling + choices would have improved performance. This is only possible when all + Concentrator-controlled components operate under a single clock with + complete state visibility. Distributed systems cannot reconstruct this + causal chain from partial packet traces because they lack visibility into + queue state, retry logic, and the decision-making process itself. +
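The replay idea can be made concrete with a toy event log. The record layout and the stand-in rate policy below are invented for illustration; a real log would carry full descriptor, queue, and CSI state:

```python
def replay(log):
    """Re-derive the decision sequence from a logged trace. Same inputs
    under the same clock must yield the identical decision sequence."""
    decisions = []
    for event in log:
        # Stand-in policy: keep the logged MCS unless the logged PER
        # exceeded 10%, in which case step down one rate.
        mcs = event["mcs"] - 1 if event["per"] > 0.10 else event["mcs"]
        decisions.append((event["t_us"], mcs))
    return decisions

log = [
    {"t_us": 0,   "mcs": 9, "per": 0.02},
    {"t_us": 120, "mcs": 9, "per": 0.15},
]
assert replay(log) == replay(log)  # deterministic on every run
```

The property being demonstrated is the architectural one: because all inputs to the decision are in the log, alternative policies can be evaluated offline against the same trace.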

+ +

Why This Enables More Effective Machine Learning

+ +

+ Section 15 describes how Fi-Wi uses machine learning to optimize MCS + transition rates. The observability matrix demonstrates significant + practical advantages that Fi-Wi's centralized architecture provides for ML + training: +

+ + + +

+ Fi-Wi's centralized architecture provides these observability advantages. + The Concentrator's event log becomes a high-quality training dataset where + every state transition is labeled with measured outcomes under consistent + instrumentation. While autonomous AP systems could attempt ML-based rate + adaptation using the partial observability available to them, Fi-Wi's + richer telemetry—particularly queue visibility, global CSI, and + deterministic replay—enables significantly more effective learning and + optimization. +

+ +
+ Coordination Shares Outcomes; Fi-Wi Centralizes Causes +

+ Coordinated AP systems can share summaries (throughput, ECN marks, + interference reports) but cannot share hidden internal state (queue + depth, firmware retry logic, aggregation decisions). This creates + inference distance—the controller sees effects but not causes. Fi-Wi + eliminates inference distance by removing autonomous decision-making + from the edge. Queues, scheduling, and PHY selection are centralized + under a single clock, producing an observable state graph where causes + are explicit, replayable, and directly controllable. This architectural + difference translates to measurably better ML training data quality. +

+
+ +
+

Appendix I: Channel Width Orchestration and Service Time Variance

+ +

+ The Fi-Wi architecture treats channel width as a dynamic control + parameter managed by the Concentrator. While 802.11be + (Wi-Fi 7) emphasizes 320 MHz peak PHY rates, Fi-Wi's orchestration + engine strategically selects 40 MHz channel widths in + high-density environments to ensure + Service Time Stationarity and the stability of the L4S + control loop. +

+ +

+ I.1 The Contention-Domain Collapse of Wideband Channels +

+ +

+ In shared-spectrum MDUs (Multi-Dwelling Units), the theoretical gain of + wider channels is often negated by + contention-domain collapse. In a CSMA/CA environment, a + transmission opportunity (TXOP) requires the entire bonded channel to be + idle. In a 6-AP overlapping scenario with 50% aggregate airtime + occupancy, the probability of finding all sub-bands simultaneously idle + drops exponentially with bandwidth. +

+ +

+ Under a simplified independent-sub-band occupancy assumption, a basic + model suggests P(160 MHz idle) ≈ (P(40 MHz idle))^4, + resulting in 4–16× fewer transmission opportunities. In practice, + partial correlation between sub-bands moderates the exponent but does + not eliminate the super-linear decline in idle probability. This leads + to: +

+ + + +
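The super-linear decline in idle probability can be sketched numerically. This is a minimal sketch under the simplified independent-sub-band assumption stated above; the 50% occupancy figure is the illustrative case from the text, not a measurement.

```python
# Illustrative sketch (independent-sub-band assumption from I.1, not a
# measurement): probability that an entire bonded channel is idle.

def idle_probability(p_idle_40mhz: float, width_mhz: int) -> float:
    """P(entire bonded channel idle) = p^k, with k = width / 40 MHz sub-bands."""
    sub_bands = width_mhz // 40
    return p_idle_40mhz ** sub_bands

# 50% aggregate occupancy -> each 40 MHz sub-band is idle half the time.
p40 = idle_probability(0.5, 40)    # 0.5
p160 = idle_probability(0.5, 160)  # 0.5^4 = 0.0625
print(f"TXOP opportunity ratio 40/160 MHz: {p40 / p160:.0f}x")  # 8x
```

At 50% sub-band occupancy this lands in the middle of the 4–16× range quoted above; lighter or heavier occupancy moves the ratio toward the ends of that range.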

I.2 Queueing Theory and L4S Stability

+ +

+ From an M/G/1 queueing perspective, the performance of + the L4S control loop depends on the stability of the service rate (μ). + L4S stability requires frequent service opportunities and low variance + in service time to prevent the decoupling of the sender's congestion + window from the actual queue state. +

+ + + +
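The dependence of queueing delay on service-time *variance* (not just its mean) can be illustrated with the standard Pollaczek–Khinchine mean-wait formula for an M/G/1 queue. The arrival rate and service-time figures below are illustrative assumptions, not Fi-Wi measurements.

```python
# Sketch: Pollaczek-Khinchine mean waiting time for an M/G/1 queue.
# Shows that, at identical utilization, raising the service-time standard
# deviation alone (2 ms -> 50 ms, a "heavy-tailed" stand-in) inflates the
# mean wait. All numbers are illustrative assumptions.

def mg1_mean_wait(arrival_rate: float, mean_service: float,
                  var_service: float) -> float:
    """W = lambda * (sigma^2 + E[S]^2) / (2 * (1 - rho)), rho = lambda * E[S]."""
    rho = arrival_rate * mean_service
    assert rho < 1, "queue is unstable"
    return arrival_rate * (var_service + mean_service ** 2) / (2 * (1 - rho))

lam, mean_s = 80.0, 0.010                              # 80 TXOPs/s, 10 ms mean
low_var = mg1_mean_wait(lam, mean_s, 0.002 ** 2)       # near-stationary service
high_var = mg1_mean_wait(lam, mean_s, 0.050 ** 2)      # heavy-tailed service
print(f"mean wait grows {high_var / low_var:.0f}x from variance alone")  # 25x
```

The mean service rate is identical in both cases; only the second moment changes, which is exactly the stationarity argument made above.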

I.3 Link Adaptation and Spectral Robustness

+ +

+ Narrower channels reduce the probability that partial-band interference + (e.g., unmanaged IoT bursts) forces a full MCS downgrade across the + entire bonded width. This allows the Concentrator to + maintain stable link adaptation and a predictable drain rate, avoiding + the chaotic rate-shifting common in 160 MHz deployments. +

+ +

I.4 Orchestration: Width as a Control Variable

+ +

+ Fi-Wi is not anti-wideband; channel width is an orchestrated variable. + The system expands width opportunistically when contention is low to + leverage PHY gains and contracts it to 40 MHz when deterministic latency + is required. This prioritizes + spatial reuse and airtime isolation over maximum burst + rate—the fundamental technical unlock for Fi-Wi’s cell-per-room model. +

+ +

+ I.5 Capacity Density: Throughput Under a Latency SLO +

+ +

+ Fi-Wi optimizes Capacity Density under a Latency SLO, + rather than peak PHY on a single link. In dense OBSS environments, wide + channels reduce spatial reuse; narrower channels increase the number of + bounded contention domains. Consequently, aggregate goodput per area + increases even if per-link PHY decreases. +

+ +
+ Metric Definition: Low-Latency Goodput Density (ρ_LL) +

+ ρ_LL [Mbps / 1,000 sq ft] = (Σ Goodput_i) / Area | subject to p95 OWD + ≤ 20ms +

+ +

+ Where Goodput_i is the application-layer payload throughput + delivered while maintaining the p95 one-way delay (OWD) constraint. + The 20ms threshold reflects the target for interactive L4S + applications. +

+
+ +

+ Example Calculation (1,000 sq ft section of a 10,000 sq ft + floor): +

+ +

+ Assumptions: 50% aggregate offered load per BSS, default EDCA + parameters, and no explicit inter-AP coordination in the autonomous + case. +

+ + + +
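As a back-of-envelope sketch of how the density comparison works out, the per-cell and shared-domain goodput figures below are assumptions chosen to be consistent with the comparison table at the end of this appendix, not simulation outputs.

```python
# Hedged arithmetic sketch of the rho_LL comparison: 8 orthogonal 40 MHz
# cells vs. one collapsed 160 MHz contention domain. Per-cell goodput
# values are illustrative assumptions.

FLOOR_SQFT = 10_000
CELLS = 8                        # RRHs on orthogonal 40 MHz channels

# Orchestrated: each bounded cell sustains ~160 Mbps SLO-compliant goodput
# (assumed), and the cells reuse spectrum spatially.
per_cell_goodput_mbps = 160.0
rho_ll_fiwi = per_cell_goodput_mbps * CELLS / (FLOOR_SQFT / 1000)

# Autonomous: ~120 Mbps SLO-compliant goodput (assumed) shared across the
# whole floor's overlapping contention domains.
rho_ll_auto = 120.0 / (FLOOR_SQFT / 1000)

print(f"autonomous:   {rho_ll_auto:.0f} Mbps / 1,000 sq ft")   # 12
print(f"orchestrated: {rho_ll_fiwi:.0f} Mbps / 1,000 sq ft")   # 128
```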

+ I.6 Application: Aligning Wireless Capacity to Gigabit WAN Service +

+ +

+ To align with a Gigabit-class WAN service, the wireless architecture + must match the aggregate wireline supply to + orchestrated spatial demand. In a dense MDU, Contention + Delay is 10–100× larger than serialization time. A single 160 MHz AP + attempting to serve a Gigabit load creates a "fast but flaky" link that + collapses under co-channel interference, delivering only a fraction of + the ISP's provided capacity to real-time applications. +

+ +

+ Fi-Wi resolves this by using 40 MHz orchestration to spread the Gigabit + load across N coordinated spatial domains. This ensures + that the building-wide wireless fabric can actually saturate a 1 Gbps + WAN link with deterministic, multi-user goodput, rather than relying on + single-device peak bursts that starve other users and destabilize shared + airtime. +

+ +

+ I.7 Aggregation Quantization and L4S Feedback Mismatch +

+ +

+ L4S signals congestion at Layer 3 (IP ECN), but wideband Wi-Fi operates + via massive Layer 2 A-MPDU aggregation to maintain PHY efficiency. This + creates a fundamental control-loop mismatch: +

+ + + +

+ The Fi-Wi architecture addresses these challenges through its DualQ + implementation (Section 5.2), which maintains separate queues for L4S + and Classic traffic and performs per-packet sojourn time measurements at + the Concentrator before entering the A-MPDU aggregation pipeline. +
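A minimal sketch of per-packet sojourn-time marking ahead of the aggregation pipeline, assuming a simple 1 ms step threshold; the actual DualQ marking law in Section 5.2 may differ, and the threshold value is an assumption.

```python
# Sketch (assumptions, not the Concentrator implementation): CE-mark L4S
# packets based on per-packet queue sojourn time *before* A-MPDU batching,
# so the ECN signal tracks queueing delay rather than aggregation size.

from collections import deque
from dataclasses import dataclass, field

MARK_THRESHOLD_S = 0.001  # assumed L4S sojourn threshold (1 ms)

@dataclass
class Packet:
    enqueue_t: float
    ce_marked: bool = False

@dataclass
class L4SQueue:
    q: deque = field(default_factory=deque)

    def enqueue(self, now: float) -> None:
        self.q.append(Packet(enqueue_t=now))

    def dequeue(self, now: float) -> Packet:
        pkt = self.q.popleft()
        if now - pkt.enqueue_t > MARK_THRESHOLD_S:
            pkt.ce_marked = True  # signal congestion at L3 before L2 batching
        return pkt

q = L4SQueue()
q.enqueue(now=0.0)
q.enqueue(now=0.0005)
print(q.dequeue(now=0.0003).ce_marked)  # False: sojourn 0.3 ms
print(q.dequeue(now=0.0020).ce_marked)  # True: sojourn 1.5 ms
```

Because marking happens at dequeue into the aggregation pipeline rather than after a batch completes, the feedback granularity is per packet, which is the coherence property the table below attributes to the orchestrated case.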

+ +
+ +

Comparison of Service Metrics (Dense MDU Contention Model)

+ +

+ Scenario: 2x2 MIMO, 6+ overlapping BSSIDs, shared unlicensed spectrum + (5/6 GHz), 50% aggregate offered load, autonomous EDCA parameters. See + Appendix J for full simulation parameters. +

| Metric | 160 MHz (Autonomous CSMA) | 40 MHz (Fi-Wi Orchestrated) |
| --- | --- | --- |
| Peak PHY Rate (2x2, MCS 11) | ~1.2 Gbps | ~300–400 Mbps |
| Effective Airtime Utilization | <10% (Fragmented TXOPs) | 30–50% (Planned reuse / Bounded domain) |
| Service Time Variance (σ²) | High (Heavy-tailed) | Low (Near-stationary) |
| Queue Service Interval (median) | Tens to >100 ms | 5–15 ms (Stationary) |
| DualQ ECN Feedback Coherence | Sparse / Burst-marked | Continuous / Stable marking |
| Goodput Density (ρ_LL) (Mbps per 1,000 sq ft) | ~12 Mbps (Overlapping contention domains) | ~128 Mbps (8 RRHs, orthogonal 40 MHz channels) |
+ +

+ Economic Conclusion: Under realistic dense MDU + conditions, Fi-Wi's orchestrated 40 MHz architecture delivers ~10× + higher usable goodput density compared to autonomous wide-channel + deployments. This is the fundamental advantage of Fi-Wi: capacity scales + with RRH density and spatial reuse, not channel width alone. +

+ +

+ See Appendix J for detailed contention modeling and simulation + methodology. +

+
+ +
+

Appendix J: 10-Node MDU Simulation Methodology

+ +

+ This appendix details the Monte Carlo simulation and analytical models + used to derive the + Low-Latency Goodput Density (ρ_LL) metrics. The + framework evaluates Fi-Wi's spatial capacity gains under realistic + Multi-Dwelling Unit (MDU) contention scenarios. +

+ +

J.1 Spatial and RF Environment Model

+ +

+ The simulation contrasts traditional wide-area coverage with Fi-Wi's + localized orchestration. +

+ +
+ Building & RF Assumptions: +
    +
  • Geometry: 10,000 sq ft floor divided into 8 units
    (~1,250 sq ft each). Metrics are normalized to "per 1,000 sq ft" for
    comparative analysis.
  • Path Loss Model:
    PL(d) = PL(d₀) + 10n log₁₀(d/d₀) + Xσ with
    n = 2.8.
  • OBSS Overlap: Autonomous case assumes 6 neighboring
    BSSIDs audible at ≥ -62 dBm.
  • Fi-Wi Isolation: 8 RRHs achieving >25 dB
    co-channel isolation through planned orthogonal reuse.
+
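The log-distance path-loss model above translates directly to code. The reference loss PL(d₀) = 40 dB at d₀ = 1 m is an assumed value for illustration; only the exponent n = 2.8 comes from the appendix.

```python
# Sketch of the J.1 log-distance path-loss model with optional shadowing:
#   PL(d) = PL(d0) + 10 * n * log10(d / d0) + X_sigma,  n = 2.8
# PL(d0) = 40 dB at d0 = 1 m is an assumed reference value.

import math
import random

def path_loss_db(d_m: float, pl_d0_db: float = 40.0, d0_m: float = 1.0,
                 n: float = 2.8, sigma_db: float = 0.0) -> float:
    shadowing = random.gauss(0.0, sigma_db) if sigma_db > 0 else 0.0
    return pl_d0_db + 10.0 * n * math.log10(d_m / d0_m) + shadowing

# Deterministic check (sigma = 0): each decade of distance adds 28 dB.
print(f"PL(10 m) = {path_loss_db(10.0):.1f} dB")  # 68.0 dB
```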
+ +

J.2 Contention and Backoff Logic

+ +

+ The simulation models 20 active stations (STAs) distributed across the + 8-unit floor (average 2.5 STAs per unit). + Service Time Variance (σ²) is calculated by observing + the delay between TX_START and ACK_END across + 10⁶ simulated TXOPs. +

+ + + +
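A toy Monte Carlo in the spirit of this measurement, assuming a simple binary-exponential-backoff model with illustrative busy probabilities standing in for the wide- and narrow-channel contention regimes. This is not the Appendix J simulator; it only demonstrates how service-time variance is extracted from sampled TXOPs.

```python
# Toy Monte Carlo (illustrative assumptions only): sample TX_START -> ACK_END
# service times as fixed airtime plus CSMA backoff, doubling the contention
# window on each busy sensing round. p_busy values are stand-ins for the
# 160 MHz (collapsed) and 40 MHz (bounded) contention domains.

import random
import statistics

def sample_service_time(p_busy: float, airtime_s: float = 0.002,
                        slot_s: float = 9e-6, cw_min: int = 16,
                        cw_max: int = 1024) -> float:
    delay, cw = 0.0, cw_min
    while random.random() < p_busy:            # channel sensed busy: back off
        delay += random.randint(0, cw - 1) * slot_s
        cw = min(cw * 2, cw_max)               # binary exponential backoff
    delay += random.randint(0, cw - 1) * slot_s
    return delay + airtime_s

random.seed(7)
wide = [sample_service_time(p_busy=0.9) for _ in range(20_000)]    # 160 MHz-like
narrow = [sample_service_time(p_busy=0.3) for _ in range(20_000)]  # 40 MHz-like
ratio = statistics.variance(wide) / statistics.variance(narrow)
print(f"service-time variance ratio (wide/narrow): {ratio:.0f}x")
```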

J.3 The ρ_LL Filtration Process

+ +

+ The Goodput Density is derived by filtering raw + throughput through the 20ms p95 OWD constraint. +

+ +
+// Derivation for ρ_LL Calculation
+for each packet i:
+    delay_i = contention_delay + serialization_delay + retry_overhead
+    if delay_i <= 20ms:
+        accepted_payload += size_i
+    else:
+        dropped_from_goodput_metric += 1
+
+ρ_LL = (accepted_payload) / (total_time * area)
+  
+
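A runnable version of the filtration above, using synthetic packet delays in place of simulator output; the packet sizes and timings are illustrative.

```python
# Runnable rho_LL filtration: only packets meeting the 20 ms p95 OWD budget
# contribute to goodput. Inputs are synthetic stand-ins for simulator output.

def goodput_density(packets, duration_s: float, area_ksqft: float,
                    slo_s: float = 0.020) -> float:
    """packets: iterable of (delay_s, size_bits). Returns Mbps per 1,000 sq ft."""
    accepted_bits = sum(size for delay, size in packets if delay <= slo_s)
    return accepted_bits / duration_s / 1e6 / area_ksqft

# Three 12 Mbit packets over 1 s on a 1,000 sq ft section; the 45 ms packet
# misses the SLO and is excluded from the goodput numerator.
pkts = [(0.005, 12_000_000), (0.018, 12_000_000), (0.045, 12_000_000)]
print(goodput_density(pkts, duration_s=1.0, area_ksqft=1.0))  # 24.0
```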

J.3.1 Numerical Results and Derivation

+ +

+ The simulation produces the following goodput derivation for a 1,000 sq
+ ft section:

+ + + +

J.4 Traffic Model and Payload Composition

| Traffic Type | % of Load | Constraint |
| --- | --- | --- |
| Interactive (L4S/Gaming) | 20% | Strict SLO subject |
| Streaming (4K Video) | 50% | Freeze sensitive |
| Bulk (Background) | 30% | Throughput focused |
+
+ + + ↑ Contents + +