Track 16: Debugging & Case Studies
Real-world debugging requires systematic approaches and pattern recognition. This track presents compliance debug flowcharts, failure case studies, redesign examples, and interactive "What Went Wrong?" exercises to build debugging intuition.
1. Compliance Debug Process: Radiated Emissions
Follow this systematic flowchart when your product fails radiated emissions testing.
2. SI Failure Debug: Eye Diagram Case Studies
Case Study A: PCIe Gen3 Link Fails at Receiver
Symptoms: Eye height reduced to 50mV (spec: 120mV min). Eye width OK. BER > 1e-8.
Root Cause: Via stub of 8 mils creating resonant null at 4.5 GHz, right at Nyquist for 8 GT/s.
Fix: Back-drill vias to reduce stub length to < 2 mils. Eye height recovered to 180mV.
Case Study B: DDR4 Fails Write Leveling at 3200 MT/s
Symptoms: DQ eye shows severe ISI with eye closure. Signals ring for 2 UI after transitions.
Root Cause: 15mm trace length mismatch between DQ and DQS within byte lane. ODT misconfigured (240 ohm instead of 34 ohm).
Fix: Correct ODT to 34 ohm, reduce length mismatch to < 2mm. Clean eye with 200ps margin.
Case Study C: 25G Ethernet SFP+ Link Marginal
Symptoms: Eye diagram shows excessive jitter (>0.3 UI). Jitter tracks a 156.25 MHz pattern.
Root Cause: Reference clock coupled into data lane through inadequate isolation (only 4mm spacing, shared via field).
Fix: Increase spacing to 15mm, add ground via fence between clock and data. Jitter reduced to 0.08 UI.
3. PI Failure Analysis: Power Delivery Case Studies
Case Study A: FPGA Fails During High-Activity Modes
Symptoms: VCC_INT drops by 120mV during burst activity. FPGA resets randomly. Scope shows 200MHz ringing.
Root Cause: Anti-resonance between bulk caps (4.7uF) and MLCC (100nF) at 200MHz. PDN impedance peak of 500 mOhm at that frequency.
Fix: Added 10nF caps to bridge the anti-resonance gap. Peak impedance reduced to 30 mOhm. Droop reduced to 25mV.
Case Study B: GPU Power Rail Shows Excessive Ripple
Symptoms: 1.0V rail shows 80mV ripple at 500 kHz switching frequency. GPU throttles under load.
Root Cause: VRM output inductor placed 25mm from GPU BGA. Long power trace adds 2nH inductance. Only 2x 22uF bulk caps near GPU.
Fix: Relocated VRM within 10mm of GPU. Added 4x 100uF bulk caps and 20x 100nF MLCC. Ripple reduced to 15mV.
Case Study C: DDR4 VTT Reference Voltage Noise
Symptoms: VTT (0.6V) shows 50mV noise correlated with burst reads. DDR4 DQ signals show asymmetric eye.
Root Cause: VTT regulator has 100mOhm output impedance at DDR switching frequency. 4 DIMM slots share single VTT source with 40mm trace runs.
Fix: Added local 100nF decoupling at each DIMM slot VTT pin. Used wider VTT distribution traces. Noise reduced to 8mV.
4. PCB Redesign Examples
Before/after comparisons showing how targeted PCB changes resolve SI/PI/EMI issues.
Before: High-Speed Routing (Rev A)
- PCIe traces routed on layer 1 (microstrip)
- Traces cross 3 plane splits
- No length matching (30mm skew)
- Via stubs: 60 mil
- No ground via stitching
Result: Eye closed, EMI fail at 2.5GHz, 4GHz
After: High-Speed Routing (Rev B)
- PCIe traces on layer 3 (stripline)
- Continuous reference plane, no splits crossed
- Length matched within 0.5mm
- Via back-drilling, stub < 5 mil
- Ground via fence on both sides
Result: Clean eye, 6dB EMI margin
Before: PDN Layout (Rev A)
- VRM 40mm from processor
- 12x 100nF MLCC, 2x 22uF bulk
- Caps placed on back side only
- 2 power vias per cap pad
- No embedded capacitance consideration
Result: 150mV droop, anti-resonance at 50MHz
After: PDN Layout (Rev B)
- VRM within 15mm of processor
- 40x 100nF + 10x 10nF + 4x 100uF
- Caps on both sides, interleaved values
- 8 power vias per cap, wide power shapes
- Thin dielectric for embedded capacitance
Result: 20mV droop, flat impedance to 500MHz
Before: EMI Containment (Rev A)
- I/O connectors grounded with single via
- No common-mode chokes on cables
- Board edge traces within 3mm of edge
- No ground stitching around board perimeter
- Crystal oscillator near board edge
Result: Failed FCC Class B at 6 frequencies
After: EMI Containment (Rev B)
- 360-degree ground pad on all I/O connectors
- CM chokes on USB, HDMI, Ethernet
- 20mm keep-out from board edge
- Ground via fence at 200mil pitch around perimeter
- Crystal moved to board center, shielded
Result: Passed FCC Class B with 8dB margin
5. Troubleshooting Flowcharts
SI Debug Flowchart
EMI Debug Flowchart
6. "What Went Wrong?" Interactive Scenarios
Click each scenario to reveal the answer and explanation.
What Went Wrong: The USB 3.0 differential pair was routed on layer 1 (microstrip) with no ground via stitching and crossed a power plane split. The return path discontinuity created a common-mode conversion that destroyed the 5 GHz signal quality.
Fix: Rerouted USB 3.0 on inner stripline layer, added ground via fence, ensured continuous reference plane. Device enumerated at SuperSpeed immediately.
What Went Wrong: The two failing byte lanes had DQ-to-DQS length mismatch of 12mm (within spec at 2400 but out of spec at 3200). Additionally, the ODT values were optimized for 2400 and too weak for 3200 MT/s operation.
Fix: Reduced length mismatch to 1mm, updated ODT to 48 ohm for write and 40 ohm for read. All byte lanes passed at 3200 MT/s.
What Went Wrong: The MLCC decoupling capacitors lost 40% of their capacitance at high temperature (X5R ceramic, DC bias + temperature derating). This shifted the PDN anti-resonance into the frequency range of the switching regulator, creating a cavity resonance that radiated.
Fix: Replaced X5R caps with C0G (NPO) for critical locations. Added additional X7R caps with higher nominal values to compensate for derating. Passed at all temperatures.
What Went Wrong: The PCB via at the receiver end had an 8-mil stub (12-layer board, signal on layer 3, via goes to layer 12). The stub resonance at 4.6 GHz created a notch in S21 exactly at the Nyquist frequency of 8 GT/s PCIe Gen3.
Fix: Implemented via back-drilling to reduce stub to 2 mils. S21 improved by 4dB at Nyquist. BER improved from 1e-9 to < 1e-15.
What Went Wrong: The 25 MHz crystal oscillator for the Ethernet PHY was placed near a switching regulator. The 500 kHz switching noise modulated the reference clock, creating 200 Hz of periodic jitter. While small in absolute terms, this jitter accumulated in the PLL, causing occasional packet loss when combined with cable loss.
Fix: Relocated crystal 20mm away from the switcher, added ground via ring around crystal area, and added an LC filter on the power supply to the crystal driver. Zero packet loss thereafter.
Track 16 Quiz: Debugging & Case Studies
Q1: The first step in debugging a radiated emissions failure should be:
Q2: A via stub creates a resonant null in S21 at approximately:
Q3: An anti-resonance in the PDN impedance is caused by:
Q4: When a near-field probe localizes EMI to a specific connector, the most effective first fix is:
Q5: X5R MLCC capacitors lose significant capacitance due to: