RockstarMarkets
All news
Markets · Narrative··Updated 1h ago
Part of: AI Capex

NVDA Inference Share Forecast at 50% by 2028 as AMD and TPUs Gain

With inference margins running 10-15 points below training, a share erosion from 80%-plus to 50% would meaningfully compress NVDA's blended gross margin, a risk sell-side consensus has yet to fully price, with spillover pressure on AVGO and AMD valuations.

R
Rocky · RockstarMarkets desk
Synthesised from 8 wires · 50 mentions in the last 24h
Sentiment
-35
Momentum
70
Mentions · 24h
50
Articles · 24h
62
Affected sectors
Related markets

Key facts

  • AMD MI300, Google TPU, Amazon Trainium all gaining inference share; competitive threat real, not theoretical
  • Forecast: NVIDIA inference share to decline to 50% by 2028 from current 80%+ levels
  • Inference margins are 10-15 pts lower than training margins; margin dilution risk if share erodes
  • Hyperscalers building proprietary silicon; software ecosystem (ROCm, TPU) maturing; switching costs declining

What's happening

The narrative around NVIDIA's competitive moat has begun to crack under scrutiny. While NVIDIA remains the overwhelming leader in AI training chips, the real battlefield is shifting to inference, where margins are lower but volumes are vastly larger. AMD, Google's internally developed TPUs, and Amazon's Trainium chips are all making material gains, and a growing cohort of market observers believes NVIDIA's inference market share could fall to 50% by 2028, down from current levels above 80%.

This structural shift has several dimensions. First, hyperscalers have grown increasingly sophisticated at building proprietary silicon for their own workloads. Google, Amazon, and Meta have the engineering talent, the financial resources, and the incentive to reduce their dependence on NVIDIA. Second, AMD's MI300 and successor products have closed the performance gap in many workloads, and third-party benchmarks increasingly show feature parity at lower cost. Third, the software ecosystem around AMD (ROCm), Google (TPU), and others is maturing, lowering switching costs for new deployments.

For NVIDIA, this matters because gross margins in inference are 10-15 percentage points lower than in training, and if NVIDIA loses half the inference market, it directly pressures the blended gross margin that the street has baked into valuations. Several sell-side houses have not yet adjusted their margin forecasts to reflect this risk, creating a subtle but material disconnect between consensus estimates and the actual competitive landscape. The sell-side remains bullish on NVIDIA's FY2026-2027 earnings, but that view rests on assumptions about sustained market share that are increasingly contested.

Retail traders and momentum players have extrapolated from the Q1 earnings beat and the cup-and-handle breakout into a conviction that NVIDIA remains a one-way trade. But institutional flows suggest more caution. Money is flowing into broadsheet semiconductor and AI exposure (via SPY and QQQ) rather than concentrating in NVDA, a potential warning sign that smart money is diversifying away from single-stock concentration risk.

What to watch next

  • 01NVIDIA inference revenue growth rates in coming quarters; ASP (average selling price) trajectory
  • 02AMD quarterly revenue growth and margin trends; market share gains quantified by customers
  • 03Gross margin guidance from NVIDIA; any commentary on competitive intensity in inference
Mention velocity · last 24 hours
Coverage from these sources
Previously on this story

Related coverage

More about $NVDA

Topic hub
AI Capex: Who's Spending, Who's Earning, and What's at Risk

Tracking AI infrastructure capex — hyperscaler spend, data center buildouts, memory demand and the margin compression risk.