NOR Flash was supposed to be boring.
For years, it was the quiet workhorse of the semiconductor world — the chip that stores your firmware, boots your system, and then gets completely forgotten about. Demand was predictable. Pricing was stable. Nobody wrote breathless articles about NOR Flash shortages.
That era is over. In 2026, NOR Flash — particularly Winbond’s W25Q128 — has become one of the most supply-constrained components in the electronics industry. And unlike cyclical semiconductor shortages that resolve in a few quarters, this one has structural roots that won’t easily unwind.
Here’s what happened, and why it matters for your production planning.
Why NOR Flash Can’t Be Replaced
Before diving into the shortage, you need to understand why NOR Flash is irreplaceable. Not “hard to replace” — genuinely without substitute.
NOR Flash has a property called execute-in-place (XiP): the processor can run code directly from the Flash chip without copying it to RAM first. Boot firmware starts executing within microseconds of power-on. Read access latency is 70–100 nanoseconds — about 1,000 times faster than NAND Flash.
Can you use NAND Flash instead? No. NAND requires reading an entire page (2–4 KB minimum) and copying it to RAM before the processor can use it. This adds milliseconds of boot latency and requires DRAM to be initialized first — a chicken-and-egg problem for the firmware that initializes the DRAM.
Can you use DRAM? No. DRAM is volatile. Turn off the power, lose the data. Firmware must survive power cycles.
NOR Flash is the only commercially available technology combining non-volatility, byte-level random access, and execute-in-place capability. There is nothing else. When NOR Flash is in shortage, there is no Plan B at the technology level — only alternative suppliers of the same technology.
Three Demand Drivers Hitting Simultaneously
This is what makes the situation structural rather than cyclical. The 2021–2022 chip shortage was broad-based demand hitting broad-based supply. The NOR Flash shortage of 2026 is three specific, independent demand surges hitting a component category that has no reason to have built spare capacity.
Edge AI Multiplied the NOR Flash Per Device
A basic IoT device in 2022 needed one NOR Flash chip — perhaps a W25Q16 (2 MB) or W25Q32 (4 MB) — to store its firmware. Done.
An edge AI device in 2026 needs dramatically more. The AI model weights that enable on-device inference need to be stored somewhere that allows instant access at power-on. You can’t wait for a network download or a NAND-to-RAM copy. The model needs to be there when the processor wakes up.
A smart camera with AI object recognition now needs 128 Mbit (16 MB) or more of NOR Flash — plus a second chip for fail-safe OTA updates (so you can update one bank while running from the other). An AR/VR headset might need three 256 Mbit chips. A voice assistant with local wake-word detection uses 128 Mbit where it once used 32 Mbit.
The per-device NOR Flash content has grown 8–16x in three years. And the number of devices with AI capabilities is growing simultaneously. The compound effect is staggering.
Automotive ADAS Went From 2 Chips to 12
A standard vehicle five years ago used 2–3 NOR Flash chips totaling maybe 128 Mbit. Instrument cluster firmware, radio boot code, maybe a telematics module. Simple.
A modern vehicle with Level 2+ ADAS uses 8–12 NOR Flash ICs with total capacity over 1 Gbit. Every camera module (front-facing plus four corner/surround cameras) needs its own NOR Flash. Every radar module needs one. The ADAS domain controller needs a large one (256–512 Mbit). The instrument cluster, the telematics unit, the V2X module — each needs fast-boot firmware storage that survives power cycles.
And here’s the procurement headache: automotive NOR Flash must be AEC-Q100 qualified. That qualification takes 12–18 months for new production lines. You can’t quickly shift consumer-grade production to automotive-grade to meet demand. The qualification barrier creates a structural supply ceiling that demand has now exceeded.
AI Servers: The Unexpected Consumer
This one caught the industry by surprise. AI servers use significantly more NOR Flash than standard servers, for a reason that’s not immediately obvious.
HBM (High Bandwidth Memory) — the stacked DRAM that gives AI accelerators their bandwidth — requires NOR Flash to store initialization sequences that must execute deterministically at power-on. Each HBM stack needs its own NOR Flash for boot-time configuration. A GPU with six HBM stacks needs six NOR Flash chips just for memory initialization, separate from the main system firmware.
A standard server uses 1–2 NOR Flash chips. A GPU server with HBM uses 5–8. An AI training rack can need 30 or more. Nobody planned NOR Flash capacity for this use case, because nobody predicted AI training clusters would deploy at this scale this quickly.
Why Supply Can’t Keep Up
NOR Flash is manufactured on mature process nodes — 45nm and 55nm. These are the same nodes that produce display drivers, power management ICs, automotive MCUs, and analog chips. There’s fierce competition for mature-node wafer capacity, and NOR Flash — with its relatively lower revenue per wafer — often loses the allocation battle to higher-margin products.
The industry’s strategic investment has flowed to two places: advanced nodes (for AI chips) and capacity expansion for power semiconductors and automotive MCUs. NOR Flash hasn’t attracted significant new capital because, until recently, nobody expected it to be a growth market.
Winbond — commanding roughly 28% of global SPI NOR Flash — has acknowledged that high-end product allocation extends through early 2027. Their investment in 3D NOR Flash technology (the “CUBE” architecture) promises higher density per wafer, but volume production is a 2027–2028 story. It doesn’t help 2026.
The Specific Part: W25Q128
If I had to name the single most supply-constrained NOR Flash SKU in 2026, it would be the Winbond W25Q128JVSIQ.
128 Mbit (16 MB) is the Goldilocks capacity — large enough for edge AI firmware bundles but cost-effective enough for high-volume IoT products. It’s also the standard capacity for automotive camera modules. Demand is concentrated at exactly this density node, from both consumer and automotive applications simultaneously.
Smaller capacities (16–64 Mbit) and larger capacities (512 Mbit+) have more available supply. If your design can tolerate a capacity change in either direction, you may find shorter lead times.
What To Do
If NOR Flash would stop your production line, secure 6–9 months of safety stock now. The carrying cost (2–3% of component value per month) is trivial compared to line-down costs. Also: begin qualification of GigaDevice’s GD25Q128E immediately. It’s pin-compatible with the W25Q128, command-set compatible, and typically ships with shorter lead times (12–16 weeks versus Winbond’s 16–24 weeks).
If NOR Flash is important but not critical-path, extend your planning horizon to 16–20 weeks (versus the historical 8–12) and ensure your firmware driver supports JEDEC SFDP-compliant devices from multiple vendors without hardware changes. This is the single best investment you can make: design-time flexibility that creates procurement-time optionality.
If you’re designing a new product, ensure your PCB footprint and SPI interface support multiple NOR Flash vendors interchangeably. Standardize on JEDEC SFDP discovery protocol in your bootloader. A small software investment now gives your procurement team the freedom to source from Winbond, GigaDevice, Macronix, or ISSI based on whoever has stock — without a hardware revision.
How Long Will This Last?
Meaningful relief requires either new 2D capacity (unlikely to receive investment given the 3D transition) or 3D NOR Flash reaching volume production (2027–2028 at earliest).
My best estimate: the structural shortage persists through at least mid-2027, with spot pricing remaining 15–25% above 2025 contract levels. Beyond that, 3D NOR and potential capacity additions should gradually restore balance — but “balance” will be at higher price levels than the pre-2025 era, because the demand drivers (edge AI, automotive ADAS, AI servers) are permanent, not cyclical.
Plan accordingly.
Need NOR Flash allocated? Cosolvic sources Winbond, GigaDevice, Macronix, and ISSI NOR Flash through established supply channels. Submit your requirements with specific part numbers and quantities.