AI Hardware Timeline

2026
2025
2024
2022
2020
2018
2016
2012

2026

June 10, 2026 Midjourney

Midjourney CEO says invites are going out for the company's first hardware launch

Midjourney founder David Holz posted that invites are being sent for the company's first hardware product. No specifications have been shared publicly. If the product ships, Midjourney would become an AI-software company that also sells physical hardware — a notable expansion of scope from its origins as an image-generation service. Details are thin; treat this as an early signal rather than a confirmed product.David Holz on X →

Info

June 10, 2026 NVIDIA

NVIDIA publishes open humanoid robot reference design using Unitree hardware and Isaac GR00T

NVIDIA released an open reference design for humanoid robots combining Unitree hardware, Jetson Thor compute, the Isaac GR00T foundation model, Isaac Sim, and Isaac Lab. The reference stack is intended to lower the barrier for labs, universities, and startups to build humanoids by providing a documented, tested starting configuration rather than requiring each team to integrate hardware and software from scratch. It positions NVIDIA's physical-AI software stack as the default development environment for humanoid robots.NVIDIA press release →

Info

May 28, 2026 Brockovich Data Center Reporting

Erin Brockovich launches crowdsourced AI data-centre map

A crowdsourced mapping project associated with Erin Brockovich began tracking US AI data centres by status — operational, under construction, proposed and community-reported — with the site showing a late-May 2026 update. The map aggregates official records and resident reports of new and planned facilities. The project reflects mounting local scrutiny of the AI build-out's footprint on power, water and land, giving communities a public tool to follow data-centre siting that has often advanced with little visibility.Brockovich Data Center Reporting

Info

May 20, 2026 Blackstone / Google

Blackstone and Google form joint venture for new TPU Cloud — $5 billion equity

Blackstone and Google announced a joint venture to build a new TPU-based compute-as-a-service cloud, with Blackstone committing approximately $5 billion in equity. The venture is timed alongside Google I/O 2026 and is intended to expand TPU capacity available to third-party customers, alongside the eighth-generation TPU 8t/8i chips Google introduced at Cloud Next 2026. Coverage frames the deal as a structural step toward private-capital-backed AI compute build-outs, similar in shape to power-and-data-centre infrastructure deals that Blackstone and other private-equity firms have closed in recent quarters.Blackstone press release → Google blog →

Warning

#AIInfrastructure

May 20, 2026 Google

Google announces Android XR Glasses — audio-first this autumn, with cameras and partner brands

At Google I/O 2026, Google announced its Android XR Glasses platform, with an audio-first launch this autumn followed by camera-equipped models. Partners named include Samsung and Warby Parker, with the platform positioned as the hardware companion for Google's agentic Gemini stack (Spark, Daily Brief, Universal Cart). The category is Google's most overt re-entry into wearable hardware since the original Glass programme, building on the Android XR developer platform announced in late 2024.Google I/O 2026 announcements →

Info

#AIInfrastructure

May 13, 2026 US EIA

EIA: US data-centre servers projected at 22-33% of commercial-building electricity by 2050

The US Energy Information Administration's Annual Energy Outlook 2026 projects that data-centre servers — driven by AI workloads — will account for roughly 22 to 33% of all US commercial-building electricity by 2050, depending on adoption scenarios. In high-demand cases, server electricity use rises to more than 16 times its 2020 level. Servers already represented around 7% of commercial electricity in 2025. The outlook reverses the prior assumption of broadly flat US electricity growth and singles out standalone data centres as a faster-growing segment than in-building deployments, with knock-on implications for utilities, grid capacity, and commercial-building energy intensity.EIA Today in Energy → Annual Energy Outlook 2026 →

Warning

#AIInfrastructure

May 13, 2026 Unitree

Unitree unveils GD01 — a manned transformable "mecha" priced at roughly $650,000

Unitree revealed the GD01, a roughly 500 kg manned transformable machine that switches between a humanoid stance and a wheeled vehicle configuration. Global Times and follow-on coverage described it as production-ready at a price of about 3.9 million yuan — about $650,000 — placing it well above the company's own consumer-grade quadruped and humanoid lines. Independent verification is still thin: most coverage so far traces back to Unitree's own demo footage and a small number of Chinese-market outlets. Treat the capability claims as marketing-grade until third parties get hands-on time. Even so, the product slot itself is novel — a non-trivial step from research-platform humanoids toward operator-ridable showcase hardware.Coverage of the GD01 reveal

Info

April 22, 2026

Google splits its eighth-generation TPU into two chips: TPU 8t for training and TPU 8i for inference

At Google Cloud Next 2026, Google introduced two eighth-generation TPUs designed in partnership with DeepMind. TPU 8t targets frontier training, with a single superpod scaling to 9,600 chips and two petabytes of shared high-bandwidth memory; Google claims roughly 2.8x better price-performance over the previous generation and near-linear scaling to a million chips via the new Virgo network, JAX, and Pathways. TPU 8i targets latency-sensitive inference for concurrent agent workloads, with a new Boardfly serving topology, 3x more on-chip SRAM, a Collectives Acceleration Engine, and up to 80% better performance-per-dollar over its predecessor. Both chips claim up to twice the performance-per-watt of Ironwood, and native PyTorch support entered preview alongside the launch.Google ↗

Apr 2026

Sony AI reveals Ace, a table tennis robot that beats elite players Sony

Sony AI presented Ace, an autonomous table tennis robot that won three of five matches against elite players under ITTF rules at Sony's Tokyo headquarters. The system pairs custom high-speed robotic hardware with event-based vision sensors — cameras that report pixel changes as they happen rather than at a fixed frame rate — giving it reaction times fast enough to return high-speed, high-spin shots. The work was published in Nature on 22 April 2026.Nature ↗

Apr 2026

ASUS launches UGen300 USB AI Accelerator — 40 TOPS at 2.5 watts ASUS

ASUS announced the UGen300, a USB-stick AI accelerator built around a Hailo processor. It delivers up to 40 TOPS of inference performance at roughly 2.5 watts of power draw, targeting local generative-AI workloads on laptops, mini-PCs, and edge devices. The product slots into a growing category of plug-in accelerators aimed at keeping inference off the cloud and on the user's own hardware.ASUS Press Release ↗

Mar 2026

TSMC adds $100 billion to Arizona chip manufacturing TSMC

TSMC announced a further $100 billion investment in its Arizona fab complex, on top of $65 billion already committed. The expansion adds fabrication plants, advanced packaging, and a research centre. The first 4nm fab is expected to begin production in early 2025.TSMC Press Release ↗

Mar 2026

Tesla, SpaceX, and xAI announce Terafab — a $20 billion chip factory Tesla / xAI

Elon Musk announced plans for a vertically integrated semiconductor facility in Austin, Texas, targeting 2nm process technology and eventually one million wafer starts per month. The project, budgeted at $20–25 billion, would consolidate chip design, fabrication, memory, packaging, and testing under one roof — aimed at supplying custom AI chips for Tesla vehicles, Optimus robots, and SpaceX systems.Tom's Hardware ↗

Mar 2026

Helium supply shock removes 30% of global chipmaking gas Supply Chain

A major helium facility shutdown in the Middle East removed roughly 30% of global semiconductor-grade helium from the market, leaving chipmakers including TSMC and Samsung with weeks of reserves. Helium cools silicon wafers during lithography at temperatures no other gas can maintain, and there is no substitute. Spot prices doubled within days of the disruption.Tom's Hardware ↗

Mar 2026

Starcloud raises $170M to build data centres in orbit Starcloud

Starcloud closed a $170 million Series A at a $1.1 billion valuation to build the first commercial data centres in low-Earth orbit, using SpaceX launches to lift hardware. Space-based compute eliminates terrestrial cooling costs — orbital radiators can reject heat passively into the void, without water or air conditioning — and avoids the land-use and grid-connection delays that are stalling ground-based builds. If viable at scale, it marks the first genuinely new compute-infrastructure category since the hyperscale cloud era began.TechCrunch ↗

Jan 2026

OpenAI and Cerebras sign a $10 billion inference infrastructure deal Cerebras

OpenAI partnered with Cerebras to deploy 750 megawatts of wafer-scale inference hardware, designed for real-time GPT-5 inference. Cerebras separately revived its IPO plans for mid-2026.Cerebras ↗

2025

Dec 2025

NVIDIA acquires Groq for $20 billion NVIDIA

NVIDIA bought Groq's inference chip technology and engineering team in its largest acquisition. Groq's engineers joined a new Real-Time Inference division, reflecting the industry shift from training hardware toward inference-optimised systems.NVIDIA ↗

Jun 2025

AMD ships MI355X, its most competitive data centre GPU AMD

AMD released the Instinct MI355X, claiming four times the performance of its MI300X for AI training and inference. The chip gave cloud providers a credible alternative to NVIDIA and some leverage on pricing.AMD Newsroom ↗

Jan 2025

NVIDIA GeForce RTX 5090 launches — Blackwell arrives for consumers NVIDIA

NVIDIA released the GeForce RTX 5090 on 30 January 2025, priced at $1,999 and built on the Blackwell consumer architecture. With 92 billion transistors and 3,352 AI TOPS, it was twice as fast as the RTX 4090 for AI-accelerated workloads. DLSS 4 introduced multi-frame generation, producing up to three AI-rendered frames for every real frame.NVIDIA Newsroom ↗

2025

AI data centre power demands strain US electrical grids Industry

AI compute pushed electrical grids toward capacity limits. The largest US grid operator projected a six-gigawatt reliability shortfall by 2027. Chip designers responded by making energy efficiency a first-class design goal alongside raw performance.Utility Dive ↗

2025

IEA: global data centres used 415 TWh in 2024, demand set to more than double by 2030 IEA

The International Energy Agency reported that global data centres consumed about 415 terawatt-hours of electricity in 2024, roughly 1.5% of world demand, and projected that this figure could more than double by 2030. The agency identified the United States as the largest contributor and warned that data centres will account for a significant share of US electricity-demand growth over the next decade. It also estimated that around 20% of planned data-centre projects could face delays if grid-infrastructure constraints are not addressed, with some utilities already revising forecasts upward in response.IEA — Energy and AI ↗

Warning

2025

Memory shortages cause 40–60% AI deployment delays Industry

High-bandwidth memory hit severe shortages, creating bottlenecks even as GPU supply improved. Enterprise customers reported significant deployment delays. The pattern showed that AI hardware supply chains involve far more than GPUs alone — memory, packaging, and cooling all became chokepoints in sequence.AI News ↗

2025

Google TurboQuant cuts memory use 6x and speeds attention 8x Google

Google researchers published TurboQuant, an algorithm that compresses the KV cache — the memory store holding context during inference — by six times, while speeding up attention computation eightfold. The approach made it practical to run much longer context windows on existing hardware, easing one of the chief bottlenecks driving the memory shortages of 2025.VentureBeat ↗

2024

Sep 2024

xAI completes Colossus — 100,000 H100s in 122 days xAI

xAI built the Colossus supercomputer cluster in Memphis, Tennessee in 122 days, assembling 100,000 NVIDIA H100 GPUs. It became operational in September 2024 and is used to train and serve the Grok family of models. The speed of construction — roughly four months for a cluster that would normally take over a year — demonstrated what could be done when power, space, and hardware were treated as an emergency procurement problem.xAI ↗

May 2024

Microsoft launches Copilot+ PCs — AI becomes a hardware spec Microsoft / Qualcomm

Microsoft introduced the Copilot+ PC category in May 2024, requiring a minimum 40 TOPS neural processing unit from any manufacturer. The first devices shipped on Qualcomm Snapdragon X Elite, with ARM NPUs handling on-device tasks like live captions, image generation, and semantic search without a cloud call. Intel and AMD followed with their own NPU-equipped chips by year-end, making dedicated AI silicon standard in consumer laptops.Microsoft Blog ↗

2024

Cerebras builds WSE-3 with 4 trillion transistors on a single wafer Cerebras

Cerebras announced its third-generation Wafer-Scale Engine on TSMC 3nm, packing roughly 4 trillion transistors onto a single wafer-sized die. The company raised $1.1 billion at an $8.1 billion valuation to scale production. It remains the largest chip ever built.Cerebras ↗

2022

Mar 2022

NVIDIA announces the H100 Hopper GPU NVIDIA

The H100 introduced a Transformer Engine built specifically for large language models, with up to 9x faster training over the A100. Demand massively outstripped supply throughout 2023, with individual GPUs trading above $40,000 on secondary markets. This was the chip behind the GPT-4 era.NVIDIA Newsroom ↗

2020

Nov 2020

Apple ships the M1, bringing neural engines to consumer laptops Apple

Apple's first custom ARM silicon for Mac included a 16-core Neural Engine running 11 trillion operations per second on TSMC 5nm. It proved that dedicated ML hardware in a consumer device could outperform general-purpose processors in both speed and energy efficiency.Apple Newsroom ↗

May 2020

NVIDIA launches the A100, its first GPU built for AI from scratch NVIDIA

The A100 delivered 19.5 teraflops of FP32 performance with multi-instance GPU technology that let one chip run multiple AI jobs simultaneously. It became the standard training hardware for GPT-3, DALL-E, and the first wave of foundation models.NVIDIA Newsroom ↗

2018

Google opens TPU access to cloud customers Google

After developing Tensor Processing Units internally since 2015, Google made TPU v3 pods available through Google Cloud. TPUs became the training hardware behind BERT and later PaLM, establishing the template for tech giants building their own AI silicon rather than relying entirely on NVIDIA.Google Cloud ↗

Info

2016

May 2016

Google reveals it has been running custom AI chips since 2015 Google

At Google I/O, Google disclosed that custom Tensor Processing Units had been running in its data centres since 2015, powering Search, Street View, and AlphaGo. The announcement showed that the largest AI workloads were already outgrowing general-purpose hardware.DataCenter Knowledge ↗

Info

2012

AlexNet wins ImageNet using two gaming GPUs NVIDIA

Alex Krizhevsky's deep neural network won the ImageNet competition by a wide margin, trained on two NVIDIA GTX 580 consumer graphics cards with 3 GB of memory each. The result proved that GPUs designed for gaming could train neural networks far faster than CPUs — the insight that would eventually redirect NVIDIA's entire business.IEEE Spectrum ↗

2025

Mid-2025

NVIDIA Blackwell B200 GPUs reach mass production NVIDIA

NVIDIA's Blackwell architecture hit full-scale production after initial GB200 NVL72 systems shipped to cloud providers in late 2024. The B200 offered roughly 2.5x speed and 25x energy efficiency gains over Hopper for inference work.NVIDIA ↗

2026

June 22, 2026 Nvidia

Nvidia says warm-liquid cooling can cut AI data-centre cooling load

Axios reported Nvidia's claim that warm-liquid cooling can reduce the cooling burden of AI data centres, easing some of the water and energy pressure that comes with dense compute. The pitch reflects how the AI infrastructure story has moved beyond chips to cooling, water and power. How much it helps in practice will depend on real deployments at scale.Axios →

Info

June 18, 2026 Axios

US regulators push faster grid connections for AI data centres

Axios reported that US energy regulators were moving to speed up grid connections for AI data centres, as power and interconnection queues become a binding constraint on compute growth. The bottleneck for AI is increasingly the grid, cooling and permitting rather than the chips themselves. Faster connections would let more announced capacity actually come online.Axios →

Info

June 2026 Meta

Meta partners with Reliance on an AI-enabled data centre in India

Meta announced a partnership with Reliance to build an AI-enabled data centre in India, adding regional compute capacity in one of the world's largest internet markets. The deal is part of a broader push to expand AI infrastructure beyond the US. It ties a major frontier-AI company to a leading Indian conglomerate for long-term capacity.Meta →

Info

June 10, 2026 OpenAI

OpenAI weighs leasing a 10-gigawatt Ohio data-centre campus

Reuters reported that OpenAI was weighing a lease on a proposed 10-gigawatt data-centre campus in Ohio, with potential backing from Nvidia. The scale underlines how far compute ambitions now outrun existing facilities. A site of that size would be among the largest AI campuses contemplated to date, pending firm commitments.Reuters →

Info