Google Tensor Processing Units – version 4

I’ve written about Google’s custom TPU silicon before (Google’s Tensor Processing Units – v1).

One of the big reasons Google and other web-scale companies develop their own custom chips is that general-purpose CPUs are flexible but power-hungry. That power costs a lot of money in electricity and cooling across huge data centers. So why buy chips full of features you don’t need when you can build your own – and save millions of dollars a year in power and cooling costs?

In just six years, Google has designed and built four generations of increasingly capable AI data center chips. From somewhat humble beginnings, they have become seriously powerful, and Google has just published information about TPU version 4.

What is this new chip capable of?

  • a nearly 10x leap in scaling ML system performance over TPU v3, 
  • boosting energy efficiency ~2-3x compared to contemporary ML DSAs (domain-specific architectures), and 
  • reducing CO2e by as much as ~20x over these DSAs in typical on-premise data centers (a back-of-envelope sketch of how that multiplies out follows this list)
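To see how those multipliers could compound to ~20x, here is a back-of-envelope sketch in Python. All of the numbers below are illustrative assumptions of mine, not figures from the paper: the idea is simply that a ~2-3x more efficient chip running in a data center with a much cleaner energy mix multiplies out to a far larger CO2e reduction.

```python
# Back-of-envelope sketch of how a ~2-3x chip efficiency gain can
# compound into a ~20x CO2e reduction. All numbers are illustrative
# assumptions, not figures from the TPU v4 paper.

ON_PREM_ENERGY_KWH = 1_000.0                   # assumed energy for a training run on a contemporary DSA
TPU_V4_ENERGY_KWH = ON_PREM_ENERGY_KWH / 2.5   # ~2-3x more energy-efficient hardware

ON_PREM_CARBON = 0.40     # assumed kg CO2e per kWh for a typical on-premise grid mix
GOOGLE_DC_CARBON = 0.05   # assumed kg CO2e per kWh with cleaner energy and better PUE

on_prem_co2e = ON_PREM_ENERGY_KWH * ON_PREM_CARBON
tpu_v4_co2e = TPU_V4_ENERGY_KWH * GOOGLE_DC_CARBON

print(f"on-prem DSA: {on_prem_co2e:.0f} kg CO2e")
print(f"TPU v4:      {tpu_v4_co2e:.0f} kg CO2e")
print(f"reduction:   ~{on_prem_co2e / tpu_v4_co2e:.0f}x")  # 2.5 * (0.40 / 0.05) = 20x
```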

Even crazier, it’s built around purely optical switching.

TPU v4 is the first supercomputer to deploy reconfigurable OCS (optical circuit switches). OCSes dynamically reconfigure their interconnect topology, and they are much cheaper, lower-power, and faster than InfiniBand. The paper’s figure shows how an OCS works, using two MEMS mirror arrays: no optical-to-electrical-to-optical conversion or power-hungry network packet switches are required, saving power.
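As a mental model (not Google’s actual control software), an OCS is essentially a dynamically reprogrammable permutation of ports: each MEMS mirror steers light from one input fiber to one output fiber, so “reconfiguring the topology” just means installing a new port mapping. A minimal toy sketch:

```python
# Toy model of an optical circuit switch: a reconfigurable one-to-one
# mapping of input ports to output ports. Purely illustrative; the real
# OCS steers light with MEMS mirror arrays and has no packet buffers.

class OpticalCircuitSwitch:
    def __init__(self, num_ports: int):
        self.num_ports = num_ports
        # Identity mapping to start: port i connects straight through to port i.
        self.mapping = {i: i for i in range(num_ports)}

    def reconfigure(self, mapping: dict[int, int]) -> None:
        """Install a new circuit configuration. It must be a permutation:
        each input lights up exactly one output."""
        if sorted(mapping) != list(range(self.num_ports)) or \
           sorted(mapping.values()) != list(range(self.num_ports)):
            raise ValueError("mapping must be a permutation of all ports")
        self.mapping = dict(mapping)

    def route(self, in_port: int) -> int:
        # Light entering in_port exits here; no O-E-O conversion, no buffering.
        return self.mapping[in_port]


# Example: rewire a 4-port switch from a ring-like pattern to its reverse,
# the way a scheduler might match the interconnect to a new job's shape.
ocs = OpticalCircuitSwitch(4)
ocs.reconfigure({0: 1, 1: 2, 2: 3, 3: 0})  # "ring" circuits
print([ocs.route(p) for p in range(4)])     # [1, 2, 3, 0]
ocs.reconfigure({0: 3, 1: 2, 2: 1, 3: 0})  # new topology, no fibers touched
print([ocs.route(p) for p in range(4)])     # [3, 2, 1, 0]
```

As I understand the paper, this reconfigurability is what lets a 4,096-chip TPU v4 pod be carved into slices with different 3D torus shapes, and lets the scheduler route around failed racks without anyone physically rewiring a fiber.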

On top of this, Google claims the newest version is 1.2-1.7x faster and up to 1.9x more power-efficient than Nvidia’s A100 chips.

Worth a read.
