{"id":15269,"date":"2025-10-04T15:27:19","date_gmt":"2025-10-04T22:27:19","guid":{"rendered":"https:\/\/mattfife.com\/?p=15269"},"modified":"2025-09-19T15:38:54","modified_gmt":"2025-09-19T22:38:54","slug":"google-ironwood-tpu-9216-chips","status":"publish","type":"post","link":"https:\/\/mattfife.com\/?p=15269","title":{"rendered":"Google Ironwood TPU 9216 chips"},"content":{"rendered":"\n<p>Google\u00a0ended the <a href=\"https:\/\/hc2025.hotchips.org\/\" data-type=\"link\" data-id=\"https:\/\/hc2025.hotchips.org\/\">Hot Chips 2025<\/a> machine learning session with a detailed look at its newest tensor processing unit, <a href=\"https:\/\/blog.google\/products\/google-cloud\/ironwood-tpu-age-of-inference\/\" data-type=\"link\" data-id=\"https:\/\/blog.google\/products\/google-cloud\/ironwood-tpu-age-of-inference\/\">Ironwood<\/a>. First revealed at\u00a0<a href=\"https:\/\/www.techradar.com\/pro\/google-cloud-unveils-ironwood-its-7th-gen-tpu-to-help-boost-ai-performance-and-inference\">Google Cloud Next 25<\/a>\u00a0in April 2025, Ironwood is Google&#8217;s first TPU (Tensor Processing Unit) designed primarily for large scale inference workloads &#8211; and it&#8217;s a whopper.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"640\" height=\"360\" data-attachment-id=\"15270\" data-permalink=\"https:\/\/mattfife.com\/?attachment_id=15270\" data-orig-file=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?fit=1200%2C675&amp;ssl=1\" data-orig-size=\"1200,675\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?fit=640%2C360&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?resize=640%2C360&#038;ssl=1\" alt=\"\" class=\"wp-image-15270\" srcset=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?resize=1024%2C576&amp;ssl=1 1024w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?resize=300%2C169&amp;ssl=1 300w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?resize=768%2C432&amp;ssl=1 768w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?resize=480%2C270&amp;ssl=1 480w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/WJvozZEbZN8Ut7gUy5WH3B-1200-80.jpg.webp?w=1200&amp;ssl=1 1200w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/figure>\n\n\n\n<p>The architecture is incredible. It delivers 4,614 TFLOPs of FP8 performance &#8211; and eight stacks of HBM3e provide 192GB of memory capacity per chip and is paired with 7.3TB\/s bandwidth. With 1.2TBps of I\/O bandwidth, the system can scale up to 9,216 chips per pod without glue logic and reach a whopping 42.5 exaflops of performance. It absolutely <a href=\"https:\/\/mattfife.com\/?p=3632\" data-type=\"link\" data-id=\"https:\/\/mattfife.com\/?p=3632\">trounces their previous TPUs<\/a>.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img data-recalc-dims=\"1\" loading=\"lazy\" decoding=\"async\" width=\"640\" height=\"360\" data-attachment-id=\"15271\" data-permalink=\"https:\/\/mattfife.com\/?attachment_id=15271\" data-orig-file=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?fit=1000%2C562&amp;ssl=1\" data-orig-size=\"1000,562\" data-comments-opened=\"1\" data-image-meta=\"{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}\" data-image-title=\"TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp\" data-image-description=\"\" data-image-caption=\"\" data-large-file=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?fit=640%2C360&amp;ssl=1\" src=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?resize=640%2C360&#038;ssl=1\" alt=\"\" class=\"wp-image-15271\" srcset=\"https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?w=1000&amp;ssl=1 1000w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?resize=300%2C169&amp;ssl=1 300w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?resize=768%2C432&amp;ssl=1 768w, https:\/\/i0.wp.com\/mattfife.com\/wp-content\/themes\/mattTheme\/headerimgs\/2025\/09\/TPUv7_Inline_PeakPerformanceGrap.width-1000.format-webp.webp?resize=480%2C270&amp;ssl=1 480w\" sizes=\"auto, (max-width: 640px) 100vw, 640px\" \/><\/figure>\n\n\n\n<p>Deployment is already underway at hyperscale in Google Cloud data centers, although the TPU remains an internal platform not available directly to customers.<\/p>\n\n\n\n<p>Links:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/blog.google\/products\/google-cloud\/ironwood-tpu-age-of-inference\/\">https:\/\/blog.google\/products\/google-cloud\/ironwood-tpu-age-of-inference\/<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/hc2025.hotchips.org\/#clip=16hoeeeh601w\">https:\/\/hc2025.hotchips.org\/#clip=16hoeeeh601w<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.techradar.com\/pro\/googles-most-powerful-supercomputer-ever-has-a-combined-memory-of-1-77pb-apparently-a-new-world-record-for-shared-memory-multi-cpu-setups\">https:\/\/www.techradar.com\/pro\/googles-most-powerful-supercomputer-ever-has-a-combined-memory-of-1-77pb-apparently-a-new-world-record-for-shared-memory-multi-cpu-setups<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Google\u00a0ended the Hot Chips 2025 machine learning session with a detailed look at its newest tensor processing unit, Ironwood. First revealed at\u00a0Google Cloud Next 25\u00a0in April 2025, Ironwood is Google&#8217;s first TPU (Tensor Processing Unit) designed primarily for large scale inference workloads &#8211; and it&#8217;s a whopper. The architecture is incredible. It delivers 4,614 TFLOPs of FP8 performance &#8211; and eight stacks of HBM3e provide 192GB of memory capacity per chip and is paired with 7.3TB\/s bandwidth. With 1.2TBps of&#8230;<\/p>\n<p class=\"read-more\"><a class=\"btn btn-default\" href=\"https:\/\/mattfife.com\/?p=15269\"> Read More<span class=\"screen-reader-text\">  Read More<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[9],"tags":[],"class_list":["post-15269","post","type-post","status-publish","format-standard","hentry","category-cool"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p4WECr-3Yh","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts\/15269","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=15269"}],"version-history":[{"count":3,"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts\/15269\/revisions"}],"predecessor-version":[{"id":15274,"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts\/15269\/revisions\/15274"}],"wp:attachment":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=15269"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=15269"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=15269"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}