Build and train machine learning models on our new Google Cloud TPUs

By Newsroom America Feeds at 17 May 2017

We’re excited to announce that our second-generation Tensor Processing Units (TPUs) are coming to Google Cloud (https://cloud.google.com) to accelerate a wide range of machine learning workloads, including both training and inference. We call them Cloud TPUs (https://cloud.google.com/tpu/), and they will initially be available via Google Compute Engine (https://cloud.google.com/compute/).

We’ve witnessed extraordinary advances in machine learning (ML) over the past few years. Neural networks have dramatically improved (https://research.googleblog.com/2016/09/a-neural-network-for-machine.html) the quality of Google Translate, played a key role (https://www.bloomberg.com/news/articles/2015-10-26/google-turning-its-lucrative-web-search-over-to-ai-machines) in ranking Google Search results and made it more convenient (https://googleblog.blogspot.com/2015/05/picture-this-fresh-approach-to-photos.html) to find the photos you want with Google Photos. Machine learning allowed DeepMind’s AlphaGo (https://deepmind.com/research/alphago/) program to defeat Lee Sedol, one of the world’s top Go players, and also made it possible for software to generate natural-looking sketches (https://research.googleblog.com/2017/04/teaching-machines-to-draw.html).

These breakthroughs required enormous amounts of computation, both to train the underlying machine learning models and to run those models once they’re trained (this is called “inference”).
We’ve designed, built and deployed a family of Tensor Processing Units, or TPUs, to allow us to support larger and larger amounts of machine learning computation, first internally and now externally.

While our first TPU (https://cloudplatform.googleblog.com/2016/05/Google-supercharges-machine-learning-tasks-with-custom-chip.html) was designed to run machine learning models quickly and efficiently, for example to translate a set of sentences or choose the next move in Go, those models still had to be trained separately. Training a machine learning model is even more difficult than running it, and days or weeks of computation on the best available CPUs and GPUs are commonly required to reach state-of-the-art levels of accuracy.

Research and engineering teams at Google and elsewhere have made great progress scaling machine learning training (https://www.tensorflow.org/performance/benchmarks) using readily available hardware. However, this wasn’t enough to meet our machine learning needs, so we designed an entirely new machine learning system to eliminate bottlenecks and maximize overall performance.
At the heart of this system is the second-generation TPU we're announcing today, which can both train and run machine learning models.

[Image: Our new Cloud TPU delivers up to 180 teraflops to train and run machine learning models.]

Each of these new TPU devices delivers up to 180 teraflops of floating-point performance. As powerful as these TPUs are on their own, though, we designed them to work even better together.
Each TPU includes a custom high-speed network that allows us to build machine learning supercomputers we call “TPU pods.” A TPU pod contains 64 second-generation TPUs and provides up to 11.5 petaflops to accelerate the training of a single large machine learning model. That’s a lot of computation!

Using these TPU pods, we've already seen dramatic improvements in training times. One of our new large-scale translation models used to take a full day to train on 32 of the best commercially available GPUs; now it trains to the same accuracy in an afternoon using just one-eighth of a TPU pod.

[Image: A “TPU pod” built with 64 second-generation TPUs delivers up to 11.5 petaflops of machine learning acceleration.]

Introducing Cloud TPUs

We’re bringing our new TPUs to Google Compute Engine (https://cloud.google.com/compute/) as Cloud TPUs (https://cloud.google.com/tpu/), where you can connect them to virtual machines of all shapes and sizes and mix and match them with other types of hardware, including Skylake CPUs and NVIDIA GPUs. You can program these TPUs with TensorFlow (https://www.tensorflow.org/), the most popular open-source machine learning framework on GitHub, and we’re introducing high-level APIs, which will make it easier to train machine learning models on CPUs, GPUs or Cloud TPUs with only minimal code changes.

With Cloud TPUs, you have the opportunity to integrate state-of-the-art ML accelerators directly into your production infrastructure and benefit from on-demand, accelerated computing power without any up-front capital expenses. Since fast ML accelerators place extraordinary demands on surrounding storage systems and networks, we’re making optimizations throughout our Cloud infrastructure to help ensure that you can train powerful ML models quickly using real production data.

Our goal is to help you build the best possible machine learning systems from top to bottom. While Cloud TPUs will benefit many ML applications, we remain committed to offering a wide range of hardware on Google Cloud so you can choose the accelerators that best fit your particular use case at any given time.
For example, Shazam recently announced (https://cloudplatform.googleblog.com/2017/05/Shazam-why-cloud-GPUs-finally-make-sense.html) that they successfully migrated major portions of their music recognition workloads to NVIDIA GPUs on Google Cloud and saved money while gaining flexibility.

Introducing the TensorFlow Research Cloud

Much of the recent progress in machine learning has been driven by unprecedentedly open collaboration among researchers around the world across both industry and academia. However, many top researchers don’t have access to anywhere near as much compute power as they need. To help as many researchers as we can and further accelerate the pace of open machine learning research, we'll make 1,000 Cloud TPUs available at no cost to ML researchers via the TensorFlow Research Cloud (https://www.tensorflow.org/tfrc).

Sign up to learn more

If you’re interested in accelerating training of machine learning models, accelerating batch processing of gigantic datasets, or processing live requests in production using more powerful ML models than ever before, please sign up today (https://services.google.com/fb/forms/tpusignup) to learn more about our upcoming Cloud TPU Alpha program. If you’re a researcher expanding the frontier of machine learning and willing to share your findings with the world, please sign up (https://services.google.com/fb/forms/tpusignup) to learn more about the TensorFlow Research Cloud program.
And if you’re interested in accessing whole TPU pods via Google Cloud, please let us know more about your needs (https://services.google.com/fb/forms/tpusignup).
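The pod figures quoted in this post can be checked with a little arithmetic. The sketch below is illustrative only: it assumes a pod's peak throughput is simply the sum of the per-device peaks (64 devices at up to 180 teraflops each) and ignores interconnect and real-workload effects. The constant and function names are hypothetical and do not come from any Google API.

```python
# Peak figures quoted in the post; real workloads will see less than peak.
TERAFLOPS_PER_TPU = 180  # "up to 180 teraflops" per Cloud TPU device
TPUS_PER_POD = 64        # a TPU pod contains 64 second-generation TPUs


def peak_petaflops(num_tpus: int) -> float:
    """Aggregate peak throughput, in petaflops, for num_tpus devices.

    Assumes peaks add linearly, which is a simplification.
    """
    return num_tpus * TERAFLOPS_PER_TPU / 1000  # 1 petaflop = 1,000 teraflops


full_pod = peak_petaflops(TPUS_PER_POD)

# The translation example used one-eighth of a pod.
eighth_pod_tpus = TPUS_PER_POD // 8
eighth_pod = peak_petaflops(eighth_pod_tpus)

print(f"Full pod: {full_pod:.2f} petaflops")
print(f"One-eighth pod: {eighth_pod_tpus} TPUs, {eighth_pod:.2f} petaflops")
```

Under these assumptions a full pod comes to 11.52 petaflops, consistent with the "up to 11.5 petaflops" quoted above, and one-eighth of a pod corresponds to 8 TPUs.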

http://blog.google:443/topics/google-cloud/google-cloud-offer-tpus-machine-learning/
