Benefits

Deploy deep learning environments in minutes using prepackaged and fully tested Docker images.

Automatically improve performance with optimized model training for popular frameworks like TensorFlow, PyTorch, and Apache MXNet.

Quickly add machine learning (ML) as a microservice to your applications running on Amazon EKS and Amazon EC2.

Build custom ML workflows for training, validation, and deployment through integration with Amazon SageMaker, Amazon EKS, and Amazon ECS.

Use cases

Autonomous vehicle (AV) deployment

Develop advanced ML models at scale to deploy AV technology safely and quickly within your environments.

Natural language processing (NLP)

Reduce the time needed to deploy your ML models and accelerate time to production with up-to-date frameworks and libraries, including Hugging Face Transformers.

Healthcare data analysis

Analyze raw, disparate health data with advanced analytics, ML, and deep learning capabilities to identify trends and make predictions.

Supported Deep Learning Containers

For details on the support for Deep Learning Containers, see the release notes.

  Frameworks: PyTorch TensorFlow    
  Operating systems: Ubuntu Linux      
  Instances: NVIDIA GPUs AWS Trainium AWS Inferentia  
  Platforms: Amazon EC2 Amazon ECS Amazon EKS AWS Graviton

Customer success

  • Roblox

    Roblox is an immersive gaming and creation platform that offers people millions of ways to be together, inviting its community to explore, create and share endless unique experiences. Roblox's global community of millions of developers create and publish their own immersive multiplayer experiences using Roblox Studio, the platform's all-in-one creation engine that enables anyone to build anything they can imagine.

    As part of the AI platform at Roblox, we train and serve 250+ models across our business. Virtually every interaction on Roblox has some form of AI powering it, including safety, generative AI 3D content creation, experiences recommendation, real-time translation and other use cases. We are using AWS Deep Learning Containers for our EKS GPU node groups, and because of that we don't worry about the infrastructure, the Nvidia drivers and CUDA installations - this works out of the box. We can focus on what really matters to us, improving GPU scheduling and GPU utilization to benefit our users and lower our costs to serve those AI workloads.

    Denis Goupil, Principal Machine Learning Engineer - Roblox

Explore more of AWS