Skip to main content

Compute Services

Amazon EC2 (Elastic Compute Cloud)

What it is:
Amazon EC2 provides resizable virtual machines (instances) in the cloud. You can choose from a wide range of instance types optimized for compute, memory, storage, or GPU-based workloads.

Why it matters:

  • Offers full control over compute resources
  • Supports AI/ML training and inference using GPU instances
  • Scales from small experiments to high-performance distributed training

Typical Use Cases:

  • Running deep learning frameworks like TensorFlow or PyTorch on GPU instances
  • Hosting custom-trained ML models for inference
  • Performing large-scale simulations or model training
What is it?

AWS Trainium instances use a custom-designed machine learning chip engineered for high performance with low power consumption, reducing the carbon footprint of training large-scale models.

Key Features:

  • Up to 25% more energy efficient than comparable accelerated computing EC2 instances.
  • Specifically designed for optimal performance per watt for deep learning workloads.
  • Lowers environmental impact compared to other instance types.

Typical Use Cases:

  • Large-scale deep learning training.
  • Organizations prioritizing sustainability and energy efficiency in AI workloads.

Why it matters:
They are the most environmentally friendly choice, helping companies meet sustainability goals while training complex models.

Learn more