Spot instance recommendations #12

@maksimov

Description

Add spot instance pricing recommendations for suitable workloads.

Currently the only pricing recommendation is On-Demand → Reserved Instance.
Spot instances can save 60-90% over On-Demand pricing, in exchange for the risk of interruption, which makes them a good fit for interruptible workloads.

Core

  • Identify workloads suitable for spot (training jobs, batch inference, dev instances)
  • Exclude persistent endpoints and production inference from spot recommendations
  • Factor in spot pricing when calculating potential savings
  • Adjust idle signal severity for spot instances (idle spot is much cheaper than idle on-demand)
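
A minimal sketch of how the four Core items could fit together, assuming workloads are already tagged with a category and that both hourly prices are supplied by the caller. The `Instance` fields, the category names, and the proportional idle-severity rule are all hypothetical and not part of the existing codebase.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical workload categories, mirroring the bullets above.
SPOT_SUITABLE = {"training", "batch_inference", "dev"}
SPOT_EXCLUDED = {"persistent_endpoint", "production_inference"}


@dataclass
class Instance:
    instance_type: str
    workload: str            # one of the category names above
    on_demand_hourly: float  # USD/hour, supplied by the caller
    spot_hourly: float       # USD/hour, supplied by the caller
    is_spot: bool = False


def is_spot_candidate(inst: Instance) -> bool:
    """True when the instance runs On-Demand and its workload tolerates interruption."""
    if inst.is_spot or inst.workload in SPOT_EXCLUDED:
        return False
    return inst.workload in SPOT_SUITABLE


def spot_savings_pct(inst: Instance) -> Optional[float]:
    """Percentage saved by moving to spot, or None when no recommendation applies."""
    if not is_spot_candidate(inst) or inst.on_demand_hourly <= 0:
        return None
    return 100.0 * (1.0 - inst.spot_hourly / inst.on_demand_hourly)


def idle_severity(inst: Instance, base_severity: float) -> float:
    """Scale an idle signal by the price actually being paid.

    Assumption: severity is proportional to hourly cost, so an idle spot
    instance is weighted by spot_hourly / on_demand_hourly.
    """
    if inst.is_spot and inst.on_demand_hourly > 0:
        return base_severity * inst.spot_hourly / inst.on_demand_hourly
    return base_severity
```

Returning `None` for excluded workloads keeps production inference and persistent endpoints out of the recommendation list entirely, rather than reporting them with zero savings.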

Spot Fleet diversification

  • Suggest mixed-instance Spot Fleets (e.g. G4dn + G5 + G6) to minimize the risk that all capacity is interrupted at once
  • Correlate GPU-specific interruption rates: g5.xlarge (A10G) tends to be more stable than p4d (A100) because it draws on a larger capacity pool
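
One way the diversification suggestion could be sketched, assuming per-type interruption rates are available as fractions in [0, 1]. The function names and the rates in the usage note are illustrative only. The outage estimate treats pools as independent, which is a simplification: real interruptions across pools can be correlated, so the figure is an optimistic lower bound.

```python
from math import prod
from typing import Dict, List


def family(instance_type: str) -> str:
    """Return the instance family, e.g. 'g5.xlarge' -> 'g5'."""
    return instance_type.split(".", 1)[0]


def pick_diverse_pools(rates: Dict[str, float], max_pools: int = 3) -> List[str]:
    """Pick the lowest-interruption type per family, then keep the best max_pools."""
    best: Dict[str, str] = {}
    for itype, rate in rates.items():
        fam = family(itype)
        if fam not in best or rate < rates[best[fam]]:
            best[fam] = itype
    return sorted(best.values(), key=lambda t: rates[t])[:max_pools]


def total_outage_risk(rates: Dict[str, float], pools: List[str]) -> float:
    """Probability that every pool is interrupted at once, ASSUMING independence."""
    return prod(rates[p] for p in pools)
```

For example, with made-up rates `{"g5.xlarge": 0.05, "g5.2xlarge": 0.08, "g6.xlarge": 0.10, "p4d.24xlarge": 0.20}`, `pick_diverse_pools` keeps one type per family and drops `g5.2xlarge` in favor of the more stable `g5.xlarge`.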

Interruption data

  • AWS EC2 Capacity Manager (launched 2026) provides native Spot interruption metrics
  • Use this to give concrete interruption risk percentages per instance type per region
  • Example recommendation: "Switch from g5.12xlarge On-Demand to g5.2xlarge Spot fleet — saves 72% with <5% interruption risk in us-east-1"
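
A rough sketch of how such a recommendation string could be produced, assuming interruption risk has already been loaded into a lookup table keyed by instance type and region. How that table is populated is deliberately left out; `RiskTable`, `format_recommendation`, and the `max_risk` threshold are hypothetical names, and the cost arguments are whatever hourly totals the caller computes for the current and proposed setups.

```python
from typing import Dict, Optional, Tuple

# (instance_type, region) -> interruption risk as a fraction in [0, 1].
# Hypothetical structure; the source of these numbers is not shown here.
RiskTable = Dict[Tuple[str, str], float]


def format_recommendation(
    current_type: str,
    target_type: str,
    region: str,
    on_demand_cost: float,  # hourly cost of the current On-Demand setup
    spot_cost: float,       # hourly cost of the proposed spot setup
    risk_table: RiskTable,
    max_risk: float = 0.05,
) -> Optional[str]:
    """Build a recommendation message, or None if risk is unknown or too high."""
    risk = risk_table.get((target_type, region))
    if risk is None or risk >= max_risk or on_demand_cost <= 0:
        return None
    savings = round(100 * (1 - spot_cost / on_demand_cost))
    return (
        f"Switch from {current_type} On-Demand to {target_type} Spot fleet: "
        f"saves {savings}% with {risk:.0%} interruption risk in {region}"
    )
```

Returning `None` when no risk figure exists for the region avoids quoting a percentage the data does not support.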

Metadata

    Labels

    enhancement (New feature or request), v0.2 (Version 0.2 milestone)