Skip to content

Allow running large models on AWS spot instances #54

@sujitpal

Description

@sujitpal

create_endpoint should have parameters to allow end-user to run models on AWS spot instances. This would make it significantly cheaper and more practical to run models like Cohere Instant Large (cohere-gpt-xlarge) which require ml.p4d.24xlarge instances.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions