Skip to content

feat: Add SageMaker HyperPod EKS deployment target and MCP server int…#106

Merged
dferguson992 merged 1 commit intoawslabs:mainfrom
dferguson992:feat/mcp-server-integration
Mar 17, 2026
Merged

feat: Add SageMaker HyperPod EKS deployment target and MCP server int…#106
dferguson992 merged 1 commit intoawslabs:mainfrom
dferguson992:feat/mcp-server-integration

Conversation

@dferguson992
Copy link
Contributor

Issue #, if available: #82, #89

Description of changes:

Summary
Adds support for deploying ML containers to SageMaker HyperPod EKS clusters as an alternative to managed inference endpoints, and introduces MCP (Model Context Protocol) server integration for intelligent instance and region recommendations.

What changed
HyperPod EKS deployment target

New deploymentTarget prompt: users choose between managed-inference (existing) and hyperpod-eks
Kubernetes manifest templates (deployment.yaml, service.yaml, configmap.yaml, pvc.yaml) for HyperPod EKS deployments with GPU resource requests, tolerations, health probes on port 8080, and optional FSx volume mounts
Unified do/ scripts (deploy, clean, logs, test, config) now branch on deployment target — single script per action, no target-specific script files
do/deploy handles kubectl apply for HyperPod and aws sagemaker create-endpoint for managed inference
do/test supports kubectl port-forward testing for HyperPod and sagemaker-runtime invoke-endpoint for managed inference
do/export template added for configuration export
prompt-runner.js and prompts.js extended with HyperPod-specific prompts (cluster name, namespace, replicas, FSx volume handle)
template-manager.js handles conditional HyperPod directory inclusion via ignore patterns
config-manager.js extended with HyperPod parameter support, source enforcement, and deployment target awareness
MCP server integration

mcp-client.js and mcp-command-handler.js for managing MCP server configurations
Three MCP servers under servers/: instance-recommender, region-picker, and hyperpod-cluster-picker
Shared Bedrock client library in servers/lib/
MCP configuration documentation (
mcp-configuration.md
)
Test coverage

13 new property-based test files covering deployment target branching, K8s manifest correctness, script generation, prompt ordering, backward compatibility, and HyperPod cluster picker
Existing tests updated for the new deploymentTarget parameter
486 tests passing, 0 failing

@github-actions github-actions bot added documentation Improvements or additions to documentation generator tests ci dependencies labels Mar 17, 2026
@dferguson992 dferguson992 merged commit dada18d into awslabs:main Mar 17, 2026
10 checks passed
@dferguson992 dferguson992 deleted the feat/mcp-server-integration branch March 17, 2026 14:31
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ci dependencies documentation Improvements or additions to documentation generator tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants