Senior Cloud Architect with AWS and Generative AI Expertise

itmc systems, inc • United State
Remote
Apply
AI Summary

We are seeking a Senior Cloud Architect with deep AWS and Generative AI expertise to design end-to-end architecture for the GT AI OS Gen 3 platform. The ideal candidate will have experience with Kubernetes, AWS, and Generative AI/ML platforms. This is a contract position with a remote work arrangement.

Key Highlights
Design end-to-end architecture for the GT AI OS Gen 3 platform
Experience with Kubernetes, AWS, and Generative AI/ML platforms
Contract position with remote work arrangement
Key Responsibilities
Design reference architectures across multiple regulated deployment patterns
Drive financial modeling for all deployment patterns
Architect the AWS Marketplace listing with automated provisioning
Technical Skills Required
AWS Kubernetes Generative AI Machine Learning FedRAMP GDPR EU AI Act CloudFront S3 Direct Connect Equinix Fabric Terraform AWS CDK Helm Autoscaling Kubernetes RBAC GPU inference serving vLLM Ollama Triton Inference Server
Benefits & Perks
Contract position
Remote work arrangement
Nice to Have
AWS certifications: Solutions Architect – Professional, Machine Learning Specialty, Security Specialty
Experience with Equinix Fabric or Equinix Metal interconnect and co-location networking
Familiarity with DoD Impact Level architectures (IL4/IL5) or AWS GovCloud

Job Description


Job Title: Cloud Architect with AWS

Location: Remote

Job Type: Contract Position

Job Summary

ROLE OVERVIEW

We are looking for a Senior Cloud Architect with deep AWS and applied Generative AI / ML expertise to own end-to-end architecture for the GT AI OS Gen 3 platform — a Kubernetes-native AI operating system serving defense and enterprise customers. You will design the reference architectures across multiple regulated deployment patterns: a FedRAMP Moderate environment in AWS US East, a GDPR/EU AI Act-compliant EU deployment, a GPU inference fabric connecting on-premises Equinix servers to AWS via Direct Connect, and a globally distributed CloudFront delivery layer with SSO. You will also drive financial modeling for all deployment patterns and architect the AWS Marketplace listing with automated provisioning — enabling customers to order, deploy, and operate Gen 3 at the click of a button.



ARCHITECTURE SCOPE — 9 DEPLOYMENT & PRODUCT PATTERNS

FedRAMP US Deployment •EU-Compliant Deployment • GPU Inference Fabric (vLLM / Ollama / HA Proxy) • External Inference API Gateway (HuggingFace, Groq) • CloudFront + SSO Delivery •Bedrock FedRAMP Inference• S3 Backup Architecture •Direct Connect via Equinix Fabric• AWS Marketplace with Automated Provisioning


REQUIRED QUALIFICATIONS

  • 8+ years cloud architecture; 5+ years on AWS with enterprise production deployments.
  • 3+ years architecting GenAI / LLM / ML platforms including inference infrastructure, RAG pipelines, and embedding services.
  • Demonstrated experience designing FedRAMP Moderate environments — control mapping, SSP contributions, boundary documentation, and continuous monitoring architectures.
  • Deep AWS networking: VPC, Direct Connect, Transit Gateway, PrivateLink, Route 53, CloudFront, WAF, and BGP routing.
  • Hands-on EKS expertise: RKE2-compatible Kubernetes deployments, GPU node groups, Helm, autoscaling (Karpenter, HPA/VPA), and Kubernetes RBAC.
  • Experience with GPU inference serving: vLLM, Ollama, Triton Inference Server, or equivalent; model sizing, throughput benchmarking, and HA topology design.
  • Familiarity with AWS Marketplace: SaaS listing, Metering Service, entitlement APIs, and automated provisioning via CloudFormation or CDK.
  • Financial modeling proficiency: AWS Pricing Calculator, custom TCO models, and customer-facing pricing design.
  • Knowledge of GDPR data residency requirements and EU AI Act compliance posture for AI system providers.
  • Proficiency with IaC: Terraform and/or AWS CDK at scale.



PREFERRED QUALIFICATIONS

  • AWS certifications: Solutions Architect – Professional, Machine Learning Specialty, Security Specialty.
  • Experience with Equinix Fabric or Equinix Metal interconnect and co-location networking.
  • Familiarity with DoD Impact Level architectures (IL4/IL5) or AWS GovCloud.
  • Experience with CMMC Level 2 certification readiness and SSP authoring.
  • Knowledge of EU AI Act Article 6+ high-risk AI system requirements.
  • Prior AWS MAP engagement leadership or APN consulting background.
  • Experience integrating with OpenAI-compatible inference APIs: HuggingFace, Groq Cloud, Together AI, Anyscale.


Similar Jobs

Explore other opportunities that match your interests

AI Cloud Infrastructure Engineer

Devops
•
14h ago
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Mid-Senior level

omni studio

United State

Cloud Application Architect

Devops
•
1d ago

Premium Job

Sign up is free! Login or Sign up to view full details.

•••••• •••••• ••••••
Job Type ••••••
Experience Level ••••••

NTT DATA North America

United State
Visa Sponsorship Relocation Remote
Job Type Full-time
Experience Level Associate

remotehunter

United State

Subscribe our newsletter

New Things Will Always Update Regularly