Senior Cloud Architect with AWS and Generative AI Expertise
We are seeking a Senior Cloud Architect with deep AWS and Generative AI expertise to design end-to-end architecture for the GT AI OS Gen 3 platform. The ideal candidate will have experience with Kubernetes, AWS, and Generative AI/ML platforms. This is a contract position with a remote work arrangement.
Key Highlights
Key Responsibilities
Technical Skills Required
Benefits & Perks
Nice to Have
Job Description
Job Title: Cloud Architect with AWS
Location: Remote
Job Type: Contract Position
Job Summary
ROLE OVERVIEW
We are looking for a Senior Cloud Architect with deep AWS and applied Generative AI / ML expertise to own end-to-end architecture for the GT AI OS Gen 3 platform — a Kubernetes-native AI operating system serving defense and enterprise customers. You will design the reference architectures across multiple regulated deployment patterns: a FedRAMP Moderate environment in AWS US East, a GDPR/EU AI Act-compliant EU deployment, a GPU inference fabric connecting on-premises Equinix servers to AWS via Direct Connect, and a globally distributed CloudFront delivery layer with SSO. You will also drive financial modeling for all deployment patterns and architect the AWS Marketplace listing with automated provisioning — enabling customers to order, deploy, and operate Gen 3 at the click of a button.
ARCHITECTURE SCOPE — 9 DEPLOYMENT & PRODUCT PATTERNS
FedRAMP US Deployment •EU-Compliant Deployment • GPU Inference Fabric (vLLM / Ollama / HA Proxy) • External Inference API Gateway (HuggingFace, Groq) • CloudFront + SSO Delivery •Bedrock FedRAMP Inference• S3 Backup Architecture •Direct Connect via Equinix Fabric• AWS Marketplace with Automated Provisioning
Interested in remote work opportunities in Devops? Discover Devops Remote Jobs featuring exclusive positions from top companies that offer flexible work arrangements.
REQUIRED QUALIFICATIONS
- 8+ years cloud architecture; 5+ years on AWS with enterprise production deployments.
- 3+ years architecting GenAI / LLM / ML platforms including inference infrastructure, RAG pipelines, and embedding services.
- Demonstrated experience designing FedRAMP Moderate environments — control mapping, SSP contributions, boundary documentation, and continuous monitoring architectures.
- Deep AWS networking: VPC, Direct Connect, Transit Gateway, PrivateLink, Route 53, CloudFront, WAF, and BGP routing.
- Hands-on EKS expertise: RKE2-compatible Kubernetes deployments, GPU node groups, Helm, autoscaling (Karpenter, HPA/VPA), and Kubernetes RBAC.
- Experience with GPU inference serving: vLLM, Ollama, Triton Inference Server, or equivalent; model sizing, throughput benchmarking, and HA topology design.
- Familiarity with AWS Marketplace: SaaS listing, Metering Service, entitlement APIs, and automated provisioning via CloudFormation or CDK.
- Financial modeling proficiency: AWS Pricing Calculator, custom TCO models, and customer-facing pricing design.
- Knowledge of GDPR data residency requirements and EU AI Act compliance posture for AI system providers.
- Proficiency with IaC: Terraform and/or AWS CDK at scale.
Browse our curated collection of remote jobs across all categories and industries, featuring positions from top companies worldwide.
PREFERRED QUALIFICATIONS
- AWS certifications: Solutions Architect – Professional, Machine Learning Specialty, Security Specialty.
- Experience with Equinix Fabric or Equinix Metal interconnect and co-location networking.
- Familiarity with DoD Impact Level architectures (IL4/IL5) or AWS GovCloud.
- Experience with CMMC Level 2 certification readiness and SSP authoring.
- Knowledge of EU AI Act Article 6+ high-risk AI system requirements.
- Prior AWS MAP engagement leadership or APN consulting background.
- Experience integrating with OpenAI-compatible inference APIs: HuggingFace, Groq Cloud, Together AI, Anyscale.
Similar Jobs
Explore other opportunities that match your interests
omni studio
Cloud Application Architect
NTT DATA North America