✦ Luna Orbit — Cloud & Infrastructure

Staff Infrastructure Software Engineer, Enterprise AI

at Scale AI

📍 New York, NY; San Francisco, CA Unknown Posted April 02, 2026
Type Full-Time
Experience lead
Exp. Years 5+ years
Education Not specified
Category Cloud & Infrastructure

Scale is hiring a Staff/ Senior Infrastructure Engineer to architect and implement multi-cloud infrastructure supporting enterprise-scale Agentic workflows. The role combines hands-on delivery with long-term architectural strategy and strong emphasis on security and observability.

  • Architect multi-cloud systems for the SGP platform
  • Define architectural patterns for secure, reliable infra
  • Enhance CI/CD, test frameworks, data quality, reconciliation, anomaly detection
  • Own Agentic observability platform
  • Drive IaC tooling and developer efficiency across the org

This role focuses on cloud and infrastructure engineering, IaC, CI/CD, and observability across AWS/Azure/GCP with Kubernetes. The candidate will own end-to-end infrastructure patterns for Agentic workflows and multi-cloud deployments.

The ideal candidate is a senior/Staff infrastructure engineer with 5+ years building multi-cloud platforms, strong IaC and CI/CD automation, and hands-on experience with AWS/Azure/GCP, Kubernetes, and observability tooling. They should be able to lead technical strategy and mentor teams while delivering reliable infrastructure for enterprise AI.

5+ years of software engineering experienceCI/CDIaC (TerraformHelm Charts)KubernetesDatadog/Prometheus/GrafanaAWS/Azure/GCPPython or JavaScript/TypeScriptSQL
Agentic workflowsLLMsVector databasesObservability toolingMulti-cloud experienceSecurity/compliance experience
TerraformHelm ChartsKubernetesDatadogPrometheusGrafanaAWSAmazon Web ServicesAzureGoogle Cloud PlatformGCP
5+ years of software engineering; CI/CD; IaC (TerraformHelm Charts); Kubernetes; Observability (Datadog/Prometheus/Grafana); AWS/Azure/GCP; Python; JavaScript; TypeScript; SQL
TerraformHelm ChartsKubernetesDatadogPrometheusGrafanaAmazon Web ServicesAWSAzureGoogle Cloud PlatformGCPPythonJavaScriptTypeScriptSQLCI/CDInfrastructure-as-CodeIaCAgentic workflowsLLMsVector databasesVPC
leadershipmentoringcommunicationcollaborationproblem-solvingtime managementadaptability
Industry SaaS
Job Function Lead the design, deployment, and maintenance of Scale AI's multi-cloud infrastructure and Agentic observability platform for enterprise AI
Role Subtype Infrastructure Engineer
Tech Domains Amazon Web Services, Google Cloud Platform, Azure, Kubernetes, Docker, Python, SQL / PostgreSQL, JavaScript
Staff Infrastructure Software EngineerStaff Infrastructure EngineerInfrastructure EngineerTerraformHelm ChartsKubernetesDatadogPrometheusGrafanaCI/CDInfrastructure-as-CodeIaCPythonJavaScriptTypeScriptSQLAmazon Web ServicesAWSAzureGoogle Cloud PlatformGCPmulti-cloudAgentic workflowsLLMsVector databasesObservabilitystaff infrastructure software engineerterraformkubernetesawsamazon web servicesazuregoogle cloud platformiaCci/cdobservability

Lack of 5+ years of software engineering experience, No experience with Terraform, Helm Charts, or Kubernetes, Inability to work in SF or NY locations

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile