✦ Luna Orbit — Cloud & Infrastructure

Sr Software Development Engineer - Silicon Development Infrastructure , ML Silicon Infrastructure

at Amazon.com

📍 US, TX, Austin Unknown Posted April 14, 2026
Type Full-Time
Experience senior
Exp. Years Not specified
Education Not specified
Category Cloud & Infrastructure

This role builds and operates the infrastructure that accelerates silicon development at Annapurna Labs. You will design platforms, tooling, and automation that help chip design teams iterate faster, validate more thoroughly, and ship silicon to market.

  • Design, implement, and operate AWS cloud infrastructure and high-performance computing clusters using Slurm
  • Build CI/CD pipelines for infrastructure-as-code, container images, deployments, and cluster config changes
  • Develop data pipelines and databases for workflow metadata, job outcomes, and resource utilization
  • Create dashboards and alerting systems with monitoring, incident response, and runbooks
  • Automate and simplify silicon development workflows to improve reliability, performance, and cost efficiency

You will deliver AWS-based cloud infrastructure and high-performance computing clusters using Slurm, while creating infrastructure-as-code and CI/CD pipelines for safe deployments and rollbacks. The job also emphasizes observability through metrics/log pipelines, dashboards, alerting, and incident response runbooks.

The ideal candidate is a senior infrastructure engineer who has built and operated cloud infrastructure and high-performance computing platforms using Slurm. They bring strong DevOps-style experience with infrastructure-as-code, CI/CD pipelines, container images, and end-to-end observability including metrics, logs, dashboards, alerting, and incident response.

cloud infrastructurehigh-performance computing clusters using schedulers like SlurmCI/CD pipelinesinfrastructure-as-codeAWSautomation servicesobservabilitymonitoringincident response processesdashboards and alerting systems
formal verificationREST APIscommand-line interfacescontainer imagesrunbooks
Amazon Web ServicesAWSSlurmREST APIscommand-line interfacesCI/CDinfrastructure-as-codecontainer images
silicon development infrastructurecloud infrastructurehigh-performance computingSlurmelectronic design automationcommand-line interfacesREST APIsautomation servicesinfrastructure-as-codeCI/CD pipelinescontainer imagesservice deploymentscluster configuration changesplatform reliabilityobservabilitymetricslogsdashboardsalerting systemsincident responserunbooksdocumentationbenchmarkingautomation
silicon development infrastructurecloud infrastructurehigh-performance computingelectronic design automationAWS preferredschedulers like Slurmcommand-line interfacesREST APIsautomation servicesinfrastructure-as-codeCI/CD pipelinescontainer imagesservice deploymentscluster configuration changestestingstaged rolloutssafe rollback mechanismsplatform reliabilityplatform performancecost efficiencydata pipelinesmetrics ingestionlogs ingestionworkflow results ingestiondatabases for workflow metadatajob outcomesresource utilization patternsdashboardsalerting systemsmonitoringincident responserunbooksdocumentationoperational excellenceautomationobservabilitybenchmarkingplatform toolingtooling and automation for chip design teamsformal verificationemulationverificationchip design
customer obsessionpartnering with internal customersrapid iterationproblem solvingcross-functional collaborationownership biascommunication with silicon designverificationemulationformal verificationand software teamsoperational mindsetcontinuous improvement
Industry Aerospace
Job Function Architect and operate infrastructure platforms that accelerate silicon development workflows
Role Subtype Platform Engineer
Tech Domains Amazon Web Services, Linux, DevOps & SRE, Kubernetes, Python
Senior Silicon Software Development Infrastructure EngineerSilicon Software Development InfrastructureAnnapurna LabsAWS preferredAmazon Web Servicescloud infrastructurehigh-performance computingSlurmschedulers like Slurmelectronic design automationcommand-line interfacesREST APIsautomation servicesinfrastructure-as-codeCI/CD pipelinescontainer imagesservice deploymentscluster configuration changesplatform reliabilityobservabilitymetricslogsdashboardsalerting systemsincident responserunbooks

Must have experience with cloud infrastructure on Amazon Web Services (AWS preferred), Must have experience with high-performance computing clusters using Slurm, Must have experience implementing CI/CD pipelines and infrastructure-as-code

Apply for this Position →

Get matched to jobs like this

Luna finds roles that fit your skills and career goals — no endless scrolling required.

Create a Free Profile