About this role
This role builds and operates the infrastructure that accelerates silicon development at Annapurna Labs. You will design platforms, tooling, and automation that help chip design teams iterate faster, validate more thoroughly, and ship silicon to market.
Key Responsibilities
- Design, implement, and operate AWS cloud infrastructure and high-performance computing clusters using Slurm
- Build CI/CD pipelines for infrastructure-as-code, container images, deployments, and cluster config changes
- Develop data pipelines and databases for workflow metadata, job outcomes, and resource utilization
- Create dashboards and alerting systems with monitoring, incident response, and runbooks
- Automate and simplify silicon development workflows to improve reliability, performance, and cost efficiency
Technical Overview
You will deliver AWS-based cloud infrastructure and high-performance computing clusters using Slurm, while creating infrastructure-as-code and CI/CD pipelines for safe deployments and rollbacks. The job also emphasizes observability through metrics/log pipelines, dashboards, alerting, and incident response runbooks.
Ideal Candidate
The ideal candidate is a senior infrastructure engineer who has built and operated cloud infrastructure and high-performance computing platforms using Slurm. They bring strong DevOps-style experience with infrastructure-as-code, CI/CD pipelines, container images, and end-to-end observability including metrics, logs, dashboards, alerting, and incident response.
Must-Have Skills
cloud infrastructurehigh-performance computing clusters using schedulers like SlurmCI/CD pipelinesinfrastructure-as-codeAWSautomation servicesobservabilitymonitoringincident response processesdashboards and alerting systems
Nice-to-Have Skills
formal verificationREST APIscommand-line interfacescontainer imagesrunbooks
Tools & Platforms
Amazon Web ServicesAWSSlurmREST APIscommand-line interfacesCI/CDinfrastructure-as-codecontainer images
Required Skills
silicon development infrastructurecloud infrastructurehigh-performance computingSlurmelectronic design automationcommand-line interfacesREST APIsautomation servicesinfrastructure-as-codeCI/CD pipelinescontainer imagesservice deploymentscluster configuration changesplatform reliabilityobservabilitymetricslogsdashboardsalerting systemsincident responserunbooksdocumentationbenchmarkingautomation
Hard Skills
silicon development infrastructurecloud infrastructurehigh-performance computingelectronic design automationAWS preferredschedulers like Slurmcommand-line interfacesREST APIsautomation servicesinfrastructure-as-codeCI/CD pipelinescontainer imagesservice deploymentscluster configuration changestestingstaged rolloutssafe rollback mechanismsplatform reliabilityplatform performancecost efficiencydata pipelinesmetrics ingestionlogs ingestionworkflow results ingestiondatabases for workflow metadatajob outcomesresource utilization patternsdashboardsalerting systemsmonitoringincident responserunbooksdocumentationoperational excellenceautomationobservabilitybenchmarkingplatform toolingtooling and automation for chip design teamsformal verificationemulationverificationchip design
Soft Skills
customer obsessionpartnering with internal customersrapid iterationproblem solvingcross-functional collaborationownership biascommunication with silicon designverificationemulationformal verificationand software teamsoperational mindsetcontinuous improvement
Keywords for Your Resume
Senior Silicon Software Development Infrastructure EngineerSilicon Software Development InfrastructureAnnapurna LabsAWS preferredAmazon Web Servicescloud infrastructurehigh-performance computingSlurmschedulers like Slurmelectronic design automationcommand-line interfacesREST APIsautomation servicesinfrastructure-as-codeCI/CD pipelinescontainer imagesservice deploymentscluster configuration changesplatform reliabilityobservabilitymetricslogsdashboardsalerting systemsincident responserunbooks
Deal Breakers
Must have experience with cloud infrastructure on Amazon Web Services (AWS preferred), Must have experience with high-performance computing clusters using Slurm, Must have experience implementing CI/CD pipelines and infrastructure-as-code
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile