Position Details
About this role
Lead architect and technical authority for a scalable ML inference platform within Prisma AIRS, driving MLOps practices and production-grade AI for security applications.
Key Responsibilities
- Architect and design scalable ML inference platform
- Provide technical leadership and mentorship
- Drive model and system performance optimization
- Set engineering standards for automated model deployment and monitoring
- Collaborate with cross-functional teams to ensure end-to-end system cohesion
Technical Overview
- Languages: Python, Java, C++
- Containers & orchestration: Kubernetes, Docker
- ML frameworks & runtimes: TensorFlow, PyTorch, ONNX, TensorRT, vLLM/SGLang, CUDA
- Streaming & data processing: Kafka, Spark, Flink
- Cloud: GCP, AWS, Azure, OCI
- CI/CD: Jenkins, GitLab CI, Tekton
- Focus: distributed, low-latency ML inference and model deployment
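The low-latency inference focus above can be illustrated with a common serving pattern: dynamic batching, where concurrent requests are grouped into small batches so the model runs fewer, larger passes. This is a minimal, hypothetical sketch in plain Python; `fake_model`, `BatchingServer`, and the parameter values are illustrative stand-ins, not part of the role's actual stack, and a real server (e.g., one built on TensorRT or vLLM) would run a GPU forward pass where `fake_model` is called.

```python
import queue
import threading
import time

def fake_model(batch):
    # Placeholder for a real model forward pass (TensorFlow/PyTorch/ONNX).
    return [x * 2 for x in batch]

class BatchingServer:
    """Hypothetical dynamic-batching loop: collect requests until the
    batch fills up or a short deadline passes, then run them together."""

    def __init__(self, max_batch=8, max_wait_ms=5):
        self.requests = queue.Queue()
        self.max_batch = max_batch
        self.max_wait = max_wait_ms / 1000.0
        threading.Thread(target=self._loop, daemon=True).start()

    def infer(self, x):
        # Each caller blocks on its own single-slot result queue.
        result = queue.Queue(maxsize=1)
        self.requests.put((x, result))
        return result.get()

    def _loop(self):
        while True:
            x, result = self.requests.get()
            batch, results = [x], [result]
            deadline = time.monotonic() + self.max_wait
            # Accumulate more requests until the batch is full or time is up.
            while len(batch) < self.max_batch:
                timeout = deadline - time.monotonic()
                if timeout <= 0:
                    break
                try:
                    x, result = self.requests.get(timeout=timeout)
                except queue.Empty:
                    break
                batch.append(x)
                results.append(result)
            for out, r in zip(fake_model(batch), results):
                r.put(out)

server = BatchingServer()
print(server.infer(21))  # -> 42
```

The design trade-off this sketch shows is latency vs. throughput: a longer `max_wait_ms` yields bigger batches (better GPU utilization) at the cost of per-request latency.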
Ideal Candidate
The ideal candidate is a senior ML platform engineer with deep expertise in ML systems, MLOps, and scalable inference, strong cloud experience (GCP/AWS/Azure), and a track record architecting distributed ML services.
Deal Breakers
- No BS/MS/PhD in CS/EE or equivalent
- No cloud experience on GCP, or lack of Kubernetes/Docker experience
- No experience with ML frameworks (TensorFlow, PyTorch)