About this role
This role involves architecting and operationalizing AI infrastructure to enable deployment of large-scale AI and ML models, focusing on LLMs, model serving, and MLOps within Cisco Data Fabric.
Key Responsibilities
- Design and deploy AI/ML infrastructure
- Build data pipelines and MLOps platforms
- Manage model lifecycle including migration and updates
- Optimize inference latency and performance
- Drive AI opportunities across the stack
Technical Overview
The technical environment includes AI/ML infrastructure, cloud platforms like AWS, Azure, GCP, and tools for model orchestration, deployment, and monitoring, emphasizing scalable AI solutions.
Ideal Candidate
The ideal candidate is a senior AI/ML infrastructure engineer with extensive experience deploying AI and ML features in cloud environments, particularly with large language models and MLOps platforms. They excel in designing scalable, operational AI systems and have a strong background in cloud services like AWS, Azure, and GCP.
Must-Have Skills
Designed and deployed AI/ML infrastructure and features in production cloud environmentsExperience with major AI services including OpenAIAnthropicHuggingFaceAWS BedrockAzure OpenAI ServiceProduction experience with AWSAzureor GCP cloud platformLed technical decisions and architectural direction
Nice-to-Have Skills
LLM-specific infrastructurePrompt managementTool integrationModel serving at scaleInference optimizationPerformance monitoringML frameworks like TensorFlowPyTorch
Tools & Platforms
AWSAmazon Web ServicesAzureGCPHuggingFaceOpenAIAnthropicAWS BedrockAzure OpenAI Service
Required Skills
AI infrastructureML infrastructureAI/MLLarge Language ModelsLLMsModel servingInference optimizationTensorFlowPyTorchOpenAIAnthropicHuggingFaceAWS BedrockAzure OpenAI ServiceModel orchestrationData pipelinesMLOps platformsModel migrationVersion compatibilityModel updatesMonitoringGovernanceDeployment
Hard Skills
AI infrastructureML infrastructureAI/MLMLLarge Language ModelsLLMsModel servingInference optimizationModel formatsTensorFlowPyTorchOpenAIAnthropicHuggingFaceAWS BedrockAzure OpenAI ServiceModel orchestrationData pipelinesMLOps platformsModel migrationVersion compatibilityModel updatesMonitoringGovernanceDeployment
Soft Skills
CollaborationProblem-solvingInnovationTeamworkCommunication
Keywords for Your Resume
AI infrastructureML infrastructureAI/MLLarge Language ModelsLLMsModel servingInference optimizationTensorFlowPyTorchOpenAIAnthropicHuggingFaceAWS BedrockAzure OpenAI ServiceModel orchestrationData pipelinesMLOps platformsModel migrationVersion compatibilityModel updatesMonitoringGovernanceDeploymentAWSAzureGCPMLOps
Deal Breakers
Lack of experience with AI/ML infrastructure, No cloud platform experience, Less than 8 years of relevant experience
Get matched to jobs like this
Luna finds roles that fit your skills and career goals — no endless scrolling required.
Create a Free Profile