Senior Production Support Engineer - Order Management

at Charles Schwab

Unknown Posted April 16, 2026
Type: Not specified
Experience: Senior
Years of experience: Not specified
Education: Not specified
Category: System Administration

Senior Production Support Engineer role focused on keeping Schwab’s Order Management System (OMS) stable and reliable. Leads complex incident response, drives incident command and post-incident analysis, and improves platform reliability through monitoring, alerting, observability, and automation.

  • Lead response and resolution for complex production incidents
  • Drive incident command and stakeholder communication
  • Shape reliability strategy and support standards
  • Design and evolve monitoring, alerting, observability, and automation
  • Support 24x7 production environment with rotating on-call

The technical scope is production reliability and incident management for an OMS in a high-availability environment. The role includes designing operational models for monitoring and observability, automating support processes, and supporting a 24x7 rotating on-call schedule.

The ideal candidate is a senior production support engineer with experience supporting mission-critical systems in a high-availability environment. They can lead incident response with strong incident command, stakeholder communication, and post-incident analysis, while improving monitoring, alerting, observability, and automation for long-term reliability.

  • Lead response and resolution for complex, business-critical production incidents
  • Drive incident command
  • Design and evolve operational models, including monitoring, alerting, observability, and automation
  • Support a 24x7 production environment through a rotating on-call schedule

Order Management System (OMS)

Order Management System (OMS), high-availability environment, production incident management, incident command, stakeholder communication, post-incident analysis, reliability strategy, monitoring, alerting, observability, automation, operational models, 24x7 production environment support, rotating on-call schedule, technical leadership, mentoring, operational excellence

Order Management System (OMS), high-availability environment, production incident management, incident command, stakeholder communication, post-incident analysis, reliability strategy, monitoring, alerting, observability, automation, operational models, 24x7 production environment support, root cause/systemic risk identification, service restoration during major incidents, technical leadership, operational excellence

technical leadership, clarity in communication, accountability, systems-thinking mindset, working under pressure, sound judgment in ambiguous situations, mentoring, cross-functional collaboration, team support

Industry: Banking
Job Function: Ensure the Order Management System (OMS) remains reliable through senior-level incident leadership and reliability engineering improvements.
Role Subtype: DevOps Engineer

Production Support Engineer, Order Management System (OMS), OMS, high-availability environment, production incidents, incident command, stakeholder communication, post-incident analysis, reliability strategy, monitoring, alerting, observability, automation, operational models, 24x7 production environment, rotating on-call schedule, major incidents, service restoration, technical leadership, enterprise-impacting issues, systems-thinking mindset, operational excellence

  • Must have experience leading the resolution of business-critical production incidents
  • Must be comfortable supporting a rotating 24x7 on-call schedule
