Site Reliability Engineering Head

Union Bank Of The Philippines, Pasig

Job Description

Leadership & Strategy
  • Define and execute the SRE strategy aligned with the bank's technology vision and regulatory requirements.
  • Build, lead, and mentor a high-performing team of SREs, fostering a culture of accountability, learning, and continuous improvement.
  • Collaborate with product, infrastructure, and engineering leaders to integrate reliability practices across the SDLC.
Reliability Engineering
  • Drive service-level objectives (SLOs), service-level indicators (SLIs), and error budgets across business-critical systems.
  • Oversee the design and implementation of monitoring, alerting, and automated response systems to ensure platform uptime and performance.
  • Implement robust incident management practices, including root cause analysis, blameless postmortems, and reliability reporting.
Systems & Automation
  • Champion automation in deployment, scaling, and recovery processes to eliminate toil and improve system efficiency.
  • Guide the modernization of legacy systems towards cloud-native and containerized architectures (e.g., Kubernetes, Docker).
  • Lead capacity planning, disaster recovery, and business continuity initiatives.
Security & Compliance
  • Ensure adherence to banking regulations, data protection laws, and cybersecurity standards in all SRE operations.
  • Collaborate with cybersecurity and risk teams to mitigate risks and support audit and compliance requirements.
Qualifications
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • 10+ years of experience in infrastructure, systems engineering, or SRE, including 5+ years in leadership roles.
  • Proven track record of driving reliability in highly regulated, 24/7 environments, preferably in financial services or banking.
  • Deep knowledge of cloud platforms (AWS, Azure, or GCP), DevOps toolchains, observability tools (e.g., Prometheus, Grafana), and CI/CD pipelines.
  • Strong programming/scripting skills (e.g., Python, Go, Bash).
  • Excellent communication, leadership, and stakeholder management skills.