Site Reliability Engineering Head
Union Bank of the Philippines, Pasig
Job Description
Leadership & Strategy
- Define and execute the SRE strategy aligned with the bank's technology vision and regulatory requirements.
- Build, lead, and mentor a high-performing team of SREs, fostering a culture of accountability, learning, and continuous improvement.
- Collaborate with product, infrastructure, and engineering leaders to integrate reliability practices across the SDLC.
- Drive service-level objectives (SLOs), service-level indicators (SLIs), and error budgets across business-critical systems.
- Oversee the design and implementation of monitoring, alerting, and automated response systems to ensure platform uptime and performance.
- Implement robust incident management practices, including root cause analysis, blameless postmortems, and reliability reporting.
- Champion automation in deployment, scaling, and recovery processes to eliminate toil and improve system efficiency.
- Guide the modernization of legacy systems towards cloud-native and containerized architectures (e.g., Kubernetes, Docker).
- Lead capacity planning, disaster recovery, and business continuity initiatives.
- Ensure adherence to banking regulations, data protection laws, and cybersecurity standards in all SRE operations.
- Collaborate with cybersecurity and risk teams to mitigate risks and support audit and compliance requirements.
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 10+ years of experience in infrastructure, systems engineering, or SRE, including 5+ years in leadership roles.
- Proven track record of driving reliability in highly regulated, 24/7 environments, preferably in financial services or banking.
- Deep knowledge of cloud platforms (AWS, Azure, or GCP), DevOps toolchains, observability tools (e.g., Prometheus, Grafana), and CI/CD pipelines.
- Strong programming/scripting skills (e.g., Python, Go, Bash).
- Excellent communication, leadership, and stakeholder management skills.
Monroe Consulting Group Parañaque, 11 km from Pasig
the technology stack. This position will report in Paranaque, Philippines.
Job Summary:
The Senior Engineer - Site Reliability is responsible for maintaining the health and performance of applications, services, and infrastructure through the use of monitoring...
Quezon City, 10 km from Pasig
and scalability.
• Collaborate with development teams to implement best practices for reliability and security.
• Optimize system performance through continuous improvement and proactive maintenance.
• Implement effective monitoring and alert systems...
Makati, 6 km from Pasig
Manage and operate scalable cloud-based SaaS platforms by ensuring high availability, tuning performance, and maintaining end-to-end system health
• Deliver core operational support and engineering expertise for complex, distributed application...