Senior Data Platform Reliability Engineer | Onsite
TerraBarn Inc Cebu City Full-time
About OpsWerks
OpsWerks is a technical consulting company specializing in operational services for the high-tech industry. We partner with platform and infrastructure teams to operate multi-cloud environments, execute complex migrations, and enable seamless, scalable application deployments.
Your Role
As a Senior Data Platform Engineer, you will be responsible for the operation, reliability, and continuous improvement of data platforms running on Kubernetes (on-premise and/or AWS/GCP), including frameworks such as DoEKS (Data on EKS) and AIoEKS (AI on EKS).
Key Responsibilities- Operate, maintain, and enhance data platforms deployed on Kubernetes environments
- Deploy platform updates, releases, and configuration changes using GitOps/DevOps practices
- Monitor system health using logs, metrics, and observability tools to ensure high availability
- Participate in incident response, root cause analysis (RCA), and 24x7 on-call rotations
- Improve platform reliability through automation, observability, and self-service tooling
- Troubleshoot user and system issues, including integrations, performance bottlenecks, and misconfigurations
- Collaborate with cross-functional teams to ensure seamless data platform operations
- Provide technical mentorship and guidance to junior engineers
- Champion platform standards, security best practices, and operational excellence
- 3+ years experience supporting production data platforms (e.g., Spark, Airflow, Jupyter)
- 5+ years hands-on experience in ETL/ELT pipelines, data processing, and transformation (Python/Java & SQL)
- Strong experience with Kubernetes, including managed services (AWS EKS / GCP GKE)
- Solid understanding of Linux systems, microservices architecture, and service communication patterns
- Strong troubleshooting skills (application failures, latency, scaling, resource contention)
- Proficiency in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Splunk)
- Experience with modern data/AI platforms: Flink, Trino, Druid, Ray.io
- Automation and scripting skills (Bash, Python)
- Relevant certifications (e.g., CKAD, AWS Certified Data Engineer)
- Work on cutting-edge data platforms in multi-cloud environments
- Exposure to large-scale, enterprise-grade infrastructure
- Collaborative, engineering-driven culture focused on reliability and innovation
- Opportunities for technical growth, certification, and mentorship
Alsons/AWS Information Systems IncorporationCebu City
Job Description
Company: Alsons/AWS Information Systems Inc. (AAISI)
Location: i1 Building, Cebu IT Park, Cebu
Work Setup: Onsite, Monday to Friday
SITE RELIABILITY ENGINEER
You will be working with our SRE team which consists of more than...
Collabtech APAC Pty Ltd.Cebu City
Building and managing a small, Cebu‐based engineering team
• Driving AI adoption and automation where it creates clear, measurable internal business value
• Unifying and modernizing back‐office systems across CTG
Flexibility in working hours is required...
Cebu City
Job Description
Posted on 23 April 2026
Site Reliability Engineer - Intern (Associate)
OpsWerks Academy Program
Our intensive technical and mindset training is designed to fuel your development, equipping you with the skills and experience...