Senior Data Platform Reliability Engineer | Onsite

TerraBarn Inc | Cebu City | Full-time

About OpsWerks

OpsWerks is a technical consulting company specializing in operational services for the high-tech industry. We partner with platform and infrastructure teams to operate multi-cloud environments, execute complex migrations, and enable seamless, scalable application deployments.

Your Role

As a Senior Data Platform Reliability Engineer, you will be responsible for the operation, reliability, and continuous improvement of data platforms running on Kubernetes (on-premises and/or AWS/GCP), including frameworks such as DoEKS (Data on EKS) and AIoEKS (AI on EKS).

Key Responsibilities
  • Operate, maintain, and enhance data platforms deployed on Kubernetes environments
  • Deploy platform updates, releases, and configuration changes using GitOps/DevOps practices
  • Monitor system health using logs, metrics, and observability tools to ensure high availability
  • Participate in incident response, root cause analysis (RCA), and 24x7 on-call rotations
  • Improve platform reliability through automation, observability, and self-service tooling
  • Troubleshoot user and system issues, including integrations, performance bottlenecks, and misconfigurations
  • Collaborate with cross-functional teams to ensure seamless data platform operations
  • Provide technical mentorship and guidance to junior engineers
  • Champion platform standards, security best practices, and operational excellence
Qualifications
  • 3+ years of experience supporting production data platforms (e.g., Spark, Airflow, Jupyter)
  • 5+ years of hands-on experience with ETL/ELT pipelines, data processing, and transformation (Python/Java and SQL)
  • Strong experience with Kubernetes, including managed services (AWS EKS / GCP GKE)
  • Solid understanding of Linux systems, microservices architecture, and service communication patterns
  • Strong troubleshooting skills (application failures, latency, scaling, resource contention)
  • Proficiency in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, ELK, Splunk)
Nice to Have
  • Experience with modern data/AI platforms: Flink, Trino, Druid, Ray
  • Automation and scripting skills (Bash, Python)
  • Relevant certifications (e.g., CKAD, AWS Certified Data Engineer)
Why Join OpsWerks?
  • Work on cutting-edge data platforms in multi-cloud environments
  • Exposure to large-scale, enterprise-grade infrastructure
  • Collaborative, engineering-driven culture focused on reliability and innovation
  • Opportunities for technical growth, certification, and mentorship
Immediate start

Site Reliability Engineer

Alsons/AWS Information Systems Incorporation | Cebu City
Job Description. Company: Alsons/AWS Information Systems Inc. (AAISI). Location: i1 Building, Cebu IT Park, Cebu. Work Setup: Onsite, Monday to Friday. Site Reliability Engineer: You will be working with our SRE team, which consists of more than...
High salary

Chief Technology Officer (CTO)

Collabtech APAC Pty Ltd. | Cebu City
Building and managing a small, Cebu-based engineering team  •  Driving AI adoption and automation where it creates clear, measurable internal business value  •  Unifying and modernizing back-office systems across CTG  •  Flexibility in working hours is required...
Urgent

Information Technology (IT) Specialist

Cebu City
Job Description. Posted on 23 April 2026. Site Reliability Engineer - Intern (Associate), OpsWerks Academy Program. Our intensive technical and mindset training is designed to fuel your development, equipping you with the skills and experience...