Apply now »

Site Reliability Engineer

Role Overview

We are looking for a Site Reliability Engineer (SRE) to support and scale our on-premises container and virtual machine infrastructure. You will work within a team to ensure the stability of our OpenShift and GKE On-Prem clusters, focusing on automation, monitoring, and application delivery.

 

Key Responsibilities

  • Assist in the day-to-day operations and maintenance of OpenShift and GKE (Anthos) clusters running on local hardware.

  • Use Helm, Terraform and GitOps to deploy applications across on-prem environments.

  • Configure and manage Prometheus, FluentD, and Grafana to maintain visibility into system health.

  • Support the performance and availability of Apache Druid, MySQL and PostgreSQL for data processing.

  • Networking: Troubleshoot local DNS, routing, and connectivity issues between geographically dispersed clusters

 

Technical Requirements

  • Hands-on experience with OpenShift and GKE On-Prem.

  • Experience with Helm, GitOps, and Terraform

  • Familiarity with Prometheus, Fluentd and Grafana.

  • Familiarity with maintaining Apache Druid, MySQL and PostgreSQL clusters.

  • Understanding of Networking and DNS in a data center context.

 

Nice to have

  • Proficiency in Python or Go.

 

Trainings

Our company supports the self-development of the employees and provides professional training programs for networking, software development and cloud technologies.

 

Travels

From time to time, you will be requested to travel for work to attend internal gatherings and workshops or to represent the company at conferences or meetings.

 

Ref ID:  60328
Location: 

Athens, I, GR

Business Unit:  PCCW Global
Full Time/ Part Time:  Full Time
Job Function:  Technology
Featured Job Category:: 

Apply now »