Job Description:
• Design, implement, and maintain CI/CD pipelines using tools such as Jenkins, GitLab CI/CD, or GitHub Actions.
• Manage and maintain Kubernetes clusters in cloud (e.g., GCP GKE) and/or on-prem environments.
• Monitor system performance and availability using Grafana, Prometheus, and other observability tools.
Requirements:
• Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience).
• 3+ years of experience in a DevOps, Site Reliability Engineering (SRE), or related role.
• Proficient in building and maintaining CI/CD pipelines.
• Strong experience with Kubernetes and containerization (Docker).
• Hands-on experience with monitoring and alerting tools such as Grafana, Prometheus, Loki, or ELK Stack.
• Experience with cloud platforms such as AWS, Azure, or Google Cloud.
• Experience with the Camunda (BPMN) platform is a big plus, including installation, sizing and scaling, network configuration, troubleshooting, and monitoring.
• Solid scripting skills (e.g., Bash, Python, or Go).
• Familiarity with configuration management tools such as Ansible, Chef, or Puppet is a plus.
Key Responsibilities:
• Install, configure, design, scale, and monitor BPMN products such as Camunda.
• Automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform or Helm.
• Collaborate with development, QA, and operations teams to ensure smooth and reliable software releases.
• Troubleshoot and resolve issues in dev, test, and production environments.
• Enforce security and compliance best practices across infrastructure and deployment pipelines.
• Participate in on-call rotations and incident response.