Site Reliability Engineer / DevOps Engineer
Site Reliability Engineer – Mid Level
Are you an experienced Site Reliability Engineer (SRE) or DevOps engineer looking to shape the future of cloud platform services? Here's your chance to join a new team at an early stage and make a significant impact on systems and services offered to customers and partners in EMEA and beyond.
The Role:
As a Site Reliability Engineer, you'll be responsible for critical areas within the Cloud Platform, including operations, availability, performance, monitoring, change management, emergency response, security, and capacity planning.
Key responsibilities include:
- Designing, implementing, and maintaining robust, scalable, high-quality software and systems within the SRE domain.
- Managing and supporting in-house development systems, CI/CD pipelines, tools, monitoring, and alerting, with a focus on automation to streamline activities and reduce toil.
- Collaborating closely with developers and architects to ensure designed solutions meet non-functional requirements such as availability, performance, security, and maintainability.
- Leading incident management, post-mortem reviews, and continuous improvement initiatives, contributing to the evolution of processes and systems within the organization.
- Defining key metrics and technical decisions driving products and their delivery, including SLOs and SLAs, architecture, best practices, and cost optimization.
- Taking responsibility for complex project tasks, striving for higher standards of individual and team performance.
- Building relationships external to the team and driving knowledge sharing across all products.
- Identifying personal development opportunities, setting goals, and delivering impactful contributions, while also mentoring and training other engineers throughout the company.
The Person:
To excel in this role, you should possess:
- 3-5 years of experience using AWS platform and services.
- Proficiency with common tools and technologies used within CI/CD and Build pipelines, including Terraform, Jenkins, Gitlab, Nexus, Ansible, Maven, Docker, and Helm.
- A BSc in an IT-related field or equivalent professional experience in cloud operations and/or cloud platforms in a DevOps engineering or SRE role.
- Familiarity with commonly used operating systems including Ubuntu and Windows, along with scripting experience in Bash, Python, or PowerShell.
- Experience troubleshooting issues in a cloud environment and working with multiple teams to facilitate orderly project and release plans.
- Familiarity with VMWare vSphere (ESXi, vCenter) desirable.
We're looking for candidates with:
- Extensive experience across a wide range of AWS services, including compute, containerization, storage, database, networking, automation, IAM, security, monitoring, logging, backup, and configuration management.
- Previous experience in an SRE team and understanding of SRE principles.
- Experience in backup and restore processes, cloud-based multi-tenancy, emergency response, and on-call duties.
- Understanding of current best practices around Security Management, patching, branching strategy, release management, and Linux administration.
- Experience working in an agile development team using SCRUM methodologies.
Site Reliability Engineer – Mid Level
Are you an experienced Site Reliability Engineer (SRE) or DevOps engineer looking to shape the future of cloud platform services? Here's your chance to join a new team at an early stage and make a significant impact on systems and services offered to customers and partners in EMEA and beyond.
The Role:
As a Site Reliability Engineer, you'll be responsible for critical areas within the Cloud Platform, including operations, availability, performance, monitoring, change management, emergency response, security, and capacity planning.
Key responsibilities include:
- Designing, implementing, and maintaining robust, scalable, high-quality software and systems within the SRE domain.
- Managing and supporting in-house development systems, CI/CD pipelines, tools, monitoring, and alerting, with a focus on automation to streamline activities and reduce toil.
- Collaborating closely with developers and architects to ensure designed solutions meet non-functional requirements such as availability, performance, security, and maintainability.
- Leading incident management, post-mortem reviews, and continuous improvement initiatives, contributing to the evolution of processes and systems within the organization.
- Defining key metrics and technical decisions driving products and their delivery, including SLOs and SLAs, architecture, best practices, and cost optimization.
- Taking responsibility for complex project tasks, striving for higher standards of individual and team performance.
- Building relationships external to the team and driving knowledge sharing across all products.
- Identifying personal development opportunities, setting goals, and delivering impactful contributions, while also mentoring and training other engineers throughout the company.
The Person:
To excel in this role, you should possess:
- 3-5 years of experience using AWS platform and services.
- Proficiency with common tools and technologies used within CI/CD and Build pipelines, including Terraform, Jenkins, Gitlab, Nexus, Ansible, Maven, Docker, and Helm.
- A BSc in an IT-related field or equivalent professional experience in cloud operations and/or cloud platforms in a DevOps engineering or SRE role.
- Familiarity with commonly used operating systems including Ubuntu and Windows, along with scripting experience in Bash, Python, or PowerShell.
- Experience troubleshooting issues in a cloud environment and working with multiple teams to facilitate orderly project and release plans.
- Familiarity with VMWare vSphere (ESXi, vCenter) desirable.
We're looking for candidates with:
- Extensive experience across a wide range of AWS services, including compute, containerization, storage, database, networking, automation, IAM, security, monitoring, logging, backup, and configuration management.
- Previous experience in an SRE team and understanding of SRE principles.
- Experience in backup and restore processes, cloud-based multi-tenancy, emergency response, and on-call duties.
- Understanding of current best practices around Security Management, patching, branching strategy, release management, and Linux administration.
- Experience working in an agile development team using SCRUM methodologies.