loader image

Senior Site Reliability Engineer

  • Prishtinë
  • This position has been filled

Our Partner, a Software Company based in US, offering end-to-end solutions in Digital Transformation, Digital Consulting and
Business Process Services – supporting all Tech Stacks. Collectively we service a multitude of clients across industries and company verticals. We are a culmination of some of the brightest Full Stack Developers, Data Engineers, Architects, Project Managers, Quality Analysts, Strategists, spanning across multiple time zones.

We are recruiting for them an experienced Senior Site Reliability Engineer to join their team in Kosovo office.

This role focuses on enhancing the reliability, availability, and performance of our cloudbased services and infrastructure. Leveraging advanced automation, monitoring, and DevOps practices, you’ll ensure our systems meet the highest standards of security, efficiency, and scalability.

You’ll work closely with cross-functional teams to drive improvements, manage incident responses, and support the
continuous delivery of high-quality software solutions.
We are looking for engineers that can help us develop our SaaS infrastructure capabilities.

This person will assist us in building the next generation of our multi-tenant, scalable infrastructure.
The successful candidate should be able to contribute to all phases of the architecture lifecycle including
specification, design, implementation, and maintenance. You must be willing to learn about the discovery
industry and quickly integrate new technologies into your repertoire.

Responsibilities

  • Ensuring the reliability, availability, and performance of systems and services by implementing monitoring, incident response, and post-incident analysis.
  • Collaborating with development and operations teams to design, implement, and maintain scalable infrastructure and services that meet performance and capacity requirements.
  • Developing and maintaining automation tools, scripts, and frameworks to streamline operational tasks, deployment processes, and monitoring systems.
  • Responding to and resolving incidents, performing root cause analysis, and implementing preventive measures to minimize the impact of future incidents.
  • Setting up and maintaining monitoring and alerting systems to detect and respond to performance issues, anomalies, and service disruptions.
  • Identifying performance bottlenecks, conducting performance tests, and implementing optimizations to improve system performance and efficiency.
  • Analyzing usage patterns, forecasting resource requirements, and collaborating with teams to ensure adequate capacity for current and future needs.
  • Implementing and maintaining security measures, vulnerability management, and compliance requirements to protect systems and data.
  • Collaborating with cross-functional teams, including developers, operations, and other stakeholders, to promote a culture of reliability and effective communication.
  • Creating and maintaining documentation, runbooks, and knowledge base articles to ensure the availability of up-to-date information for troubleshooting and incident response.
  • Identifying areas for improvement, conducting post-incident reviews, and driving initiatives to enhance system reliability, performance, and operational efficiency.

Requirements and Skills

  • 5 – 8 years of experience with infrastructure automation on a DevOps/DevSecOps Team.
  • BS or MS in Computer Science, or equivalent coursework.
  • Experience with container templatization/orchestration frameworks such as Helm, ArgoCD, etc.
  • Experience with CI/CD tools such as GitHub Actions.
  • Experience maintaining and developing production Infrastructure as Code deployments.
  • Experience in a scripting language, preferably Python, but Ruby or Bash also work. Experience with Linux and Windows architectures.
  • Experience working with AWS.
  • Experience deploying and maintaining Kubernetes clusters.
  • Experience with version control systems like Git.

Hiring Policy
This job description may evolve over time. Our Partner is dedicated to diversity and inclusion, ensuring a fair workplace for all, regardless of race, color, religion, gender, national origin, age, disability, or any other protected status.

Work model: Hybrid

Apply: [email protected]