Site Reliability Engineer in Playbypoint

FULL_TIME

  Remote | Senior | Full time | SysAdmin / DevOps / QA

8 applications
Replies between 1 and 7 days
Last checked today

Playbypoint provides an integrated POS and management software solution tailored for sports clubs. Our platform streamlines operations, enhances customer engagement, and empowers clubs to focus on what they do bestโ€”delivering exceptional sports experiences. As we scale our technology, we seek a proactive SRE to ensure our systems remain robust, scalable, and secure.

Job functions

As a Site Reliability Engineer at Playbypoint, you will be part of an emerging Infrastructure Team that is vital for maintaining the health and performance of our production systems.

You will collaborate with our development teams to support our applications, manage and scale data workflows, and orchestrate containerized workloads using Kubernetes. Your expertise will help us deliver seamless service to our clients across the racquet sports community.

Infrastructure Reliability & Automation: Develop and maintain monitoring, alerting, and incident response systems.Implement Infrastructure-as-Code practices using tools like Terraform or Ansible to manage our cloud environments.

Application & Database Support: Collaborate closely with development teams to deploy, monitor, and troubleshoot Ruby/Ruby on Rails applications. Optimize and monitor MySQL databases with failover and performance-tuning strategies.

Container Orchestration & Cloud Management: Oversee Kubernetes clusters to ensure efficient deployment, scaling, and management of containerized applications. Manage CI/CD pipelines to enable continuous delivery of high-quality software.

Qualifications and requirements

Experience:

  • Proven experience (minimum of 3 years) in an SRE, DevOps, or similar operational role within a dynamic tech environment.
  • Proven experience in solving problems at scale, minimizing bottlenecks, and improving efficiency at both technical and process levels.

Technical Proficiency: Proven experience with Kubernetes and container orchestration.Competency in setting up CI/CD pipelines and using Infrastructure-as-Code tools (e.g., Terraform, Ansible).Experience with monitoring tools such as Prometheus, Grafana, or Datadog.Familiarity with cloud platforms (AWS, GCP, or Azure) is a plus. Familiar with deploying, monitoring & optimizing web applications.

Soft Skills: Excellent problem-solving abilities and strong communication skills. A proactive mindset with a commitment to continuous improvement. Ability to thrive in a remote, collaborative, and fast-paced environment.

Desirable skills

  • Previous experience of scaling/deploying applications built-in web frameworks like Django or Ruby on Rails
  • Proven experience going deep into problems that involve distributed systems and/or low-level optimizations.

Conditions

Fully remote You can work from anywhere in the world.
Flexible hours Flexible schedule and freedom for attending family needs or personal errands.
Digital library Access to digital books or subscriptions.
Company retreats Team-building activities outside the premises.
Computer provided Playbypoint provides a computer for your work.
Education stipend Playbypoint covers some educational expenses related to the position.

Remote work policy

Fully remote

Candidates can reside anywhere in the world.

About Playbypoint

Building the next generation of software and connecting the world of sports ๐Ÿ˜Ž — Playbypoint's full profile