Playbypoint provides an integrated POS and management software solution tailored for sports clubs. Our platform streamlines operations, enhances customer engagement, and empowers clubs to focus on what they do best: delivering exceptional sports experiences. As we scale our technology, we seek a proactive SRE to ensure our systems remain robust, scalable, and secure.
As a Site Reliability Engineer at Playbypoint, you will be part of an emerging Infrastructure Team that is vital for maintaining the health and performance of our production systems.
You will collaborate with our development teams to support our applications, manage and scale data workflows, and orchestrate containerized workloads using Kubernetes. Your expertise will help us deliver seamless service to our clients across the racquet sports community.
Infrastructure Reliability & Automation: Develop and maintain monitoring, alerting, and incident response systems. Implement Infrastructure-as-Code practices using tools like Terraform or Ansible to manage our cloud environments.
Application & Database Support: Collaborate closely with development teams to deploy, monitor, and troubleshoot Ruby/Ruby on Rails applications. Optimize and monitor MySQL databases with failover and performance-tuning strategies.
Container Orchestration & Cloud Management: Oversee Kubernetes clusters to ensure efficient deployment, scaling, and management of containerized applications. Manage CI/CD pipelines to enable continuous delivery of high-quality software.
Experience:
Technical Proficiency: Proven experience with Kubernetes and container orchestration. Competency in setting up CI/CD pipelines and using Infrastructure-as-Code tools (e.g., Terraform, Ansible). Experience with monitoring tools such as Prometheus, Grafana, or Datadog. Experience deploying, monitoring, and optimizing web applications. Familiarity with cloud platforms (AWS, GCP, or Azure) is a plus.
Soft Skills: Excellent problem-solving abilities and strong communication skills. A proactive mindset with a commitment to continuous improvement. Ability to thrive in a remote, collaborative, and fast-paced environment.
Fully remote
Candidates can reside anywhere in the world.