Site Reliability Engineer
As a Site Reliability Engineer, you are responsible for ensuring the reliability, scalability, and availability of our SaaS platform.
Weaviate is a global remote-first startup, with teams hailing from many different parts of the world, where it is not uncommon for someone to work remotely from fun places (like a camping site or a beach in Miami). While this gives you the freedom and flexibility to work from anywhere and at any time, we are looking for people who are comfortable working independently, who are proactive and take ownership, and who communicate effectively.
Weaviate is a vector search engine and database that uses machine learning to organize and search data in a completely new way. We believe that the next wave of software infrastructure is AI-first and that a strong open-source community is a basis for creating high-quality software.
About this role
As a Site Reliability Engineer, you are responsible for ensuring the reliability, scalability, and availability of our SaaS platform. You will work closely with the development team to design, build, and maintain the infrastructure that powers our applications and our customers’ managed Weaviate clusters. You will monitor and analyze the availability and performance of our services and implement solutions to enhance them. Lastly, you will lead the on-call rotation to ensure 24/7 availability of our applications.
In this role you are part of the Weaviate Cloud Services (WCS) team. This team builds Weaviate’s managed offering: from provisioning to auto-scaling, from monitoring to building vibrant dashboards, and from pricing integration to user administration. It will never get boring with the exciting challenges of offering a state-of-the-art vector database as a service to our users. The atmosphere in the team is friendly, collaborative, and enabling, with a focus on delivering premium-quality software products iteratively.
What we are looking for
- You are passionate about observing the health of a complex system and automating tools and processes to ensure the reliability of its services.
- You anticipate incidents and prevent them before they happen.
- You have extensive experience in operating and maintaining SaaS platforms on common cloud providers such as AWS, GCP, or Azure.
- You have good software engineering skills and experience with at least one programming language (e.g. Go, Rust, Java, or TypeScript).
- You have a great understanding of distributed microservice systems, including high availability and scalability.
- You are experienced in CI/CD and know how to develop and operate continuously deployed applications in production.
- You are usually available in a time zone between UTC-5 and UTC+2.
- Nice to have: You’re interested in AI-first databases and Machine Learning workflows (e.g. continuous training).
What we offer
- 100% remote with flexible work hours.
- Competitive compensation, including paid time off.
- Budget available to spend on going to conferences, co-working space, home office equipment, etc.
- Work with very experienced and fun team members.
- An atmosphere that encourages learning and personal growth, and that gives you lots of freedom, flexibility, and responsibilities.
- You will work at the forefront of search, ML/AI, and cloud-native technologies - and all of it is open source.
- Department: Weaviate Cloud Services team
- Remote status: Fully Remote
- Employment type: Full-time

About Weaviate
Weaviate is a global remote-first startup, with teams hailing from many different parts of the world, where it is not totally uncommon for someone to work remotely from fun places (like a camping site). At Weaviate we believe that the next wave of software infrastructure is AI-first and that a strong open-source community is a basis for creating high-quality software.