Senior Site Reliability Engineer - Observability (x/f/m)
Posted on January 23, 2026 (8 minutes ago)
About Doctolib
We’re a B Corp company committed to building together the healthcare we all dream of.
More than 2,900 Doctolibers in France, Germany, Italy, and the Netherlands help improve the daily lives of care teams with a new generation of technologies and services.
Doctolib helps over 90 million patients manage their health and access optimal healthcare.
Job description
Doctolib’s Engineering environment is rich and we build innovative products and features to ease doctors' and patients' lives. We are looking for a Senior Site Reliability Engineer to keep Doctolib production systems running smoothly and to support the exponential growth of Doctolib services.
As a Senior Site Reliability Engineer within the Core Reliability & Observability team, you will play a pivotal role in shaping the company’s observability strategy and ensuring our platform remains reliable, debuggable, and scalable. This role sits at the intersection of infrastructure, developer experience, and product engineering, focusing on logging, metrics, tracing, and alerting foundations across the organization.
What you will do
- Lead the observability strategy across the platform, building scalable, developer-friendly logging and tracing capabilities.
- Identify and lead large-scale cross-cutting reliability initiatives, including improvements to incident detection, response, and postmortem analysis.
- Take part in the on-call rotation and improve the on-call experience by refining alerting, reducing noise, and ensuring actionable telemetry.
Who you are
You could be our next teammate if you:
- Have solid hands-on experience (3+ years) on a large-scale production platform.
- Have proven experience with cloud platforms such as AWS, Azure, or Google Cloud.
- Have a solid understanding of containerization and orchestration technologies (Docker and Kubernetes).
- Have strong understanding of Helm for managing Kubernetes manifests and ArgoCD for GitOps workflows.
- Have deep expertise in observability tooling and architecture, including logging (Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector), tracing (OpenTelemetry or APMs), and metrics (Prometheus, Thanos, Datadog, or equivalent).
- Are proficient in at least one programming language (Ruby, Python, Go, Java, etc.) and understand infrastructure-as-code principles.
- Have experience with monitoring and observability tools and enjoy troubleshooting performance issues in complex environments.
- Speak English.
What we offer
- Free health insurance for you and your family.
- Up to 14 days of RTT.
- Parental care program (1 month off in addition to legal parental leave and 0.5 days off per child when school starts).
- Wellbeing program (free mental health and coaching with partner moka.care).
- A flexible workplace policy offering both hybrid and office-based mode.
- Flexibility days allowing work in EU countries and the UK for 10 days per year.
- Lunch voucher with Swile card.
- Work Council subsidy to refund part of sport club membership or creative class.
- Bicycle subsidy.
The interview process
- Recruiter interview
- Technical SRE interview
- System design interview
- Behavioral interview
- Background / reference check
- Offer
Equal opportunity & data privacy
Doctolib evaluates candidates based solely on qualifications and motivation, without discrimination. Applicants are encouraged to exclude personal information (for example pictures, age) from their applications.
If you require any accommodation during the hiring process, please let Doctolib know for support.
For data processing details and inquiries, contact hr.dataprivacy(at)doctolib.com.
How to apply
To apply for this role, use the Apply button on the job page. Complete the application form and submit your CV and any supporting documents requested.
Contact for inquiries
If you have questions about the recruitment process or need an accommodation, contact hr.dataprivacy(at)doctolib.com.