Engineering Manager - Machine Learning (d/f/m)
Posted on April 22, 2026
Why us?
We believe that AI can revolutionize the diagnosis and treatment of cancer and other complex diseases. Aignostics is a spin-off from Charité with teams in Berlin and New York, backed by over $50M in funding and working with academic and life-science partners.
This is a hands-on leadership role in which you will lead a high-performing team building large-scale distributed training infrastructure and workflows for digital pathology and foundational model development. Expect to spend around 50% of your time contributing technically while owning team leadership and roadmaps.
Where your expertise is needed
People & Team Leadership
- Build and scale a high-performing team for distributed ML challenges
- Own the full employee lifecycle: recruiting, onboarding, performance management, career development, and retention
- Mentor engineers and foster a culture of continuous learning and psychological safety
- Create an inclusive environment where diverse perspectives drive innovation
Strategic & Operational Management
- Define and execute technical roadmaps aligned with company objectives
- Lead resource allocation and capacity planning
- Own FinOps responsibilities: optimize cloud costs and track spending
- Ensure operational readiness through monitoring and incident response practices
- Establish and track KPIs for team performance and system health
Technical Leadership
- Design, develop, and maintain large-scale distributed training pipelines and ML infrastructure
- Lead architecture decisions for distributed systems enabling efficient model development at scale
- Contribute hands-on to critical technical challenges, including optimizing training pipelines and infrastructure
- Drive technical excellence through code reviews and architectural guidance
- Keep current with distributed training technologies and bring innovation to the team
Cross-functional Collaboration
- Partner with Product teams to translate business requirements into technical solutions
- Collaborate with Research Scientists to enable scalable model development and experimentation
- Work with Platform Engineering to ensure robust infrastructure and tooling
- Build relationships across engineering teams to drive alignment and knowledge sharing
- Communicate technical concepts effectively to technical and non-technical stakeholders
What we are looking for
Required Skills
- Bachelor's or Master's degree in Computer Science, Engineering, Mathematics, or related field
- 6+ years of software or ML engineering experience, with at least 2 years in a technical leadership role
- Proven track record building and leading high-performing engineering teams and guiding projects across the SDLC
- Deep understanding of ML concepts and model optimization techniques (distillation, graph optimization, quantization, etc.)
- Significant experience with large-scale distributed training systems and frameworks (especially PyTorch and NCCL); familiarity with GPUs, distributed systems and parallel computing
- Advanced Python skills; experience in C/C++ or CUDA is a plus
- Familiarity with MLOps/DevOps practices: CI/CD, Docker, Kubernetes, observability; cloud platforms (GCP, AWS or Azure) and infrastructure-as-code
- Experience with Linux, version control, and container technologies
- Demonstrated ability in resource allocation, capacity planning, and FinOps principles
- Excellent problem-solving and data-driven decision-making skills
Leadership & Soft Skills
- Effective communication and stakeholder management
- Ability to give constructive feedback and navigate difficult conversations
- Proven people leadership with experience managing the full employee lifecycle
- Strategic thinking balancing short-term execution and long-term vision
- Experience with agile methodologies and iterative development
- Ability to influence without authority and build consensus
- Track record of empowering team members and fostering autonomy
Ideally, you also have
- Experience with regulated or healthcare production systems and medical device standards (ISO 13485)
- Experience working with biomedical or image data
- Hands-on experience with Google Kubernetes Engine, SLURM and Ray
- Experience with an advanced ML stack (TorchDynamo, JAX, TensorRT)
- Familiarity with Information Security standards (ISO 27001)
- Experience with FinOps tools and cloud cost optimization
- Experience leveraging LLM/agentic systems to accelerate development
Our offer
- Purpose-driven startup working to improve cancer outcomes
- Cutting-edge AI research with involvement from Charité and TU Berlin
- Diverse and international team
- Opportunity to shape technical direction and grow into broader leadership roles
- Learning & Development yearly budget of 1,000€ plus 2 L&D days, language classes and internal programs
- Leadership development programs and executive coaching
- Flexible working hours and teleworking policy
- 30 paid vacation days per year
- Family and pet friendly with flexible parental leave options
- Choice of one subsidized membership: public transport, sports, or well-being
- Social gatherings, lunches and off-site events
- Optional company pension scheme
About Aignostics
We are an international, interdisciplinary team powering the next generation of precision medicine and advancing AI and pathology. Founded in 2020, we now have 120+ coworkers.
How to apply
To apply for this role, click the "Apply for this job" button on the job page. The application form will load on the site.
Application notes
If you have questions, visit the company's careers page at https://www.aignostics.com/company/careers or use the job URL: https://aignostics.teamtailor.com/jobs/6946429-engineering-manager-machine-learning-d-f-m