* You bring a wealth of experience (ideally 5+ years) in developing data-driven solutions and navigating the journey of moving complex models into production.
* You believe that great tech is built by great teams. You have experience guiding others, conducting code reviews, and fostering an environment of psychological safety and technical growth.
* You have a deep proficiency in Python, PySpark, and Databricks. You enjoy the mathematical “why” behind Machine Learning and have specific experience in areas like Pricing, Forecasting, or Marketing modeling (e.g. Attribution).
* You treat Data Science as software engineering, driving excellence through CI/CD (Azure DevOps, GitHub Actions), version control, and modular coding.
* You excel at translating complex requirements into technical strategies and actively exchange knowledge to scale best practices.
Introduction text
Schwarz Digits creates the technological foundation for digital sovereignty in Europe. As the IT and digital division of the Schwarz Group, we develop and manage the IT infrastructures for the retail divisions Lidl and Kaufland, as well as Schwarz Production and PreZero. At the same time, we operate as an independent provider in the external market to support companies across Europe in their digital transformation. We bundle our core services in the areas of Cloud, Cyber Security, Data & AI, Communication, and Workspace.
Join us and contribute to digital sovereignty in Europe. With us, you will work at the intersection of agility and security: You will benefit from fast decision-making processes, enjoy genuine creative freedom in your projects, and be able to build upon the stable foundation of the Schwarz Group.
Your tasks
* You champion code quality and development standards while providing thoughtful mentorship to junior and mid-level data scientists.
* You design and own the development of complex, scalable ML/Deep Learning models focusing on predictive pricing, algorithmic optimization, advanced campaign modeling, and personalized recommendations.
* You manage the entire production lifecycle from research and MLflow logging through to deployment and maintenance.
* You drive CI/CD automation using Azure DevOps, GitLab, and Databricks Asset Bundles, while designing high-performance PySpark ETL and data validation frameworks in cloud environments.
* You ensure model reliability through rigorous testing, drift detection, and real-time monitoring, collaborating cross-functionally with Engineers and Analysts in a modern cloud environment (Azure, Databricks, Google Cloud) using Python and PySpark.