About us ECDB – Shaping the Future of eCommerce with Data! At ECDB, we firmly believe that data determines success in eCommerce. That’s why we provide leading companies like Amazon, Google, and PayPal with the most precise analyses and market insights. With billions of transactions as our foundation, we are developing one of the most comprehensive eCommerce data platforms worldwide. Our team of over 50 experts combines cutting-edge technology with deep industry knowledge – and this is where you come in! If you're eager to shape the future of eCommerce through data-driven insights, ECDB is the perfect place for you. Tasks Own large-scale data processing pipelines, including batch processing of raw, unstructured data Design and optimize distributed compute workloads to transform large-scale web and natural language data into structured, production-ready datasets Train, deploy and monitor ML-models at scale (e.g., NLP models, classifiers and enrichment use-cases) Productionize models: Batch inference & retraining pipelines Implement AI-assisted pipelines (e.g. LLM-based classification or extraction) Requirements Several years of relevant professional experience (4 years) Proven track record in python-heavy data processing Prior experience with distributed compute frameworks (Spark / Dask / Ray) on object-storage based datasets (e.g., Parquet on S3-compatible storage Practical ML experience (training, evaluation, deployment, retraining) Ability to work with messy, large-scale data and turn it into reliable outputs Benefits Attractive career opportunities in a rapidly growing company Short decision-making processes and plenty of room for personal responsibility An ambitious, open-minded team with a passion for smart solutions A strong focus on continuous learning and development Flexible working hours, the option to work from home, and a healthy work–life balance A modern office in Hamburg’s historic Speicherstadt, offering a unique atmosphere