Collaborate with data engineering and platform teams to maintain, operate, and improve data infrastructure on an on-prem Cloudera cluster.
Monitor and operate key data science pipelines relying on Apache Spark.
Install and manage software environments required by the data science team.
Support network optimization and collaborate with infrastructure and datacenter teams.
~ 5+ years of experience in a DevOps, DataOps, SRE, or similar data infrastructure role.
~ Strong foundation in software development and automation, with experience in Python, Bash, or similar scripting languages.
~ Experience working with big data technologies like Hadoop, HDFS, Impala, Hive, Kafka, Airflow, or NiFi.
~ Comfortable with Linux-based systems, version control, and CI/CD practices.
~ Solid understanding of distributed systems, relational databases, and data modeling.
Benefits and Details:
~ Modern Braunschweig office with cafeteria (relocation supported)
~ Annual contribution to capital-forming schemes (pension, savings, etc.)
~ 30 vacation days
~ Employee discounts on company products