Become our new Cloud AI Architect (m/f/d)– and shape the future of data-driven platforms!
As a Cloud AI Architect, you are responsible for the hands-on design, implementation and stable operation of AI and data-driven systems in complex enterprise environments.
You work AI-first, close to production and with high ownership : from the initial architecture decision to deployment strategies, stability, security and cost control in live operation.
This role is not purely conceptual or advisory. You make concrete technical decisions, implement them yourself, and take responsibility for ensuring that agentic systems, data platforms, and AI workloads run reliably, scalably, and economically .
Your tasks & responsibilities
AI-First Platform & Operational Architecture
You follow a consistent AI-first engineering approach .
Platforms and operating models are designed from the outset to meet the specific requirements of LLMs, agentic systems, and AI workloads .
This includes:
* Development and enhancement of cloud platforms on Microsoft Azure for AI and data systems
(compute, storage, network, identity, tooling)
* Deployment and operation of Databricks as a central platform for data engineering, machine learning and AI workloads
* Architecture of agent orchestration, runtime environments, control planes and tool integrations
* Ensuring that platforms are production-ready, scalable, observable, and operationally manageable.
DevOps, deployment & release management
You are responsible for the technical implementation and stable operation of AI, agent and data systems throughout their entire lifecycle.
* Building and operating CI/CD pipelines for AI, agent, and data components in Azure
* Definition and implementation of release and update strategies (e.g., canary releases, versioning, controlled agent updates)
* Reproducible deployments and clean rollbacks, especially for Databricks and AI workloads
* Close collaboration with engineering managers and product managers on architectural and operational decisions.
Stability, Security & Governance
You will be responsible for the safe and stable operation of business-critical systems.
* Responsibility for availability, performance, fault tolerance and incident handling
* Implementation of Security by Design in Azure and Databricks environments, including:
o Role and authorization concepts (Azure IAM)
o Secrets Management
o Network and system isolation
o Audit and compliance requirements
* Development and enforcement of governance rules for agentic systems (access, guardrails, policies, control mechanisms)
Cost, Performance & Scalability
You ensure that technical excellence and economic efficiency go hand in hand.
* Transparency and active control of operating costs in Azure and Databricks (compute, storage, token costs, latency)
* Design of architectures that scale with growing data volumes and agent networks.
* Evaluation of technical and economic trade-offs together with the Engineering Manager and Product Strategist
Production launch & operation
You accompany systems all the way to productive enterprise operation – and beyond.
* Responsibility for go-live preparation, stabilization and transition to regular operation
* Creation of runbooks, operational documentation, and architecture decision records
* Preparing for handover to customer IT or internal operations teams
* Technical contact person in critical project and operational situations
What we are looking for
Core Profile
* Highly experienced, hands-on engineer specializing in cloud platforms (Microsoft Azure), DevOps, and enterprise operations.
* Proven productive experience with Azure and Databricks in enterprise environments
* Experience with AI and data workloads in production environments (not a purely infrastructure or conceptual role)
* Strong AI-first mindset with a clear understanding of stability, security, and operations.
AI & Data Engineering
* Experience with LLM-based systems and agentic architectures
* Understanding of ML lifecycle concepts (training, inference, monitoring)
* Architecture of agentic systems including guardrails, policies and control mechanisms
Cloud, DevOps & Platform
* Very good practical experience with:
o Microsoft Azure (compute, networking, storage, IAM, security)
o Databricks (data engineering, ML & AI workloads)
* CI/CD pipelines and infrastructure as code (e.g. Terraform, Bicep)
* Monitoring, logging and observability in the enterprise environment
Security & Governance
* Cloud Security Patterns
* Identity & Access Management
* Compliance and audit requirements in enterprise environments
Soft Skills & Working Method
* Excellent communication skills:
You explain technical decisions, risks and costs clearly – both internally and to customers.
* Strong business and product understanding
* Forward deployed mindset :
* You enjoy working closely with customers and taking responsibility in real project situations.
* Strong ownership mindset, pragmatism, and implementation skills
Language skills
* Fluent and negotiation-level German and English skills
Why you're exactly right for us
* Demanding data and AI projects using state-of-the-art technologies (Microsoft Azure, Databricks, modern AI platforms)
* High-performance culture with a lot of responsibility and creative freedom
* Steep learning curve & development opportunities, including Azure & Databricks certifications
* Hands-on engineering culture with a strong team spirit
* Close collaboration with customers, partners and the broad AI ecosystem