Ad online since: 16 June
Job responsibilities
Westhouse is one of the leading international recruitment agencies for the procurement of highly qualified experts in fields such as IT lifecycle management, SAP, engineering, commerce and specialist consultancy.
For our client, we are currently looking for an Operations & Support Expertise - Storage (m/f/d) specialist, based in Frankfurt (50%) and remote.
Your tasks
Provide Tier-3 operational ownership for Storage Products for Local Production (DE).
Handle complex incidents, deep troubleshooting, and root cause analysis; drive permanent fixes and preventive measures.
Ensure operational readiness for storage changes:
  Monitoring/alerting coverage, performance baselines, hardening, patch strategy, rollback and recovery procedures, runbooks.
Execute and improve standard operational procedures through automation (reduce toil, improve MTTR and stability).
Automate standard operational tasks (capacity checks, validation procedures, provisioning workflows where applicable).
Ensure operational readiness for deployments:
  Validation of deployment artifacts from an operations perspective.
  Defining and enforcing quality assurance measures (e.g. required documentation of standard operation procedures, successful test reports, …) to ensure the high quality of delivered products and services.
  Ensuring rollback strategies and operational monitoring (observability) are in place for production deployments.
Ensure operational stability and responsiveness for the managed Kubernetes platform:
  Monitoring system health, performance metrics, and service availability across multi-tenant environments.
  Identifying, analyzing, and resolving incidents, minimizing service disruption.
  Triggering root cause analysis and implementation of corrective and preventive actions.
Reduce operational toil and improve service reliability:
  Address recurring operational issues by automating remedial standard operations processes.
  Validate all automated procedures following the established software development lifecycle, including staging, testing, and validation reviews.
Ensure platform operations adhere to security and compliance standards:
  Implementing monitoring and logging strategies to support audit and compliance requirements.
  Performing routine security scans and remediating identified vulnerabilities.