Role Description
As a Senior Machine Learning Engineer at warmwind, you will push the limits of AI by designing, training, and scaling state-of-the-art Large Language Models (LLMs) and advanced reinforcement learning (RL) systems. We are looking for an exceptional expert who has already built and deployed large-scale LLMs and understands every detail of the process—from tokenization to training at scale.
Your work will drive our next-gen AI models, shaping the future of machine intelligence beyond traditional paradigms. You will work with massive compute clusters (500+ H100 GPUs) and cutting-edge reinforcement learning techniques to create highly efficient, scalable, and groundbreaking AI systems.
Responsibilities
* Design, train, and optimize Large Language Models (LLMs) from scratch
* Scale distributed training on massive GPU clusters (500+ H100 GPUs)
* Implement advanced reinforcement learning techniques (RLHF, adversarial self-play, real-time control)
* Develop high-performance architectures for multi-modal AI systems
* Build simulation environments for RL-based AI agents
* Optimize inference speed and efficiency for real-world deployment
* Collaborate with top AI researchers to push the boundaries of machine learning innovation
Your Profile:
Must-haves
* Deep expertise in LLMs – you've built and trained large-scale models yourself
* Experience with large-scale distributed training on 500+ GPU superclusters
* Deep understanding of reinforcement learning, neural network optimization, and self-play methods
* Expert in PyTorch, TensorFlow, JAX & low-level optimization techniques (CUDA, Triton, DeepSpeed, etc.)
* Familiarity with high-performance computing (HPC, NVLink, InfiniBand, parallel computing)
* Strong publication track record in AI/ML research is a plus
* Relocation to Jena, Germany after initial onboarding
Company Culture and Work Style
We operate in a dynamic startup environment where speed, efficiency, and innovation are key to achieving our goals and growing together. Our development process is based on rapid iterations, allowing us to quickly implement and test ideas to enhance our product and meet user needs.
What we offer:
* Innovation Opportunities:
Work on cutting-edge technology and help shape the technical direction of our product.
* Impact:
Your contributions will directly influence the user experience and the success of our platform.
* Startup Atmosphere:
Flat hierarchies, direct communication, and a real opportunity to create something very big.
* Fair Compensation:
Performance-based payment with the opportunity to participate in the growth through success.
* Flexible Work Conditions with Structure:
We offer you high flexibility in shaping your workday—provided tasks and goals are met, you're free to design your workflow. At the same time, we value efficient collaboration during core working hours to move projects forward and facilitate quick discussions.
Language requirements:
* German or English