Machine Learning Engineer, Fast Optimized Inference – EMEA Remote

About the role:

As a Machine learning Engineer, you work mainly on creating great libraries highly focused on real world ML use cases. We’re building on top of our open-source to create more specialized code with a focus on industrial level of usage.

We are searching for someone who brings fresh ideas, demonstrates a unique and informed viewpoint, and enjoys collaborating with a progressive, nimble and decentralized approach to develop real-world solutions and positive user experiences at every interaction.

Objectives of this role:

Develop specialized software for specific machine learning (ML) use cases that have broad applications, similar to [text-generation-inference](https://github.com/huggingface/text-generation-inference).
Utilize existing library frameworks to create scalable software solutions for industrial purposes.
Enhance the reliability, quality, and time-to-market of our software suite. Measure and optimize system performance to stay ahead of customer needs and drive innovation.
Manage the production environment by monitoring availability and ensuring overall system health. We run our own tools

About you:

If you are a passionate Machine Learning Engineer with a keen interest in AI and proficient with Python, Rust and specialized Cuda kernels Frameworks (transformers of course + Keras or PyTorch), we would love to hear from you. Join our team and contribute to the advancement of AI technologies while working alongside talented professionals in a collaborative and stimulating environment.

APPLY HERE!

← Bookkeeper - US Head of Cloud Infrastructure and Operations →