# ML Engineer

**Company:** Pruna AI
**Department:** Engineering
**Location:** Munich 
**Remote status:** Hybrid

👋 About us

At Pruna, we’re on a mission to make AI more efficient to build a better future.

While the focus of Foundational model Labs is scaling up, we aim to level the playing field by building AI models that are as accessible as possible.

After years of research on efficient ML, we decided that the best way to spread our impact was to take it into our own hands. Each of us cares deeply about empowering people to maximize their impact while minimizing their carbon footprint.

🔍 Role Description

As an **ML Engineer** at Pruna AI, you will bridge the gap between cutting-edge research and real-world application. Your mission is to identify the most promising AI models released by the community and industry, apply a combination of internal and external efficiency methods and to make them more efficient, and deploy them to be used by end users.

You’ll be at the forefront of operationalizing our research, ensuring that users can benefit from state-of-the-art models without the heavy costs of deployment. This is a hands-on role combining deep ML expertise with practical engineering skills.

### What you’ll do:

- **Model Optimization**

- **Deployment & Delivery**

- **Customer & Partner Engagement**

🌟 Your Skills

We would love to see:

### Educational background or Experience

- B.Sc., M.Sc./Ph.D. in Computer Science, Machine Learning, or related fields—or equivalent industry experience.

- Exceptional performance academically.

- Demonstrated experience working with modern AI models (e.g., transformers, diffusion, multimodal architectures,…).

### Machine Learning Expertise

- Strong foundations in deep learning and applied ML.

- Expertise in **PyTorch** and **Python**.

- Familiarity with model deployment workflows (Cog, Litserve, vLLM, etc.).

### Engineering & Deployment

- Experience taking ML models from research to production in real-world environments.

- Understanding of performance benchmarking, profiling, and hardware-aware optimization.

- Comfort with neo cloud platforms (Replicate/Runpod/Modal), or legacy clouds (AWS/Azure/GCP) and containerization (Cog, Docker…).

### Evaluation Skills

- Strong understanding of benchmarking tools and frameworks for both quality and efficiency.

- Experience translating evaluation metrics into actionable engineering trade-offs.

### Personal Attributes

- Strong sense of ownership and accountability.

- Ability to thrive in ambiguous, fast-moving environments.

- Clear communication skills to bridge research and customer needs.

- Passion for making AI both impactful and sustainable.

### Bonus Points

- Experience with compression methods (quantization, pruning, distillation, compilation).

- Knowledge of lower-level optimization frameworks (Triton, CUDA, C++).

- Prior experience in forward-deployed engineering or customer-facing ML roles.

## ⚖️ Expected Salary

💸 **Salary** : We pay top market rates based on seniority and location, leveraging publicly available data that we share with you during the process.   
  
🌞 **Benefits** : Meal vouchers, health & wellness solutions, mobility, travel policy to visit fellow Pruners and a remote stipend for your home workspace.

## 🛤️&nbsp;&nbsp;Recruitment process

The recruitment process consists of 4 interviews to check the expectations, technical skills and team/culture fit of the candidate.

**1. Intro Call -** We have a chat to get to know you, discuss your expectations, and give a feeling of who we are. _[~1 hour call]_

**2. Foundations -** We test the foundational knowledge that are important for the role you applied for (e.g. real world optimisation task). _[~2/3 hours preparation + ~1 hour call]_

**3. Challenge -** Depending on the role you applied for, we will dive together into a task that is representative of the work that you would be doing at Pruna AI.&nbsp;_[~2/3 hours preparation + ~1 hour call]_

**4. Meet the team -** You will have the chance to meet the team and get to know better the everyday life at Pruna AI.&nbsp;_[~1 hour call]_

_Accessibility note: To ensure that everybody who is interested in joining Pruna AI&nbsp;has equal opportunity and ability to start that journey, we have made sure our hiring process is efficient, flexible, and accessible. From the application to interviews, our team will adapt to your needs and what works best to help you show your best._

## 💜 Our Values

We care deeply about the organization we are growing to achieve our goal of making&nbsp;AI accessible and sustainable. There's many ways we could manage our people and&nbsp;work, and it will evolve over time, however we wanted to share the following main aspirations we want to uphold:

##### 🧠&nbsp; Decide Wisely

Make rational, customer-focused decisions based on collected insights and experiences.

##### 🤝&nbsp; Trust by Default

Assume good intentions, communicate transparently and precisely, and create a safe space for collaborating.

##### 🌍&nbsp; Foster Inclusion

Build an enjoyable and supportive workplace that values integrity, personal growth, diverse backgrounds and perspectives. [Read our Code of Conduct for more info :)](https://careers.pruna.ai/posts/code-of-conduct)

##### 🌱&nbsp; Grow Together

Provide actionable feedback and credit to strengthen teamwork and collaboration.

##### 🚀&nbsp; Learn Relentlessly

Embrace adaptability in a fast-moving landscape to drive innovation and efficiency.

[Apply for this position](https://careers.pruna.ai/jobs/6569302-ml-engineer/applications/new)