DevOps Engineer

Mistral AI

Mistral AI

Software Engineering
Paris, France
Posted on Tuesday, April 2, 2024
We are seeking our first DevOps Engineer.
Responsibilities
- Collaborate with AI/ML engineers and researchers to develop and implement a CI/CD that enables safe and reproducible experiments
- Enable seamless replication of work environment across several HPC clusters
- Implement and maintain monitoring, logging and alerting systems for both our large training runs and our client-facing APIs
- Make sure training environments are always available and ready on several clusters
- Improve development processes while finding the right balance between rigor, speed and flexibility for software development & research organization
- Develop and own internal tooling
- Collaborate with our AI/ML engineers and data scientists to build and maintain a secure, scalable, and efficient infrastructure.
- Develop and implement CI/CD pipelines to streamline the evaluation and development of AI/ML models and other applications.
- Ensure compliance with security best practices and industry standards.
- Work closely with the development team to troubleshoot and resolve issues in production environments.
- Develop and maintain containerization and orchestration systems using tools like Docker and Kubernetes.
- Document processes and procedures to ensure consistency and knowledge sharing across the team
About you:
- Master’s degree in Computer Science, Engineering, or a related field, or equivalent experience.
- 3+ years of experience in a DevOps role, preferably in an AI/ML-focused environment.
- Strong experience with Kubernetes-based cloud computing
- Proficiency in scripting languages such as Python, Bash, or PowerShell.
- Experience with CI/CD tools like Jenkins, GitLab CI, or CircleCI.
- Experience with containerization and orchestration technologies such as Docker and Kubernetes.
- Strong knowledge of Python development good practices
- Having worked with GPUs before is a + but not required
- Familiarity with infrastructure-as-code tools like Terraform or CloudFormation.
- Knowledge of monitoring, logging, and alerting tools like Prometheus, Grafana, ELK Stack, or Datadog.
- You ideally have an experience in Slurm
- Strong understanding of networking, security, and system administration concepts.
- Excellent problem-solving and communication skills.
- Self-motivated and able to work well in a fast-paced startup environment.
What We Offer:
- Ability to shape the exciting journey of AI and be part of the very early days of one of Europe’s hottest startup
- A fun, young, multicultural team and collaborative work environment — based in Paris and London
- Competitive salary and bonus structure
- Comprehensive benefits package
- Opportunities for professional growth and development
We're a small team, composed of seasoned researchers and engineers in the AI field. We like to work hard and be at the edge of science. We are creative, low-ego, team-spirited, and have been passionate about AI for years. We hire people that foster in competitive environments, because they find them more fun to work in. We hire passionate women and men from all over the world.
Developers are using our API via la Plateforme to build incredible AI-first applications powered by our models that can understand and generate natural language text and code. We are multilingual at our core. More recently, we released le Chat, as a demonstrator of our models.