We are seeking a Senior AI Platform Operations Expert to optimize and monitor AI platforms on Azure and Google Cloud environments.
You will work closely with clients to provide expert consultation and help design operational services for AI platforms like DIAL and Mistral.
Apply now to contribute your expertise in AI platform operations and cloud technologies.
We accept CVs in English only.
Responsibilities
- Operate AI platform infrastructure on Azure and Google Cloud environments
- Monitor token consumption and access across AI platforms
- Ensure functionality of custom solutions built for platform operations
- Consult clients on software and infrastructure details of AI platform services
- Design and build services to support AI platform operations
- Internalize and apply the context of the AI platform services provided to users
- Collaborate with cross-functional teams to optimize platform performance
- Analyze operational data to identify improvement opportunities
- Maintain up-to-date documentation of platform operations and procedures
- Support deployment and scaling of AI platform components
- Troubleshoot and resolve operational issues promptly
- Provide training and guidance to junior team members
- Stay current with emerging AI platform technologies and best practices
Requirements
- 3+ years of experience with AI platform operations on Azure and Google Cloud
- Strong knowledge of Azure Kubernetes Service (AKS) and Databricks
- Proven ability to monitor and manage cloud infrastructure and resources
- Experience consulting clients on software and infrastructure solutions
- Analytical skills to interpret operational data and metrics
- Problem-solving skills for troubleshooting platform issues
- Ability to design and implement scalable operational services
- Strong communication skills to interact with clients and teams
- Experience working with token management and access control in cloud environments
- Ability to internalize complex platform contexts and translate them into operational strategies
- Upper-Intermediate English (B2+) proficiency
Nice to have
- Experience with AI platform tools such as DIAL and Mistral
- Python programming skills
- Experience with analytical tools for cloud resource management
- Knowledge of AI platform monitoring and alerting tools
We offer
- Learning Culture - We want you to be the best version of yourself, which is why we offer unlimited access to learning platforms, a wide range of internal courses, and all the knowledge you need to grow professionally
- Health Coverage - Health and wellness are important, which is why we cover you and up to four family members under a premier health plan. We have a couple of options, so you can choose what is best for you and your family
- Visual Benefit - Seeing your work for us would be a sight for sore eyes. We want your vision to always be at 100%, which is why we offer up to $ COP for any visual health expenses
- Life Insurance Plan - We have partnered with MetLife to offer a full-coverage life insurance plan, so your family is covered even if you are gone
- Medical Leave Coverage - We are one of the few companies that cover 100% of your medical leave, for up to 90 days. Your health is the most important thing to us
- Professional Growth Opportunities - We have designed a highly competitive and complete development process, where you will have all the tools to get where you have always wanted to be, personally and professionally
- Stock Option Purchase Plan - As an EPAMer you can be more than just an employee, you will also have the opportunity to purchase stock at a reduced price and become a part owner of our organization
- Additional Income - Besides your regular salary, you will also have the chance to earn extra income by referring talent, being a technical interviewer, and many more ways
- Community Benefit - You will be part of a worldwide community of over 50,000 employees, where you can learn, challenge yourself, stand out, and share your knowledge and experience with multicultural teams
Please note that even though you are applying for this position, you may be offered other projects to join within EPAM.
EPAM is a leading global provider of digital platform engineering and development services.
We are committed to having a positive impact on our customers, our employees, and our communities.
We embrace a dynamic and inclusive culture.
Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow.
No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.