EPAM is a leading global provider of digital platform engineering and development services.
We are committed to having a positive impact on our customers, our employees, and our communities.
We embrace a dynamic and inclusive culture.
Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow.
No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
We are seeking a skilled
Data Platform Operations Engineer
to support the stability, security, performance, and efficiency of our global enterprise data platform.
This role focuses on providing operational coverage within an 8/5 support model under a follow-the-sun approach, ensuring that the platform continues to meet business needs worldwide.
The ideal candidate will have experience with cloud-based data platforms, an operational mindset, and the ability to contribute to performance optimization and cost management.
Responsibilities
- Ensure the secure and stable operation of the enterprise data platform (Snowflake, AWS data stack, dbt, orchestration tools, BI/analytics, etc.)
- Provide operational coverage within an 8/5 support model and assist in responding to major incidents
- Improve monitoring, alerting, and observability solutions for incident detection and resolution
- Carry out platform updates, patching, and configuration management to comply with security and operational standards
- Monitor performance metrics and collaborate on changes to meet shifting business needs
- Use observability platforms to monitor infrastructure, data pipelines, and key services
- Provide monitoring reports and insights to support better operational decisions
- Help implement automation opportunities to reduce manual processes and improve efficiency
- Collaborate on platform enhancements for resilience, scalability, and optimal cost management
- Contribute to infrastructure-as-code and repetitive operational practices
Requirements
- Experience managing cloud-based data platforms for 2+ years (e.g., Snowflake, Databricks, BigQuery, or similar)
- Understanding of cloud environments (AWS) with focus on operations and automation
- Familiarity with monitoring tools (Datadog, Prometheus, Grafana, CloudWatch, etc.)
- Basic knowledge of Infrastructure as Code (Terraform, Ansible) and configuration management
- General understanding of security, networking, and compliance within cloud environments
- Problem-solving skills with a team-focused, service-oriented approach
- Flexibility to work in a global support model and assist with on-call responsibilities
- Effective communication skills to collaborate with technical and non-technical teams
- Commitment to improving operational processes and contributing to platform performance
- English level B1+ for effective communication
Nice to have
- Background in cost optimization strategies or FinOps best practices
- Experience working in regulated industries (pharma, healthcare, finance) or compliance-centric environments
- Knowledge of modern data stack tools (dbt, Dagster/Airflow, Tableau, Power BI)
- Understanding of foundational SRE (Site Reliability Engineering) practices
We offer
- International projects with top brands
- Work with global teams of highly skilled, diverse peers
- Healthcare benefits
- Employee financial programs
- Paid time off and sick leave
- Upskilling, reskilling and certification courses
- Unlimited access to the LinkedIn Learning library and 22,000+ courses
- Global career opportunities
- Volunteer and community involvement opportunities
- EPAM Employee Groups
- Award-winning culture recognized by Glassdoor, Newsweek and LinkedIn