Direct message the job poster from Forte Group
Our client is a leader in full-stack automation for mission-critical business processes.
Their SaaS-based composable automation platform, purpose-built for ERP, empowers organizations to orchestrate, manage, and monitor workflows across any application, service, or server — in the cloud or on-premises — with confidence and control.
Core Values : One Team • Make Your Own Weather • Obsess over Customer Success • Work the Problem • Be Curious • Own the Outcome • Respect Each Other
Responsibilities
- Monitor alerts and system health, document incidents, and escalation issues as needed while participating in 24/7 on-call support
- Automate manual tasks and implement scripts and checks to improve infrastructure reliability and performance
- Deploy and manage infrastructure in EKS/Kubernetes clusters using Terraform and Helm, and support existing environments in Docker and Docker Swarm
- Proactively implement monitoring solutions and create automation for root cause analysis (RCA) and remediation
- Collaborate closely with Support, Customer Success, Migration, and Professional Services teams to deliver best-in-class SaaS reliability
- Plan and execute deployments and updates with a strong customer-centric focus, minimizing impact on users
- Investigate incidents, perform RCA, implement preventive measures, and define actions for relevant teams
Qualifications
- Hands-on experience as an AWS Cloud Engineer, with strong knowledge of EKS, Terraform, and Helm
- Proficiency with Docker and Docker Swarm, and a solid understanding of AWS IAM roles and policies
- Experience in logging and monitoring AWS resources using CloudWatch, and working in a Linux environment
- Strong scripting skills in Bash and/or Python, with a good understanding of REST APIs.
- Experience with monitoring solutions such as Grafana and Prometheus
- Customer-facing communication skills to explain issues and RCA findings effectively
- Background in product/application support for SaaS solutions and familiarity with APIs, databases, systems architecture, and design
- Experience designing, implementing, and operating within a DevSecOps environment
Seniority level
Employment type
Job function
- Engineering and Information Technology
Industries
- IT Services and IT Consulting
- Software Development
- IT System Custom Software Development
#J-18808-Ljbffr