Overview
We are seeking a skilled DevOps Engineer with 5+ years of hands-on experience to join our engineering team.
This role will be pivotal in ensuring the availability, scalability, and reliability of our cloud-native systems, databases, and applications.
The ideal candidate is proactive, collaborative, and passionate about building resilient and automated infrastructures.
Este es un puesto de trabajo remoto.
Responsibilities
- Configure and operate monitoring, logging, and tracing tools, collaborating with developers to enhance application logging for effective problem detection.
- Build operational dashboards, create alerts and automation workflows to guarantee system reliability and uptime; define and track key reliability metrics.
- Monitor system performance and reliability, implementing improvements and additional alerts as necessary.
- Work closely with software engineering teams to design and implement robust, reliable systems.
- Write and maintain automation tasks to streamline infrastructure and development processes.
- Participate in a 24/7 on-call rotation to respond to alerts and incidents, performing root cause analysis and post-mortem reviews.
- Implement and manage security and compliance best practices across infrastructure and CI/CD pipelines.
- Manage and optimize AWS EKS Kubernetes clusters for deployment, scaling, and operation of containerized applications.
- Design, build, and maintain scalable, customer-facing infrastructure on AWS using Terraform.
- Collaborate with developers and QA teams to streamline code deployment and testing workflows.
- Partner with Database Administrators to gather requirements and configure database connection parameters.
- Provide support for Linux-based environments across development, staging, and production.
Qualifications
- 5+ years of professional experience in Site Reliability Engineering (SRE), DevOps, or a related field.
- Familiarity with SRE concepts such as SLI/SLOs and Golden Signals.
- Strong problem-solving skills and ability to work independently and collaboratively.
- Proven experience with Kubernetes (monitoring, deployment, scaling, networking).
- Deep understanding of AWS services (EC2, EKS, S3, IAM, CloudWatch, etc.).
- Experience with CI/CD pipelines, preferably using GitHub Actions.
- Scripting expertise in Bash Shell.
- Solid knowledge of Docker and container registries.
- Proficiency with Linux-based systems and troubleshooting system-level issues.
Preferred Qualifications
- Experience with Helm charts for Kubernetes deployments.
- Knowledge of networking fundamentals and security best practices.
- Familiarity with Agile development methodologies and the DevOps culture.
Job Details
- Seniority level: Mid-Senior level
- Employment type: Full-time
- Job function: Information Technology
- Industries: IT Services and IT Consulting
We are not providing any explicit job location details beyond remote and Bogota references in the original content.
#J-18808-Ljbffr