Job Title
Service Reliability Engineer
** CV must be in English **
Location: BOGOTA
About the business:
Amadeus Hospitality’s Technology Operations organization provides infrastructure and engineering support for the hosting and development of proprietary software used by the global hospitality industry.
Summary of the role:
The Service Reliability Engineer is responsible for supporting revenue-generating production systems.
The engineer assists with monitoring, maintenance, and problem resolution of production applications.
The candidate must be able to provide prompt technology operations support in a high-energy, fast-paced environment.
In this role you will:
Provide support for production system availability, latency, performance, and efficiency issues.
Support monitoring tools currently in production.
Provide emergency response to production system incidents.
Maintain the production ticketing system.
Maintain the knowledge base solution platform.
Create, delete, and maintain production automation solutions using tools.
Automate day-to-day tasks.
Resolve or remove false-positive alerts.
Configure and update alert dashboards.
Maintain scheduled tasks using a task scheduler.
Become a subject matter expert (SME) on production applications and operations tools.
Participate in the implementation of application releases.
Analyze and interpret application logs to determine problem areas.
Enhance current application and device monitoring systems.
Help evaluate application performance statistics, including application and system response times.
About the ideal candidate:
Bachelor’s or graduate degree in Computer Science or a related field, or a related-field certification.
Working knowledge of the Linux and Windows operating systems.
Ability to troubleshoot web server technologies such as Apache, IIS, or Nginx by connecting to the servers, analyzing the application, server, and operating system logs to identify the root cause, and resolving the issue affecting system availability in production.
Experience supporting middleware such as Tomcat, JBoss, or another application server by evaluating the middleware state, analyzing the logs, and identifying a solution to execute.
Experience supporting monitoring, alerting, or pipeline analysis tools such as Datadog, Zabbix, Prometheus, Splunk, or Nagios, including optimizing their current configuration and maintaining their availability.
General networking knowledge.
Ability to write basic Linux shell scripts using grep, sed, or awk.
Ability to troubleshoot Java application servers using the appropriate commands and JVM arguments.
Fluency in English.
Following skills are a plus:
Fluency in Python, Ruby, or other common scripting languages.
Experience troubleshooting and resolving network latency and connectivity issues.
Experience developing operational automation in a distributed environment.
Ability to perform database queries across database platforms.
Knowledge of automated and centralized job scheduling.
Experience in a mixed on-premises and cloud environment.
Experience with a CDN such as Akamai or Cloudflare.
Experience with VMware.
Experience with Docker and Kubernetes or another container solution.
Strong collaboration skills; a team player.
Good written and verbal communication skills.
What we can offer you:
Get rewarded with competitive remuneration, individual and company annual bonuses, paid vacation and holiday time off, health insurance, and other competitive benefits.
Work from anywhere: onsite, hybrid or fully remote.
Professional development to broaden your knowledge and enhance your skills, with online learning hubs packed with technical and soft skills training that allow you to develop and grow.
Enter a diverse and inclusive workplace, join one of the world’s top travel technology companies and take on a role that impacts millions of travelers around the globe.