The ideal candidate is going to work in an agile environment on problems that are focused on improving uptime by reducing time to mitigation when issues are surfaced/reported by automated means or through customer incidents, contributing to code for automation of monitoring, patching, and remediation of service anomalies.
You will work alongside a software development team within the greater OAC organization where you will support existing features in the cloud as well as new operational processes, automation, and content.
You will play a key role in improving the processes supporting the OAC services, so that the service functions more and more autonomously over time.
Basic Qualifications:
BS or MS in Computer Science or equivalent technical discipline.
Understanding of cloud networking fundamentals (TCP/IP, DNS, HTTP, .
Knowledge of Linux/Unix systems and experience with system-level troubleshooting and scripting.
Exposure to DevOps practices and tools (., CI/CD pipelines, automated deployments).
Strong problem-solving and troubleshooting skills, including the ability to reverse engineer existing systems.
Ability to produce clear technical documentation (runbooks, knowledgebase articles, help guides).
Experience working with agile methodologies.
Strong communication skills and ability to work with remote, globally distributed teams.
Good interpersonal skills with the ability to build rapport with all stakeholders.
Ability to work independently and adapt to dynamic project needs.
Willingness to participate in on-call rotations and perform occasional after-hours work.
Preferred Qualifications:
Experience with issue tracking and collaboration tools (Jira, Confluence).
Experience with cloud-native application development (microservices, Kubernetes, Docker technologies).
Exposure to REST API design and development using Java technologies.
Perform DevOps activities to support service reliability with customers, release cycles, and production stability.
Participate in a follow-the-sun model for 24x7 support of Oracle Analytics Services
Respond to service incidents, troubleshoot, and lead resolution efforts, including root cause analysis.
Become an expert in Oracle Analytics services, to prevent/resolve customer issues effectively and prevent regressions and repeats.
Document various processes & runbooks as well as update existing processes.
Deliver interim patches, hot-fixes, and upgrades with high quality.
Partner with development, product, and support teams to resolve service failures/outages.
Monitor service metrics, analyse trends, and implement improvements to CI/CD pipelines and operational processes followed by the team.
Follow all best practices and procedures as established by the organization.
Mentor and guide junior engineers, contribute to a culture of knowledge sharing and technical excellence, and drive continuous improvement.
Career Level - IC3