Site Reliability Engineer (SRE) at NEORIS
Join to apply for the Site Reliability Engineer (SRE) role at NEORIS .
We are looking for an experienced Site Reliability Engineer (SRE) to join and ensure the reliability, scalability, and performance of our cloud-based infrastructure.
The ideal candidate will have strong expertise in Google Cloud Platform (GCP), container orchestration, and infrastructure-as-code, with a focus on automation and observability.
Key Responsibilities
- Design, implement, and maintain GCP infrastructure with a focus on reliability and security (GCP expertise is mandatory)
- Manage and optimize Google Kubernetes Engine (GKE) clusters, including upgrades and container troubleshooting
- Implement and maintain Docker containerization strategies
Automation & Deployment
- Develop and maintain GitHub Actions workflows for CI/CD pipelines
- Manage infrastructure as code using Terraform for deployments
- Automate operational processes to improve efficiency and reduce manual intervention
Networking & Security
- Configure and maintain firewall rules with focus on network protocols and inbound traffic management
- Implement security best practices across all infrastructure components
- Monitor and optimize network performance
Monitoring & Observability
- Implement and maintain Prometheus for system monitoring and alerting
- Configure and manage Grafana dashboards for system visibility
- Establish SLOs, SLIs, and error budgets for critical services
Collaboration & Best Practices
- Work closely with development teams to improve system reliability and performance
- Participate in incident response and post-mortem analyses
- Document system architecture and operational procedures
Technical Requirements
Must-Have Skills
- Extensive experience with Google Cloud Platform (GCP) services
- Strong knowledge of Kubernetes (GKE) and container orchestration
- Proficiency in Terraform for infrastructure provisioning
- Experience with GitHub Actions or similar CI/CD tools
- Expertise in Docker containerization
- Networking knowledge including firewall configuration and protocols
- Monitoring stack experience (Prometheus + Grafana)
Nice-to-Have Skills
- Experience with API security and tokenization
- Knowledge of schema design and database optimization
- Understanding of promotion strategies for canary deployments
- Familiarity with infrastructure cost optimization
Soft Skills
- Strong problem-solving and troubleshooting abilities
- Excellent communication skills for collaborating across teams
- Proactive approach to identifying and addressing potential issues
- Ability to document technical processes clearly
We Offer
- Professional growth
- Dynamic work environment
- Competitive salary
- Attractive benefits plan
- Development opportunities
For more information, visit NEORIS at or follow NEORIS on social media.
This job posting may include restrictions regarding location and eligibility where applicable.
#J-18808-Ljbffr