Overview
At ForUsAll, we’re revolutionizing the U.S. retirement industry with cutting-edge AI technology.
Based in San Francisco, our fintech startup is on a mission to provide cost-efficient retirement solutions for small and mid-sized businesses.
Founded by industry veterans who reimagined 401(k) plans for Fortune 500 companies, we’re backed by top venture capital firms and fintech leaders who share our mission to democratize access to modern, diversified retirement portfolios and empower everyday Americans to achieve financial security.
About the Role
We’re looking for a Senior DevOps Engineer to lead and scale the infrastructure and automation pipelines that power our AI-driven applications.
You’ll own our AWS environment, CI/CD systems, and MLOps infrastructure, ensuring reliable deployments and high availability of our platforms.
This is a high-impact role for someone excited about infrastructure-as-code, rapid iteration, and building automation around ML workflows.
You’ll collaborate closely with AI engineers, backend developers, and product teams to streamline development and operational efficiency from code to cloud.
What You’ll Do
- Architect, maintain, and optimize cloud infrastructure in AWS, following security and scalability best practices
- Manage and enhance our CI/CD pipelines (e.g., GitHub Actions, CodePipeline, CircleCI) to support reliable and fast releases
- Build and maintain MLOps workflows, including model versioning, training pipelines, testing, and deployment automation
- Support container orchestration using Docker and Kubernetes (EKS preferred)
- Define and enforce infrastructure-as-code practices using tools like Terraform or CloudFormation
- Monitor system performance and availability using modern observability stacks (e.g., Prometheus, Grafana, Datadog, CloudWatch)
- Collaborate with engineering teams to set up staging environments, automate tests, and manage secrets and access control
- Drive reliability and incident response practices (alerts, runbooks, root cause analysis, etc.)
Requirements
- 5+ years of DevOps, Site Reliability, or Infrastructure Engineering experience
- Strong hands-on experience with AWS core services (EC2, ECS/EKS, S3, IAM, CloudWatch, RDS, Lambda, etc.)
- Experience designing and maintaining robust CI/CD pipelines with GitHub Actions or similar tools
- Solid understanding of MLOps workflows and ML model deployment practices
- Proficient in infrastructure-as-code tools (Terraform or CloudFormation)
- Experience managing containerized applications with Docker and Kubernetes
- Proficient with scripting languages (Bash, Python, etc.) for automation and tooling
- Familiarity with security best practices (e.g., least-privilege IAM roles, secret rotation, encryption in transit/at rest)
- Bonus: Experience with AWS SageMaker or Bedrock
This will be a hybrid role with 3 days in our office located in El Poblado.