Senior DevOps Engineer

Profit Isle is transforming how finance teams understand and act on profitability. We’re evolving from a consulting-heavy model into an AI-powered SaaS platform.

About Profit Isle

We’re looking for a Senior DevOps Engineer to help build and scale the cloud infrastructure, automation pipelines, and MLOps foundation that power our analytics, AI, and enterprise features.

You’ll work across engineering, data, and machine learning domain, designing secure, scalable, and observable systems that enable reliable deployments, cost efficiency, and continuous innovation.

Responsibilities

Core DevOps & Infrastructure Management

  • Design, build, and maintain our core cloud infrastructure using Infrastructure as Code (IaC) principles (e.g., Terraform, Ansible).
  • Automate the provisioning of scalable, secure, and resilient environments on our cloud provider (GCP).
  • Champion and implement best practices for containerization (Docker) and orchestration (Kubernetes), ensuring high availability and efficient resource utilization.
  • Implement GitOps principles for consistent, automated deployments across environments (e.g., using ArgoCD or Helm) and manage artifact storage via tools like Artifact Registry.
  • Manage and enhance our observability stack (e.g., Datadog, GCP Monitoring tools), providing deep insights into both application and infrastructure performance.

MLOps (Machine Learning Operations)

  • Develop and manage robust CI/CD pipelines for the end-to-end machine learning lifecycle: data validation, model training, versioning, and production deployment.
  • Establish monitoring for production models to track performance, detect data/concept drift, and trigger retraining workflows.
  • Maintain and continuously improve existing CI/CD pipelines for software and model deployments.

Data & Pipeline Infrastructure

  • Collaborate with data engineers to build and support the infrastructure for data ingestion, processing, and storage.
  • Manage and optimize data pipeline orchestration tools that feed our ML models.
  • Ensure data security and governance across our data lakes and warehouses.

Security & Cost Optimization

  • Integrate security best practices into all stages of the infrastructure and deployment lifecycle.
  • Proactively monitor and optimize cloud resource usage to manage the high costs associated with CPU instances and large-scale data processing.
  • Implement cost-control measures, budget alerting, and resource tagging strategies.

Collaboration & Support

  • Serve as the infrastructure expert for both the software engineering and data science teams, providing guidance and unblocking them.
  • Create documentation and runbooks for the systems you build, empowering others to use them effectively.

Requirements

  • BS in Computer Science, Management Information System, Software Engineering/Development
  • 3+ years of experience in a DevOps, SRE, or Cloud Infrastructure role.
  • Strong proficiency with a major cloud provider (GCP).
  • Hands-on expertise with Infrastructure as Code (Terraform strongly preferred).
  • Proven ability to build and manage complex CI/CD pipelines (e.g., Jenkins, GitHub Actions).
  • Deep experience with containerization (Docker) and container orchestration (Kubernetes).
  • Excellent scripting skills (e.g., Python, Bash).
  • Excellent ability to multi-task and exceptional time management skills.
  • Experience in building monitoring dashboards for observability purposes.
  • A strong understanding of cloud cost management strategies.

Bonus Qualifications

  • Direct experience with MLOps principles and tools.
  • Experience in a SaaS or multi-tenant environment.
  • Experience setting up and managing Google Kubernetes Engine.
  • Exposure to ISO 9001 or similar quality frameworks.
  • Exposure to SOC 2 compliance or similar security frameworks.

What We Offer

  • Competitive salary and benefits package.
  • Hybrid work environment.
  • Professional development opportunities.
  • Access to the latest technologies and tools.
  • Opportunity to work on innovative projects.
  • Modern tech stack and tools.
  • Collaborative and learning-focused team culture.

Profit Isle is committed to building a diverse and inclusive team. We welcome applicants from all backgrounds and experiences.