Overview
The DevOps Lead at BBI Logistics is responsible for the reliability, scalability, and efficiency of our core infrastructure and deployment processes. In this senior role, you will architect and manage our cloud infrastructure and CI/CD pipelines to ensure that our services are delivered with high availability, top performance, and rigorous security. You will lead the adoption of DevOps best practices – infrastructure as code, automated deployments, monitoring, and incident response – and collaborate closely with development and QA teams to streamline software delivery. This is a hands-on technical leadership position where you will drive innovation in how we build, release, and maintain our platform, all while mentoring a team of DevOps/SRE engineers and instilling a culture of automation and continuous improvement.
Responsibilities
- Architect and maintain scalable, secure, high-availability infrastructure in the cloud (AWS or similar), using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation) to provision and manage resources efficiently. Ensure the infrastructure is designed for fault tolerance, disaster recovery, and cost-effectiveness to support our growing platform.
- Design and implement robust CI/CD pipelines that automate build, test, and deployment workflows. Streamline the software release process by enabling frequent and reliable deployments, utilizing tools such as Jenkins, GitHub Actions, or GitLab CI for continuous integration and delivery.
- Oversee configuration management and environment consistency across development, staging, and production. Use automation tools like Ansible, Chef, or Puppet to manage system configurations and ensure repeatable deployments and configurations in all environments.
- Implement comprehensive monitoring, logging, and alerting to proactively track system performance and reliability. Deploy and manage monitoring tools (e.g., CloudWatch, Datadog, Prometheus) and logging frameworks (ELK stack or Splunk) to ensure visibility into system health, quick detection of issues, and adherence to SLAs.
- Drive infrastructure security and compliance in operations: work closely with the Security Lead to enforce cloud security best practices (network segmentation, firewalls, IAM policies) and integrate security checks (vulnerability scanning, configuration compliance) into the deployment process. Ensure that DevOps practices support regulatory requirements and protect sensitive data.
- Lead and mentor the DevOps/SRE team, setting standards for operational excellence. Provide technical guidance in system design and coding (Infrastructure as Code scripts, CI/CD pipelines), conduct reviews for infrastructure changes, and foster a culture of collaboration, innovation, and continuous improvement in automation.
- Collaborate with software engineering and QA teams to improve build/deployment processes and environment setups. Act as a bridge between development and operations, ensuring new features are designed with scalability and operability in mind, and troubleshooting deployment issues jointly to resolve bottlenecks quickly.
- Evaluate and adopt new technologies that improve our platform’s performance or developer productivity. This could include container orchestration (Kubernetes), service mesh, serverless architectures, or other emerging DevOps tools and cloud services – you will stay informed of industry trends and guide the team in implementing effective new solutions.
- Own incident management and root-cause analysis for infrastructure issues. Establish on-call rotations and incident response processes to ensure 24/7 support for critical systems. Lead post-incident reviews to identify underlying causes and implement long-term fixes, continuously improving our system’s reliability and recovery capabilities.
Qualifications
- Experience: 7+ years in DevOps, Site Reliability Engineering (SRE), or related fields, including a strong track record of managing cloud-based infrastructure and CI/CD pipelines in production environments. At least 2+ years in a technical leadership or senior engineer role guiding DevOps practices.
- Education: Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience in systems engineering/DevOps. Advanced certifications (e.g., AWS Certified DevOps Engineer, Kubernetes Administrator) are a plus.
- Cloud & Infrastructure-as-Code: Hands-on expertise with modern cloud platforms (AZURE preferred, also AWS or GCP) and managing infrastructure at scale. Proficient in Infrastructure-as-Code tools like Terraform or CloudFormation for automating resource provisioning and configuration.
- Containers & Orchestration: Strong experience with containerization (Docker) and container orchestration using Kubernetes (EKS, AKS, or self-managed clusters). Able to design and manage microservices infrastructure, including scaling, service discovery, and container networking.
- CI/CD & Automation: Demonstrated ability to build and maintain CI/CD pipelines using tools such as Jenkins, CircleCI, GitLab CI/CD, or GitHub Actions. Experience with configuration management and automation tools (Ansible, Chef, or Puppet) to achieve consistent and repeatable deployments.
- Scripting & Programming: Proficiency in scripting/programming (e.g., Python, Bash, or PowerShell) for automating routine tasks and building custom tooling. Solid knowledge of Linux/Unix system administration and shell scripting is required for managing servers and troubleshooting.
- Monitoring & Performance: Familiarity with monitoring and logging solutions (such as Prometheus, Grafana, ELK stack, DataDog, or CloudWatch). Ability to analyze system metrics and logs to diagnose issues, tune performance, and ensure high uptime and system resiliency.
- Security & Networking: Good understanding of network and security fundamentals in cloud environments – including VPC architecture, load balancers, VPNs, DNS, firewalls/security groups, and identity/access management. Experience implementing basic security measures (TLS, secret management, vulnerability patching) as part of infrastructure operations.
- Leadership & Collaboration: Proven ability to lead and collaborate in cross-functional teams. Excellent communication skills to work effectively with software engineers, QA, product managers, and to document infrastructure designs/procedures. Experience mentoring junior DevOps engineers or developers in operational best practices is highly valued.
- Domain Knowledge (Preferred): Experience in logistics, finance or other high-compliance industries is a plus. Familiarity with the requirements of handling financial data (e.g., PCI compliance, data retention policies) and the importance of reliability and security in financial systems will help inform decisions in this role.