Site Reliability Engineer
Job Type
Full-time
Description

Secure Passage, Inc. provides innovative security solutions for companies, governments, and critical infrastructure including schools and other community assets. Our platforms uniquely integrate intelligence, data and insights for a comprehensive view accessible to all stakeholders. Leveraging our expertise in cybersecurity, homeland security, defense, and public safety, we enhance and secure mission-critical operations.

We are looking for a Site Reliability Engineer (SRE) with a strong foundation in DevOps principles to join our engineering team. This is an opportunity to play a key role in ensuring the reliability, stability, and security of our infrastructure while supporting development teams as they build and deploy scalable, resilient applications in the cloud.

Key Responsibilities

  • Kubernetes Expertise: Create, deploy, configure, and manage applications using Kubernetes, with a strong focus on AWS Fargate/EKS and GCP Cloud Run. Responsibilities include optimizing deployments for security, stability, and monitoring.
  • Cloud Infrastructure Management: Manage AWS or GCP-based infrastructure. You will be responsible for deploying, scaling, and monitoring infrastructure components to ensure availability, resilience, and performance.
  • Security Focus: Implement and maintain security best practices, including encryption in transit and at rest. Configure and monitor necessary security components (VPC, IAM, etc.) to ensure data and infrastructure security.
  • Container Optimization: Work with development teams to optimize containerized applications, specifically Golang, Python, and Node.js, for performance, scalability, and resource efficiency.
  • Monitoring & Stability: Set up and manage monitoring tools (such as CloudWatch, CloudTrail, Prometheus, Grafana, and Google Cloud Monitoring) to track performance, identify bottlenecks, and maintain overall system health.
  • App Load Testing (Bonus): Run load tests to ensure applications can handle expected traffic. Analyze results and recommend optimizations based on performance metrics.
Requirements

Qualifications

  • 3+ years of experience in a Site Reliability Engineer, DevOps, or Infrastructure Engineer role.
  • Demonstrable experience with Kubernetes in deployment, optimization, security, and monitoring.
  • Strong understanding of cloud services, including best practices for deployment, monitoring, and security.
  • Deep knowledge of securing cloud environments and data, with hands-on experience configuring encryption mechanisms.
  • Experience with Docker and with optimizing containers running Golang, Python, and Node.js applications for performance and resource utilization.
  • Bonus: Knowledge of app load testing techniques and tools.
  • Ability to obtain some level of security clearance or successfully complete a rigorous background check for certain clients.
  • Authorized to work in the United States without sponsorship.

Desired Skills & Competencies

  • Automation: Experience with Infrastructure-as-Code (IaC) tools such as Terraform.
  • Collaboration: Work closely with development teams to improve CI/CD pipelines and automate deployment processes.
  • Problem Solver: Strong troubleshooting skills with a proactive approach to identifying and resolving infrastructure issues.
  • Security Mindset: Consistent focus on security across development and operational practices.