Secure Passage, Inc. provides innovative security solutions for companies, governments, and critical infrastructure, including schools and other community assets. Our platforms uniquely integrate intelligence, data, and insights for a comprehensive view accessible to all stakeholders. Leveraging our expertise in cybersecurity, homeland security, defense, and public safety, we enhance and secure mission-critical operations.
We are looking for a Site Reliability Engineer (SRE) with a strong foundation in DevOps principles to join our engineering team. This is an opportunity to play a key role in ensuring the reliability, stability, and security of our infrastructure while helping development teams create and deploy scalable, resilient applications in the cloud.
Key Responsibilities
- Kubernetes Expertise: Create, deploy, configure, and manage applications using Kubernetes, with a strong focus on AWS Fargate/EKS and GCP Cloud Run. Responsibilities include optimizing deployments for security, stability, and monitoring.
- Cloud Infrastructure Management: Manage AWS- or GCP-based infrastructure. You will be responsible for deploying, scaling, and monitoring infrastructure components to ensure availability, resilience, and performance.
- Security Focus: Implement and maintain security best practices, including encryption in transit and at rest. Configure and monitor necessary security components (VPC, IAM, etc.) to ensure data and infrastructure security.
- Container Optimization: Work with development teams to optimize containerized applications, specifically Golang, Python, and Node.js, for performance, scalability, and resource efficiency.
- Monitoring & Stability: Set up and manage monitoring tools (CloudWatch, CloudTrail, Prometheus, Grafana, Google Cloud Monitoring, etc.) to track performance, identify bottlenecks, and maintain overall system health.
- App Load Testing (Bonus): Run load tests to ensure applications can handle expected traffic. Analyze results and recommend optimizations based on performance metrics.
Qualifications
- 3+ years of experience in a Site Reliability Engineer, DevOps, or Infrastructure Engineer role.
- Demonstrable experience with Kubernetes in deployment, optimization, security, and monitoring.
- Strong understanding of cloud services, including best practices for deployment, monitoring, and security.
- Deep knowledge of securing cloud environments and data, with hands-on experience configuring encryption mechanisms.
- Experience with Docker and optimizing Golang, Python, and Node.js-based containers for performance and resource utilization.
- Bonus: Knowledge of app load testing techniques and tools.
- Ability to obtain some level of security clearance or successfully complete a rigorous background check for certain clients.
- Authorized to work in the United States without sponsorship.
Desired Skills & Competencies
- Automation: Experience with Infrastructure-as-Code (IaC) tools such as Terraform.
- Collaboration: Work closely with development teams to improve CI/CD pipelines and automate deployment processes.
- Problem Solver: Strong troubleshooting skills with a proactive approach to identifying and resolving infrastructure issues.
- Security Mindset: Consistent focus on security across development and operational practices.