Senior (Lead) Software Engineer, Site Reliability Engineering (remote)
Fully Remote Stow, OH
Description

Overview

Binary Defense, headquartered in Stow, Ohio, is a rapidly growing cybersecurity software and services firm with solutions that include SOC-as-a-Service, best-in-class Managed Detection & Response software, Security Information & Event Management, Threat Hunting and Counterintelligence. Our expert security staff and technology help shield businesses from cyberattacks.


Named a Cleveland Plain Dealer Top Workplace in 2020 and a top 10 software developer in Northeast Ohio by Crain’s, Binary Defense is a fast-paced business that enjoys a relaxed culture, flexible remote work options, as well as generous on-site perks. For the third year in a row, Binary Defense has been recognized as one of the fastest-growing private companies in the US! Ranking at 2,985 on the Inc. 5000 list for 2021, Binary Defense is proud of its position as the only security firm in Ohio to make the list. The company ranks among the top 100 in Ohio for growth, and in the top 25 in the Northeast Ohio region. At the 2021 Greater Cleveland Partnership’s “Best of Tech Awards,” Binary Defense was recognized as the “Tech Company of the Year”, “Best Technology Solution” and “Best Tech Services Company” in Northeast Ohio. 


Binary Defense offers competitive medical, dental and vision coverage for employees and dependents, a 401k match which vests every payroll, a flexible and remote friendly work environment, as well as training opportunities to expand your skill set (to name a few!). If you’re interested in joining a growing team with great perks, we encourage you to apply!


About the Role

Binary Defense is looking for a Site Reliability Engineer (SRE) to work within a cross-functional team to expand the capabilities of our flagship product. We believe that production stability is the responsibility of the entire delivery team, and that excellent software is created through the proximity of development and operations activities. Keep reading if you are a software engineer with a passion for automation, enjoy short release cycles, appreciate working as a core member of a software delivery team, and relentlessly focus on continuous improvement. 


You will be responsible for automating, monitoring, and improving both system reliability and availability. You will be a Subject Matter Expert in evaluating performance and risk of outgoing software features. Additionally, you will lead the effort of monitoring, tracking, reporting, and improving trends for Service Level Indicators (SLIs) and performance against Service Level Objectives (SLOs) within agreed upon error budgets. 


Reporting into the Senior Software Architect, this role will be critical in helping the organization reach its development roadmap goals and support a product experiencing rapid growth. As a remote-friendly team, we default to trust and expect the best from each other. We thrive as a team when we cooperate with each other to deliver timely and effective work. We do our best to help everyone bring their whole selves to work, encourage diversity, and support family-friendliness and flexibility in our schedules.


Key Responsibilities

· Focused on reliability, performance, efficiency improvements, & monitoring of the various environments supporting our internal team members and external customers.

· Participate as a member of an agile delivery team through the full life cycle from ideation, deployment, operation, & refinement.  

· As a member of a delivery team, you will be involved in design, development, testing, capacity planning, and readiness reviews. 

· Responsible for identification and implementation of enablement tools that support our continuous integration (CI) / continuous delivery (CD) system and automation framework. Ensure the tools are selected with input from the delivery team and are then adopted with consistency.? 

· Partner with developers, product management, and other internal subject matter experts to ensure the team has a strategy to achieve desired service level objectives (SLOs). 

· Lead effort to monitor, alert, & report on overall system health by tracking Service Level Indicators (SLIs) and Service Level Objectives (SLOs). 

· Share the status of key performance indicators and metrics via dashboards. 

· Use data to advocate for changes that will measurably improve reliability and increase velocity.

· Reduce manual activities with limited long-term value through automation. Measures, establishes goals, and celebrates the improvements achieved.

· Participate in the on-call rotation with other delivery team members.

· Perform initial review and triage of production issues, escalates as appropriate. 

· Champion sustainable incident response and blameless root cause analysis.


Requirements

Education/Experience

· Computer Science, Software Engineering, or similar degree. Equivalent real-world experience would be acceptable in lieu of degree.

· Experience successfully working with distributed teams as the delivery teams are fully remote.

· 7+ years of hands-on software development experience.

· Strong communication and collaboration skills.

· Ability to successfully define and drive adoption of tools, processes, and frameworks across multiple teams.

· Deep knowledge and experience with:

    Managed Kubernetes offerings such as AKS, EKS, GKE

    GitLab Build Pipelines

    Docker

    Terraform


Other Knowledge, Skills and Abilities

· Technologist - Knowledge and interest in the latest system architecture, automation, cloud, and advanced technology trends with the ability to rapidly learn and apply new technology. Strong ability to share and teach to accelerate the team's adoption of new technologies. 

· Calculated Risk Taker - Understands that end user satisfaction is a balance between features, service, and performance.

· Collaborative - Works closely with team members and stakeholders to understand needs, gain perspective, and collectively deliver solutions with a shared purpose. 

· Enthusiastic - must be high-energy and a passionate advocate for quickly delivering value. 

· Adaptive and Inclusive - works with team members to understand pain points and adjust standards, tools, and best practices accordingly. Learns from the team and adjusts with a focus on enablement.  

· Attitude of transparency - must desire to bring disclosure and transparency.

· Creativity, initiative, and flexibility - tempered by pragmatism, patience, and attention to detail. 

· Honest, humble, friendly, and collegial. 

· Creative problem-solver - Ability to look at solutions in creative and unconventional ways, recognize opportunities to innovate, and engage partners in a vision and strategy while maintaining the "big picture" view 

· Commitment to continuous improvement. Ability to dynamically adjust the plan, to resolve impediments as well as to meet changing business needs. 

· Accountable - must embody a strong sense of responsibility for the timely completion of tasks, as well as the responsibility to ensure a shared understanding of shared tasks. 


Preferred

· Experience with mix of processes (Kanban, Scrum, XP, LeSS, SAFe, Waterfall, etc.). 

· Experience in the security industry.

· Experience with Kubernetes based tooling:

    Cert Manager

    KEDA

    Ingress Controllers such as Nginx Ingress and HA Proxy Ingress

    AWS Controllers for Kubernetes (ACK) or Azure Service Operator (ASO)

    Flux 

    Spinnaker

    Argo

· Knowledgeable in multitenant deployments with Kubernetes

· Master of secrets management

· Experience monitoring with:

    Prometheus

    Grafana

· Experience administering and scaling:

    .NET Core

    Python

    Postgres

    Redis

· Experience with Windows and Linux operating systems.

· Experience managing messaging infrastructures.

· Experience with AWS solution architecture and management. Including:

    EC2

    ECS

    S3

    VPC

· Experience with 

    Debian/Linux

    Ansible



PM21