CloudBees was founded in 2010 and is backed by Goldman Sachs, Morgan Stanley, Bridgepoint Credit, HSBC, Golub Capital, Delta-v Capital, Matrix Partners, and Lightspeed Venture Partners. Visit www.cloudbees.com and follow us on Twitter, LinkedIn, and Facebook.
It’s an exciting time to join the CloudBees team because thousands of development and deployment teams around the world are using our products to enhance and optimize the way their teams build and deliver software using continuous delivery.
To support the delivery of CloudBees products (both internal and external), our Operations team designs, deploys, secures and manages a variety of software systems and the related GCP / AWS infrastructure that underpins these software engineering objectives. Our guiding principle is simple - “Make Engineering Faster”. Underpinning this are the challenges of security, process change, technical change that all must be met to varying degrees. Further, we are passionate about reducing the manual work that plagues IT teams - Engineering / Ops / Support / Security - and are empowered to re-engineer processes (more easily said than done) and technology to achieve those objectives.
Our preferred candidate will be located in a US/CAN Eastern Standard Time zone to provide the best working hour coverage for our Engineering and Operations workloads.
We are looking for someone who:
- Will bring new ideas from where you’ve worked in the past - what worked well, what didn’t work, and help us understand what CloudBees could do better.
- Has “the knack” - an uncanny ability to uncover the root cause of problems based on limited information because “it feels like something you’ve seen before”.
- Has no fear of saying - “Have you considered doing it this way” - and giving constructive feedback to colleagues on alternative ways of doing things.
- And, the courage to say - “I’m not sure” - and seeks feedback from your colleagues on how to complete a task in our tech-stack.
- self-motivated and enjoy solving problems
- excited by the opportunity to automate yourself out of recurring work
- able to keep tickets up to date, so we don’t need too many status meetings (we’re async first due to our distributed nature)
- You will liaise directly with the software engineers across all teams, ensuring that decisions are agreed internally and externally to Operations and that they meet our technical and non-technical objectives.
How you’ll work:
- In a geographically distributed team (APAC, US, EMEA) of peers
- Choose your own work in tandem with the team, team-leader, and manager
- Report directly to the Operations Manager (who reports to the VP of Engineering)
- Flexible working hours - while there will be some core hours required for meeting with Engineering and Operations teams - much latitude is given in getting work done. This includes determining your own start and finish times to accommodate family life.
- Limited travel (especially at the moment) - but once things return to normal (whatever that looks like) we’ll have an annual offsite somewhere on the planet
Team-building - inside the team and across teams - you’ll be proposing new ideas and helping implement them
Thinking - coming up with new ways of solving problems - and working with the team to prove them out and then implement them
Documenting - describing problems, how your proposed solution solves those problems, and how your implemented solutions are operated.
Optimizing - working with Engineering teams to optimize their build systems - even our monster jobs (lots of parallelism - lots of bottlenecks - lots of technical challenges)
Defining - writing / modifying Terraform to handle our infrastructure, and helping teams define their infrastructure using our modules.
Coding - you’ll be writing code in Golang - we occasionally write glue in Python / Groovy (but not very often)
- working with our security team to drive operational change in engineering teams - “Supply Chain Integrity” - you’ve heard about it - it’s the new hot topic
- working with our security team to drive organizational security changes (logging, auditing, monitoring, alerting)
Observing - adding the right monitoring so that we are alerted before our customers notice, and not alerted when the system is able to heal itself
Alerting - getting alerts and working with team members to solve the initial problem, documenting the problem and then working out how to stop them happening again
DevOps / DevSecOps - We’re not “DevOps” engineers, but we do help our teams become more proficient in doing Dev and Ops (and embracing Total Ownership). We build the guard rails to help them do it safely and align tech-stacks across the company.
What you will work on:
- IaaS - GCP, AWS, a little Azure
- Platform tools - Kubernetes, terraform, helm, docker
- Language tools - Golang
- Engineering tools - Jenkins, Vault, CodeShip, Auth0
- Operations tools - PagerDuty, DataDog
There are a lot more tools - you should ask during your interview!
You have experience in:
- programming in various languages and domains
- the "modern Ops stack" (e.g. monitoring, alerting, cloud-based provisioning, Docker, Kubernetes)
- Linux systems administration
- cloud-based operations (GCP, AWS, or Azure)
- cluster orchestration and management tools (e.g. k8s / ECS / Terraform / etc)
- continuous integration / continuous delivery tools (Jenkins / CodeShip etc)
- modern software engineering practices: code reviews, unit / acceptance testing, source control, etc.
- using work tools like GSuite, GitHub, Jira, Confluence, and Slack.
- You are available for occasional On-Call
- NOTE: No one has been woken up for an outage in over 8 years, there hasn’t been a weekend outage in the last 6 months, and while we have had to triage security vulnerability reports (on weekends), but those have not required work beyond the initial fault analysis. There are occasional false-positives - that are easily silenced - and then resolved at a system level during normal working hours.
We're invested in you!
We offer competitive benefits packages, opportunities for professional development, and more. We also offer generous paid time off to allow our employees time to rest, recharge and to be present with family and friends throughout the year.
At CloudBees, we truly believe that the more diverse we are, the better we serve our customers. A global community like Jenkins demands a global focus from CloudBees. Organizations with greater diversity—gender, racial, ethnic, and global—are stronger partners to their customers. Whether by creating more innovative products, or better understanding our worldwide customers, or establishing a stronger cross-section of cultural leadership skills, diversity strengthens all aspects of the CloudBees organization.
In the technology industry, diversity creates a competitive advantage. CloudBees customers demand technologies from us that solve their software development, and therefore their business problems, so that they can better serve their own customers. CloudBees attributes much of its success to its worldwide work force and commitment to global diversity, which opens our proprietary software to innovative ideas from anywhere. Along the way, we have witnessed firsthand how employees, partners, and customers with diverse perspectives and experiences contribute to creative problem-solving and better solutions for our customers and their businesses.