User avatar
Full Time Partially Remote
Vancouver, BC, Canada
6 days ago
Location: Vancouver, BC With growth in the Copperleaf development teams and many exciting new projects, Copperleaf Cloud Operations is looking to expand. CloudOps is modelled on a Site Reliability Engineering Team. We provide reliability and uptime and work on our services to automate and reduce toil. As a CloudOps Engineer you will be involved in the following: Ensuring Copperleaf Services run smooth and have the capacity for continued growth and improvement. Designing and architecting cloud-based solutions to support Copperleaf’s services. Working with Product Development to design hosting solution for Copperleaf products and tooling. Contributing to the development of infrastructure health monitoring and reporting Troubleshooting production infrastructure, load issues, and implement solutions Using best practices for deployment processes and site reliability processes Identifying areas for development and improvement of Windows and Linux capabilities Mentoring other members of the team Working together with the team, you will share an on-call rotation and be an escalation contact for service incidents Your background: You have at least 3+ years’ experience working in a DevOps, and/or SRE role and 5+ years’ total relevant experience in a software development or IT environment Building and operating highly available, complex customer facing systems at scale in a 24x7 environment. Ability to build tooling, automation and/or services in one or multiple languages (e.g., Go, Python) Tooling which measures service KPIs, providing live data on availability, performance, and system health experience in higher-level languages (e.g., Python, Java, NodeJS) writing and reviewing code, You are familiar with micro-services as well as monolithic architectures, and ideally have seen them in operation at a global scale You have prior experience working in high performance or distributed systems Identifying novel solutions to challenging operational problems and developing interfaces between separate SaaS offerings Proven track record of managing cloud operations in a 24/7 setting. Experience supporting live production systems, maintaining high availability, and responding swiftly to issues as they appear Participate in on-call rotation and be an escalation contact for service incidents Experience with both Linux, Windows systems, and networking fundamentals 6+ years’ experience in configuration and maintenance of applications such as web servers, load balancers, relational databases, storage systems and messaging systems 6+ years’ experience learning software, frameworks and APIs System maintenance, patching and deployment Experience in creating secured cloud infra for Staging, Development and Production environments. Skills: Configuration Management tools (e.g., Ansible) Docker for Linux and for Windows Amazon Web Services Continuous integration tools (e.g. Jenkins, Azure DevOps) Source and Project Control (JIRA, Github, Github Enterprise) Infrastructure as code (e.g. Terraform, CloudFormation) Knowledge of web application frameworks Familiarity with NoSQL, relational SQL, and database concepts Monitoring and Logging tools (Sumologic, DataDog, New Relic) Dashboard tools (e.g. Grafana) About you: You have an eye for detail and can identify trends and commonalities across large numbers of incidents. You’re a great communicator and enjoy capturing process in documentation: internal documentation (accurate concise bug reports, reference material). You’re flexible - as a small but rapidly growing team in an international company, you occasionally may be required to work flexible hours to support 24/7 infrastructure. You’re a wizard at troubleshooting: able to identify root causes and find workarounds. You can see the big picture and proactively manage long-term goals and priorities. You’re curious; passionate about learning new information. We recognize that talent comes in many forms, so we’re looking for passion, enthusiasm and transferable skills. What’s it like here? At Copperleaf, we’re committed to building a great culture because we know it sets us apart. Culture is at the very core of everything we do, and it is what makes people want to be part of the market-leading company we’re building. We are a global team of world-class innovators continually pushing the limits of what’s possible to deliver exceptional value and extraordinary experiences to our clients. To do that, we actively cultivate an open and supportive team environment, where diverse ideas and perspectives are encouraged and respected. Headquartered in Vancouver, Canada, Copperleaf is building a better world, one decision at a time. As one of Canada’s Fastest-Growing Companies, winner of Canada’s Most Admired Corporate Cultures and the BC Tech Association’s Tech Culture of the Year, we are a dynamic and disruptive organization offering exciting opportunities for growth and innovation. We are also proud to be a proactive eq