Job Details\n\nSite Reliability Engineer II (aka SRE II) are specialists in treating operations as a software problem. They focus on reliability of systems and services - addressing availability, performance, scalability, latency, observability, efficiency. They work on maintaining key components and developing systems that will minimize human labor (through automation) and increase system reliability with the end goal of breaking the relationship between system size, operational toil and complexity.\n\n\n\nSRE II are responsible for the implementation of technical solutions based on business requirements, they can estimate the effort and impact of the items they work on, and show a high quality of craft in what they deliver. SRE II work primarily within the scope of their team while occasionally collaborating across partner teams. They are expected to work together with colleagues (potentially in other job roles) to design and implement technical tasks. They are also expected to actively participate in incident response for issues affecting their team.\n\nBecause the required technical skills and commercial knowledge can vary from one area to another, SRE II can wear several hats; part of a business service owner team, owner of a piece of infrastructure, and\/or consultant to product development teams regarding Site Reliability Engineering related scope.\n\nBuilding software applications\n\nIs responsible to build software applications by using relevant development languages and applying knowledge of systems, services and tools appropriate for the business area\nIs responsible to refactor and simplify code by introducing design patterns when necessary\nIs responsible to ensure the quality of the application by following standard testing techniques and methods that adhere to the test strategy\nHas sufficient knowledge to write readable and reusable code by applying standard patterns and using standard libraries\nHas sufficient knowledge to maintain data security, integrity and quality by effectively following company standards and best practices\n\nSoftware Systems Design\n\nHas sufficient knowledge to evaluate possible architecture solutions by taking into account cost, business requirements, technology requirements and emerging technologies\nHas sufficient knowledge to describe the implications of changing an existing system or adding a new system to a specific area, by having a broad, high-level understanding of the infrastructure and architecture of our systems\nHas sufficient knowledge to help grow the business and\/or accelerate software development by applying engineering techniques (e.g. prototyping, spiking and vendor evaluation) and standards\nHas sufficient knowledge to meet business needs by designing solutions that meet current requirements and are adaptable for future enhancements\n\n\n\nEnd to End System Ownership\n\nHas sufficient knowledge to own a service end to end by actively monitoring application health and performance, setting and monitoring relevant metrics and act accordingly when violated\nHas sufficient knowledge to reduce business continuity risks and bus factor by applying state-of-the-art practices and tools, and writing the appropriate documentation such as runbooks and OpDocs\nHas sufficient knowledge to reduce risk and obtain customer feedback by using continuous delivery and experimentation frameworks\nIs responsible to independently manage an application or service by working through deployment and operations in production\nHas sufficient knowledge to maintain data security, integrity and quality by effectively following company standards and best practices\n\nTechnical Incident Management\n\nHas sufficient knowledge to address and resolve live production issues by mitigating the customer impact within SLA\nHas sufficient knowledge to improve the overall reliability of systems by producing long term solutions through root cause analysis\nHas sufficient knowledge to keep track of incidents by contributing to postmortem processes and logging live issues\n\nAutomation and toil reduction\n\nHas sufficient knowledge to ensure that infrastructure stays current by reducing technical debt, searching for bottlenecks and preparing for scaling\nHas sufficient knowledge to reduce cost of operations and maintenance by leveraging new technologies, automation, and partner with vendors to ensure we stay current\nHas sufficient knowledge to reduce human labour by writing small software features that address availability, scalability, latency and efficiency\n\nMonitoring and Alerting improvements\n\nHas sufficient knowledge to review and verify performance of production systems and network infrastructure by continuously monitoring appropriate observability metrics, business KPIs and capacity planning\nHas sufficient knowledge to improve application reliability by partnering with development teams to advise on setting appropriate observability metrics\n\nCritical Thinking\n\nIs responsible to systematically identify patterns and underlying issues in complex situations, and to find solutions by applying logical and analytical thinking.\nIs responsible to constructively evaluate and develop ideas, plans and solutions by reviewing them, objectively taking into account external knowledge, initiating 'SMART' improvements and articulating their rationale.\n\nContinuous Quality and Process Improvement\n\nHas sufficient knowledge to identify opportunities for process, system and structural improvements (i.e performance gains) by examining and evaluating current process flows, methods and standards.\nHas sufficient knowledge to design and implement relevant improvements by defining adapted\/new process flows, standards, and practices that enable business performance.\n\nEffective Communication\n\nHas sufficient knowledge to deliver clear, well-structured, and meaningful information to a target audience by using suitable communication mediums and language tailored to the audience\nHas sufficient knowledge to achieve mutually agreeable solutions by staying adaptable, communicating ideas in clear coherent language and practising active listening\nHas sufficient knowledge to ask relevant (follow-up) questions to properly engage with the speaker and really understand what they are saying, by applying listening and reflection techniques\n\nArchitectural Guidance\n\nHas sufficient knowledge to advise product teams towards a technical solution that meets the functional, nonfunctional & architectural requirements by challenging the rationale for an application design and providing context in the wider architectural landscape\nHas basic knowledge to set a clear direction for a technical capability by evaluating and aligning the target architecture improvements, reframing architectural designs and decisions for varied stakeholder\n\nPre-Employment Screening\n\nIf your application is successful, your personal data may be used for a pre-employment screening check by a third party as permitted by applicable law. Depending on the vacancy and applicable law, a pre-employment screening may include employment history, education and other information (such as media information) that may be necessary for determining your qualifications and suitability for the position.