Customer Reliability Engineer, Reliability and Incident Management
- linkCopy link
- emailEmail a friend
Minimum qualifications:
- Bachelor’s degree in Science, Technology, Engineering, Mathematics, or equivalent practical experience.
- 13 years of experience troubleshooting and advocating for customer needs, and triaging technical issues (e.g., hardware, software, application, operational, process).
- Experience in reading/debugging code written in a general purpose coding language (e.g., Java, C, C++, Python, Shell, Go or JavaScript, etc.) and in virtualization and orchestration frameworks.
- Experience troubleshooting /advocating in a customer-facing environment, triaging technical issues (e.g., hardware, software, application, operational, process).
- Experience with incident management and response in a distributed systems environment.
Preferred qualifications:
- 5 years of experience in a technical role such as Site Reliability Engineering, Technical Solutions Engineering, or Software Engineering, Customer Engineering or professional services.
- Experience in applying SRE principles to improve the reliability and performance of systems.
- Experience in managing technical escalations and communicating with executive stakeholders.
- Familiarity with facilitating resilience exercises and post-incident reviews.
- Ability to lead and influence in a collaborative, cross-functional environment and analyze systems, specifically Non-Abstract Large-Scale System Design (NALSD).
- Excellent communication skills with the ability to convey technical concepts to a variety of audiences.
About the job
As a Customer Reliability Engineer, you will be a pivotal individual contributor on the CRIM team. You will be entrusted with providing technical leadership directly to customers. This role requires a unique blend of technical expertise, exceptional customer-facing skills, and a proactive mindset to prevent incidents before they occur. You will leverage your experience to guide customers in adopting SRE principles, navigating incidents, and ultimately build reliable systems on Google Cloud.Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.Responsibilities
- Provide education on Google's reliability principles and best practices by engaging directly with customers.
- Lead high-level architectural reviews and risk analyses to help customers identify and mitigate potential system vulnerabilities.
- Advocate for customers within Google by providing feedback to Product and Engineering teams to improve reliability and supportability.
- Ensure lessons are learned from incidents by supporting and guiding the postmortem review process.
- Co-ordinate remediation actions in the customer's environment post-incident by working with frontline Technical Support Engineers.
Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.
Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.
If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.
Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.
To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.