Careers
Careers

job details

Back to jobs search

Jobs search results

2,205 jobs matched
Back to jobs search

Staff Quality and Reliability Engineer, Google Cloud

GoogleSunnyvale, CA, USA

Minimum qualifications:

  • Bachelor's degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field, or equivalent practical experience.
  • 8 years of experience in reliability or product quality engineering (e.g., working on ICs, SoCs, or microprocessors).
  • Experience with silicon or semiconductor manufacturing or Fab processes (e.g., CMOS, FinFET, or Device Physics).
  • Experience with advanced manufacturing nodes (e.g., 5nm, 3nm) or assembly (e.g., 2.5D, 3D, or Chiplet packaging).
  • Experience in a production or manufacturing environment (e.g., Failure Analysis, Root Cause Analysis, or RMA processes).

Preferred qualifications:

  • Master's degree or PhD in Electrical Engineering, Computer Engineering or Computer Science, with an emphasis on computer architecture.
  • Experience in Chiplets and High power devices.
  • Experience in data analytics to identify commonalities and abnormalities.
  • Experience in semiconductor reliability and manufacturing processes (fab, assembly, test), or IC and packaging failure mechanisms and related failure analysis.
  • Knowledge of Design-for-Reliability guidelines and implementation techniques.
  • Familiarity with test methods and hardware for silicon qualification (e.g., HTOL chambers, ESD, LU).

About the job

In this role, you’ll work to shape the future of AI/ML hardware acceleration. You will have an opportunity to drive cutting-edge TPU (Tensor Processing Unit) technology that powers Google's most demanding AI/ML applications. You’ll be part of a team that pushes boundaries, developing custom silicon solutions that power the future of Google's TPU. You'll contribute to the innovation behind products loved by millions worldwide, and leverage your design and verification expertise to verify complex digital designs, with a specific focus on TPU architecture and its integration within AI/ML-driven systems.

As a Quality and Reliability Engineer for Google Cloud, you will lead the development of Design-for-Reliability guidelines and drive the adoption of advanced technologies to optimize silicon production and reliability. You will be responsible for ensuring that High Performance Computing (HPC) SOC products meet stringent quality requirements by collaborating across design, manufacturing, and hardware teams to execute comprehensive test plans. Additionally, you will own the cross-functional investigation and root-cause analysis of integrated circuit (IC) issues to develop effective solutions in a production environment.

The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide.

We're the driving team behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.

The US base salary range for this full-time position is $183,000-$271,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Responsibilities

  • Own development of Design-for-Reliability guidelines, collaborating with subject area experts (e.g., SER, EMIR, PERC, HVDRC, Margining, etc.).
  • Facilitate technology adoption to optimize production and reliability (embedded sensors, in-field monitor/debug, etc.).
  • Collaborate with design, manufacturing, silicon engineering, and hardware/component quality teams to ensure High Performance Computing (HPC) SOC silicon products meet quality and reliability requirements (Mission profile, DPPM/FIT, Aging, etc.).
  • Partner with cross-functional organizations to design and execute quality and reliability test plans (HTOL, ELFR, ESD/LU, b/HAST, THB, etc.) and production Reliability methods (HVS and other methods).
  • Own cross-functional investigation of IC quality and reliability issues to identify root causes and develop solutions (RMA Triage, Analytics, Failure Analysis, etc.).

Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.

Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.

If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.

To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.

Google apps
Main menu