Staff Software Engineer, Gemini Evals, GenAI, DeepMind
- linkCopy link
- emailEmail a friend
Minimum qualifications:
- Bachelor’s degree in Computer Science, Electrical Engineering, or a related technical field or equivalent practical experience.
- 8 years of experience in software development.
Preferred qualifications:
- Experience in designing, building, and maintaining high-performance distributed systems or processing pipelines.
- Experience leading architectural migrations or cross-team infrastructure projects.
- Proficiency in Python.
About the job
At Google DeepMind our mission is to build the world's first general-purpose learning agent. Central to this mission is the complex task of measuring the intelligence of our prototypes. As a Software Engineer, you will be working with the cutting edge AI agents developed by our exceptional team of Machine Learning and Neuroscience research scientists. Your responsibilities will include everything from creating systems for agent testing using 2D and 3D games to developing test problems within physics simulators. You will create graphical visualization of results, build competitive agent leaderboards and test new algorithms on robots. To succeed in this role you will need to have a strong foundation in software engineering and enjoy working on a wide range of challenging problems within a mission-driven team.Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.
Responsibilities
- Design and optimize distributed evaluation execution engines capable of orchestrating large volumes of inference steps across TPU and Google compute unit (GCU) pools with high throughput and low latency.
- Build foundational abstractions to evaluate complex LLM agent loops, tool use, and automated LLM-as-a-judge rating systems.
- Design error classification, automated retry policies, and observability dashboards to maintain strict service level objective (SLOs) for evaluation pipeline success rates.
- Partner closely with GDM research scientists and Data Science teams to anticipate frontier model evaluation requirements and translate them into elegant infrastructure solutions.
- Mentor fellow engineers, set high standards for code quality (Python in Google3), and advocate testing and system design practices.
Information collected and processed as part of your Google Careers profile, and any job applications you choose to submit is subject to Google's Applicant and Candidate Privacy Policy.
Google is proud to be an equal opportunity and affirmative action employer. We are committed to building a workforce that is representative of the users we serve, creating a culture of belonging, and providing an equal employment opportunity regardless of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition (including breastfeeding), expecting or parents-to-be, criminal histories consistent with legal requirements, or any other basis protected by law. See also Google's EEO Policy, Know your rights: workplace discrimination is illegal, Belonging at Google, and How we hire.
If you have a need that requires accommodation, please let us know by completing our Accommodations for Applicants form.
Google is a global company and, in order to facilitate efficient collaboration and communication globally, English proficiency is a requirement for all roles unless stated otherwise in the job posting.
To all recruitment agencies: Google does not accept agency resumes. Please do not forward resumes to our jobs alias, Google employees, or any other organization location. Google is not responsible for any fees related to unsolicited resumes.