About The Role
The system validation engineer role will focus on system-level design validation testing (DVT) and system integration validation. Nobody in the world has direct experience with our system. We are therefore looking for extraordinary individuals with outstanding track records for debugging, solving problems, and taking a product successfully to production. You will have the rewarding opportunity to be part of a team that works on state-of the art technology being applied to important problems such as Covid-19 research.
Responsibilities
- Work with the Design team to define system-level design validation testing (DVT) plan covering voltage, environmental, and cooling parameters, execute the test plan, and document the report
- Define the board and subassembly-level requirements for DVT teams and work with the Operations team to prepare components for building the DVT systems
- Determine software requirements for performing system-level DVT and work with system software teams to implement the tests
- Own the readiness of DVT chambers and external power and cooling infrastructure either through in-house development or working with external labs
- Maintain a set of stable hardware systems for software release four-corner regression testing
- Work with the Design team and System Software team to debug issues exposed in DVT and drive resolution
Skills & Qualifications
- Experience defining DVT and system integration test plans and writing test reports for high performance compute systems, covering electrical, mechanical, and thermal aspects
- Experience writing test scripts with Python, shell, and bash in Linux environment and effectively logging and analyzing a large amount of test results
- Excellent communication, planning, and coordination skills across Systems, Operations, and Software teams
- Familiarity with compute server architecture, high speed IO and system management IO interfaces, and power delivery
- Familiarity with liquid cooling and high voltage AC circuits
- Familiarity with test equipment such as oscilloscopes, protocol analyzers, time domain reflectometers, etc.
- Experience debugging system-level issues a plus
Cerebras Systems is committed to creating an equal and diverse environment and is proud to be an equal opportunity employer. We celebrate different backgrounds, perspectives, and skills. We believe inclusive teams build better products and companies. We try every day to build a work environment that empowers people to do their best work through continuous learning, growth and support of those around them.
This website or its third-party tools process personal data. For more details, click here to review our CCPA disclosure notice.
Cerebras Systems has pioneered a groundbreaking chip and system that revolutionizes deep learning applications. Our system empowers ML researchers to achieve unprecedented speeds in training and inference workloads, propelling AI innovation to new horizons.
The Condor Galaxy 1 (CG-1), unveiled in a recent announcement, stands as a testament to Cerebras' commitment to pushing the boundaries of AI computing. With a staggering 4 ExaFLOP processing power, 54 million cores, and 64-node architecture, the CG-1 is the first of nine powerful supercomputers to be built and operated through an exclusive partnership between Cerebras and G42. This strategic collaboration aims to redefine the possibilities of AI by creating a network of interconnected supercomputers that will collectively deliver a mind-boggling 36 ExaFLOPS of AI compute power upon completion in 2024.
Cerebras is building a team of exceptional people to work together on big problems. Join us!