Sr. Linux System Administrator (HPC/Dev Ops)

Estados Unidos, California, South San Francisco

en de fr ru tr it pt zh ja

As part of the Roche Science Infrastructure operations team, we are looking for an infrastructure engineer to support our High Performance Computing (HPC) environment. You will manage Agile computing environments in several locations supporting our Research organizations to enable science in Roche. This includes installing, configuring, administering, and fine-tuning our High Performance Computing environment and related science IT infrastructure components across the organization in a timely, cost-effective and efficient manner.

Responsibilities:

  • Ensures installation, configuration and operation of the environments 

  • Assist in the design of the new HPC infrastructure and other scientific infrastructure services, as well as maintaining them.

  • Contributes to the concept, planning and execution of projects.

Who you are

You’re someone who wants to influence your own development. You’re looking for a company where you have the opportunity to pursue your interests across functions and geographies. Where a job title is not considered the final definition of who you are, but the starting point.

Qualifications:

  • Bachelor’s degree in Computer Science or equivalent work experience

  • Must have Linux Administration and scripting experience (other programming language a plus) and HPC

  • Senior level technical operational skills, such as troubleshooting, capacity planning, and root cause analysis

  • Linux System Administration and work related experience

  • 3 + years of experience and knowledge of HPC

  • Configuration management (GIT/Stash, Puppet, and basic Bright knowledge)

  • GPFS experience (maintenance, upgrades,...)

  • Schedulers (LSF)

  • Containers: Kubernetes

  • Monitoring: Ganglia, Grafana, ELK, Influxdb

  • Provisioning: Katello/ Foreman, Jenkins

  • Network: Mellanox

Also,

  • Excellent customer orientation and delivery focus with good end user perspective

  • Experience working in a fast changing environment where solutions are deployed and retired at high pace

  • Demonstrated problem solving skills

  • Good communication and interpersonal skills

  • Ability to work effectively alone or within a team, including virtual teams

  • Proactivity, with a clear ability to think beyond boundaries, take controlled risks and assume responsibilities

  • Working experience in pharmaceutical or scientific/research sector is a plus

  • Experience working in a global organization, working in an international and multicultural environment considered a plus

Advanced knowledge/consideration:

  • Storage knowledge: NAS, Object Storage (e.g. StorageGrid) and S3 storage

  • Data Lifecycle Management

  • AWS; Azure