Advanced Analytics Network - Data Science Intern

United States of America, California, Santa Clara
United States of America, California, Belmont
United States of America, California, Pleasanton

Roche is recruiting five Advanced Analytics Data Science interns for internships of 3 to 6 months, depending on the candidate's availability and business needs. Ideal candidates will begin their internship in May 2020; however, flexible start and end dates can be accommodated.

Our interns will join various analytics teams across the organization and work on an innovative project applying advanced analytics and machine learning approaches to the company’s real-world data (e.g., electronic medical records, insurance claims, medical transcripts) as well as its genomic, diagnostic, imaging and clinical trial data assets spanning multiple disease areas.

We are looking for individuals who:

  • Are creative problem solvers and quick learners who are comfortable experimenting with new approaches

  • Demonstrate high productivity and enjoy dealing with ambiguity and applying novel methodologies

  • Possess an entrepreneurial spirit, passion and curiosity for understanding and interrogating complex data

Responsibilities:

  • Collaborate with the host team and other stakeholders to evaluate potential machine learning techniques and applications

  • Design, build and interpret machine learning algorithms to address selected research questions (including preparing the input data)

  • Proactively share learnings and knowledge to support the development of the wider Roche Advanced Analytics community

  • Help shape the direction of machine learning and artificial intelligence within Roche

Experience and Competencies Preferred:

  • Knowledge of a wide range of machine learning techniques and applications

  • Experience applying machine learning algorithms and techniques, preferably to healthcare data

  • Experience with technologies required to undertake analyses on large data sources or with computationally intensive steps (SQL, parallelization, Hadoop, Spark, HPC cluster computing, Docker)

  • Experience with imaging analysis would be beneficial

  • Fluency in statistical programming languages (R, Python, etc.)

  • Strong communication and collaboration skills

  • Experience implementing reproducible research practices such as version control (e.g., using Git) and literate programming

  • Demonstrated contributions to open source packages, libraries or functions

Qualifications:

  • Bachelor's, Master's or PhD candidate or recent graduate in a Data Science-related field (e.g., Statistics/Biostatistics, Mathematics, Epidemiology, Health Economics, Outcomes Research, Computer Science, Health Services Research, EE/Biomedical Engineering or related disciplines)

  • Hands-on research experience involving study design, statistical analysis and machine learning techniques in the context of healthcare

  • Proficiency with R, Python and/or MATLAB, and with other image analysis and data analysis tools

  • Excellent communication and presentation skills

  • Able to work independently but also comfortable in a collaborative environment

  • Proficiency with UI technologies (AngularJS / React) and server-side technologies such as Java, Python or Go

  • Linear modelling and logistic regression

Preferred Qualifications:

  • Experience in using claims data

  • Graph theory

  • Survival analysis

  • Imaging analysis