Job Detail

Data Scientist for the Data Science & Research Team

Posted on: 12.11.2018


The Lausanne University Hospital (CHUV) is inviting applications for the position of Data Scientist in the Data Science & Research team.

Department: Département infrastructures (DINF)
Job code: Statistician (Statisticien-ne) - 478011
Level: 11
Desired start date: 01-03-2019
Contract type: CDI (permanent contract)
Professional category: Administration
Location: Lausanne
Activity rate: 100%
Publication start date: 12-11-2018
Application deadline: 07-12-2018
Reference: 03630-AD-102-2018

Context


The CHUV is a key player in medical care and biomedical research both at national and international levels.


The mission of the Data Science & Research (DS&R) team, part of the CHUV Department of Information Systems, is to foster the adoption and use of data science within the hospital to significantly improve biomedical research and key hospital processes.


Every day, a massive volume of clinical data is generated within the hospital and made available on the analytics platform. To make use of this data, the DS&R team is developing the analytics platform to run data science projects in clinical research with physicians and scientists. Our current and future challenges lie at the intersection of big data, medical informatics, data protection and artificial intelligence.


The DS&R team, based in Lausanne, is composed of data scientists, data analysts, project managers and data protection experts, and is now looking for a skilled Data Scientist.


Mission


You are passionate about applying data science to enable data-driven decision-making, finding patterns in data and understanding trends in large datasets. Excited to work in the medical field, you will be responsible for creating machine learning solutions to optimize hospital processes and solve clinical research challenges, working closely with physicians and scientists:



  • Designing, implementing and testing end-to-end data processing pipelines to generate insights out of medical data. This includes formalizing user specifications, feature extraction, designing custom machine learning models, implementing them in the analytical platform and maintaining them

  • Coordinating external data scientist consultants to enable the team to run multiple projects in parallel

  • Working with structured and unstructured text to improve patient data de-identification and extract meaningful information for clinical research using NLP techniques.


Profile


Through 2 to 4 years of prior experience, you have acquired demonstrable programming skills and a good track record of building machine learning models that you have applied to large real-world datasets. In particular, you are familiar with the following data science techniques:



  • Experience creating novel machine learning models, evaluating and comparing them in a reproducible setting

  • Data cleaning, feature extraction and selection techniques such as creating ETL pipelines, unstructured data preprocessing, dimensionality reduction, pattern recognition, etc.

  • Excellent understanding of supervised and unsupervised state-of-the-art machine learning algorithms for clustering, novelty/anomaly detection, classification and regression

  • Excellent understanding of NLP concepts and techniques such as word2vec, n-grams, TF-IDF, ontologies, named-entity recognition and linking, information retrieval, etc.

  • Experience building deep learning architectures using RNNs and CNNs, and applying them to several data types such as text and images

  • Data visualization techniques

  • Strong mathematical skills and working knowledge of statistical concepts.


This position requires an MSc in Computer Science (a PhD is a plus) and experience in the following:



  • Highly skilled in Python and good experience with at least one other object-oriented programming language such as Scala or Java

  • Proficiency in using machine learning and scientific packages like Numpy, Scipy, Pandas, Scikit-Learn, XGBoost, LightGBM, CatBoost, Eli5, etc.

  • Proficiency in using NLP libraries like NLTK, Gensim, Spacy, Stanford CoreNLP and regular expressions

  • Excellent experience in at least one deep learning framework like Keras, Tensorflow, Theano, PyTorch, etc.

  • Good experience with visualization tools like Seaborn, Bokeh, Plotly, matplotlib, etc.

  • Good experience with Linux and shell scripting

  • Good experience with Agile software development as a team

  • Familiarity with search engines such as the Elastic stack or Solr

  • Some experience with SQL/NoSQL databases and concepts

  • Ability to manage multiple initiatives and resources in parallel

  • Good analytical and problem-solving skills

  • Ability to work with third-party RESTful APIs

  • Ability to work with and maintain a professional code base under version control (Git).


Moreover, you foster a can-do attitude and work well in a cross-functional environment. You excel in communicating with team members, physicians and scientists.


You have an excellent command of the English language, both spoken and written. Please note that a good working knowledge of French is also mandatory to help you interact with end-users and peers.


We offer


Becoming an employee of the Centre hospitalier universitaire vaudois means being assured of:



  • First-rate social benefits

  • An entitlement to a minimum of three days of training per year

  • 25 days of vacation per year

  • Hotel-quality staff restaurants in each of the institution's buildings.


Contact and application


Contact for further information: Mrs Nathalie Jacquemont, Nathalie.jacquemont@chuv.ch


Applications should be submitted via our electronic form only (please click the "postuler" (apply) button at the bottom of this vacancy announcement). If for technical reasons you are unable to apply online, please contact the recruitment team at 021 314 85 70.