Replica Analytics is recruiting a data scientist to join a fast-growing startup. As part of the data science team, the successful candidate will coordinate with external partners and clients on data synthesis projects, and will research and implement improvements to existing data synthesis pipelines.
Key Responsibilities
- Client Projects
  - Maintain and improve existing production and quality control pipelines for synthetic data deliverables
  - Communicate and coordinate with clients on data synthesis deliveries
  - Participate in client education on data synthesis technologies
- Research & Development
  - Contribute to the development of new technologies for data synthesis using a wide variety of machine learning methods; investigate various research topics in machine learning and statistics to determine the best method for data synthesis
  - Contribute to the implementation and testing of production and research pipelines in Python and R as well as other languages
  - Contribute to the dissemination of research results in the form of peer-reviewed papers, reports, and presentations
Minimum Requirements
- BSc/MSc degree (or equivalent) in mathematics, statistics, computer science, or electrical engineering
- Work experience: 1 year for MSc candidates / 2 years for BSc candidates
- Demonstrated ability to conduct statistical and machine learning research (in the form of a thesis, publications, or side projects)
- Proficiency in Python or R programming for data science (data cleaning/pre-processing, classification and regression, model evaluation, data visualization, writing and applying custom functions, parallelization)
- Excellent organizational and communication skills (written and verbal)
- Detail-oriented
- Motivated to learn and apply new machine learning methods to solve real-life problems
Preferred Qualifications
- Deep learning experience with PyTorch or TensorFlow
- Experience working with health care data