By continuing to use our site, you consent to the processing of cookies, user data (location information, type and version of the OS, the type and version of the browser, the type of device and the resolution of its screen, the source of where the user came from, from which site or for what advertisement, language OS and Browser, which pages are opened and to which buttons the user presses, ip-address) for the purpose of site functioning, retargeting and statistical surveys and reviews. If you do not want your data to be processed, please leave the site.

Replica Analytics Careers

Data Scientist

Replica Analytics is recruiting for a data scientist to join a fast-growing startup. Working as part of the data science team, this role involves coordinating with external partners and clients on data synthesis projects as well as researching and implementing improvements to existing data synthesis pipelines.

Key Responsibilities

  • Client Projects

    • Maintain and improve existing production and quality control pipelines for synthetic data deliverables
    • Communicate and coordinate with clients on data synthesis deliveries
    • Participate in client education on data synthesis technologies

  • Research & Development
    • Contribute to the development of new technologies for data synthesis using a wide variety of machine learning methods; investigate various research topics in machine learning and statistics to determine the best method for data synthesis
    • Contribute to the implementation and testing of production and research pipelines in Python and R as well as other languages
    • Contribute to the dissemination of research results in the form of peer-reviewed papers, reports, and presentations

Minimal Requirements

  • BSc/MSc degree (or equivalent) in mathematics, statistics, computer science, or electrical engineering

  • Work experience: 1 year for MSc candidates / 2 years for BSc candidates

  • Demonstrated ability for conducting statistical and machine learning research (in the form of a thesis, publications, or side projects)

  • Proficient in Python or R programming for data science (data cleaning/pre-processing, classification and regression, model evaluation, data visualization, writing and applying custom functions, parallelization)

  • Excellent organizational and communication skills (verbal and oral)

  • Detail-oriented

  • Motivated to learn and apply new machine learning methods to solve real-life problems

Optional Requirements

  • Deep learning experience with PyTorch or TensorFlow
  • Experience working with health care data

About Replica Analytics

Replica Analytics develops software for generating synthetic data that maintains the statistical properties of real data. We enable easy, fast and effective access to high utility data that is made portable through data simulators.

Careers Contact:

If you are interested in this position with Replica Analytics, please send an email to with your resume and contact information, and we will follow-up with you.