Replica Analytics is recruiting a software developer to join our software engineering team. The company is a fast-growing start-up in machine learning and AI, developing technology for the life sciences and healthcare industry.
This position will contribute to the development of our suite of data synthesis software. In this role you will work with Data Scientists to scale theoretical and prototoype machine learning models out to production level implementations. While knowledge of machine learning techniques is considered a strong asset, this is primarily a software engineering role with a focus on robust software design for high throughput processing.
Necessary Requirements
- Science or Engineering background (e.g., Computer Science, Engineering, Math, or Physics)
- 2+ years experience programming in Python, Java or Scala
- 5+ years experience developing software for production systems
- 2+ years implementing high throughput data pipelines using message transports such as Kafka and RabbitMq
- 2+ years experience with machine learning frameworks such as scikit-learn, LightGBM, Tensorflow, PyTorch
- 2+ years scaling and optimizing high capacity processing workflows
- Experience porting computations to GPUs
- Experience with workflow managers such as Apache Airflow or Celery
- Experience with automated testing
- Experience writing internal (e.g., software design) and external (e.g., user guides) documentation for software
- Experience troubleshooting based on client feedback
Additional Requirements
-
Experience with statistical machine learning methods
- Experience with deep learning methods
- Some formal training in statistics
- Experience developing Cloud Native systems
- Experience developing APIs
- Experience with C++