$100,000 - $120,000
Our client's advanced analytics team provides productized and semi-custom data and analytical solutions to some of the largest biopharmaceutical and medical technology companies in the world. Their team is composed of data scientists, social scientists, and software engineers. Their work involves adapting computational sociology, data science, and product design to serve clients’ needs. Combining these methodologies, they assist healthcare companies with their marketing, sales, and human resources objectives. With access to one of the largest healthcare datasets in the world and a rapidly growing market of clients, they are helping transform healthcare.
Responsibilities: Design pipelines that extract data from health claims, EHR and other databases, transform/clean them via complex algorithms, integrate them with other data sets, and store them.
Write and edit automated and semi-automated scripts for data analysis and manipulation.
Model semi-hierarchal healthcare data at multiple levels with formal knowledge modeling and entity schemes
Perform hybrid qualitative-quantitative work to evaluate data sources, data pipelines, knowledge modeling schemes, and to recommend next steps
Collaborate with team members to build project plans and timelines.
Transform general directives into concrete, actionable tasks, procedures, and outputs.
Help supervise training of junior team members.
Requirements: Project self-management; proven ability to execute complex, open-ended projects with little supervision.
Teamwork, working with a close-knit group of other data scientists and developers.
Software development, with accompanying proficiency in a language - options include Python, Scala or Java.
General-purpose scripting, with accompanying proficiency in a language: Python, Perl, Ruby or Bash.
OS-level Shell, with accompanying proficiency in the shell language - options include Bash OR C-Shell OR Z-Shell OR Powershell.
SQL - options include SQL Server OR MySQL OR PostgresSQL.
Software development best practices, including accompanying tools such as task tracking and version control systems.
Distributing computing frameworks - options include Spark OR Hadoop MapReduce a plus
Cloud computing infrastructure - options include AWS w/S3 OR Azure a plus
Distributed data storage/database tools a plus.
Comprehensive text editor environments such as Emacs.
NO VISA's, NO Sponsorship