Protege

Applied Data Scientist

Sorry, this job was removed at 02:55 p.m. (PST) on Thursday, Apr 03, 2025
Remote

Company Overview:

We are building Protege to solve the biggest unmet need in AI: getting access to the right training data. The process today is time-intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data, starting in the healthcare industry.

Solving AI's data problem is a generational opportunity. The company that succeeds will be one of the largest in AI — and in tech.

Summary

The Applied Data Scientist bridges the gap between our data assets and our customers' needs in our healthcare vertical. They play a key role in ensuring our datasets are well-matched to the AI models our customers are building and well-understood by those customers. This role requires healthcare data expertise, extensive experience with statistical analysis, and some customer collaboration.

We are open to hiring someone for this role on a part-time, temp-to-hire, or full-time basis. Part-time would require at least 20 hours per week.

Responsibilities

  • Data Analysis: Conduct feasibility analyses by querying healthcare datasets to assess patient cohort availability based on complex inclusion/exclusion criteria (e.g., procedures, diagnoses, diversity, longitudinal completeness, regulatory constraints).

  • Trade-off Assessments: Assess privacy-preservation techniques to maximize dataset utility while protecting patient privacy.

  • Customer Collaboration: Work directly with prospective customers to understand their data requirements and help curate the best data assets for their use cases.

  • Data Strategy: Identify gaps in our data offerings and provide insights to our partnerships team on the highest-priority data acquisitions.

  • Data Quality Assurance: Evaluate potential data partnerships, ensuring the data is high-quality, well-documented, and commercially viable.

Technical Skill Set

  • Data Expertise: Experience working with healthcare/medical datasets (some combination of imaging, EHR, genomic, claims, and pathology data), as well as comfort with SQL, R, and/or Python for data analysis. The bigger the datasets you have worked with, the better!

  • Longitudinal & Cohort Analysis: Ability to evaluate datasets for completeness over time, ensuring sufficient patient follow-up and retention for model training.

  • Diversity & Bias Mitigation: Knowledge of techniques to assess and improve dataset diversity across demographics, geographies, and clinical subpopulations.

  • Privacy-Preserving Technologies: Familiarity with de-identification techniques such as Safe Harbor and Expert Determination.

Qualifications

  • 2+ years of experience in a health data role (e.g., biomedical informatics, computational biology, AI/ML in healthcare) or equivalent experience (e.g., a Ph.D. or Master's in healthcare economics, statistics, or data science with a healthcare focus).

  • Excellent communication skills with the ability to translate complex data concepts.

  • Proficiency in Snowflake and at least one language for statistical analysis (SQL, R, or Python), including writing complex queries and working with large datasets.

  • Experience in a customer-facing role preferred.
