Senior Data Scientist
Fully Remote Remote Worker - N/A
Description

Company Overview

Homethrive was born from personal experience. Our founders grappled with the overwhelming challenges of caregiving for family members while balancing their work lives. The journey was fraught with confusion, a myriad of unanswered questions, and countless hours delving into endless online searches. After taking numerous days off and spending extended hours on the phone, the answers remained elusive. They recognized the need for a streamlined, more efficient solution. Enter Homethrive. Our mission is to revolutionize family caregiving by delivering superior outcomes for caregivers, their loved ones, and health plans alike. At the heart of our service is the Homethrive personal caregiving coach and assistant — an all-in-one interactive tool that offers the expertise, recommendations, and support our members deserve.


Our Mission

At Homethrive, we are revolutionizing family caregiving to achieve better outcomes for caregivers, their loved ones, and their employers. Our innovative platform offers a personal caregiving coach and assistant that provides comprehensive knowledge, tailored recommendations, and ongoing support, all within a single interactive tool.


Leadership and Growth

Our leadership team comprises seasoned industry veterans with a proven track record of building multi-billion dollar enterprises. Backed by prominent healthcare venture capital funds, Homethrive is well-positioned for significant growth. As we continue to expand, we are seeking talented individuals to join our world-class team and contribute to our mission of transforming family caregiving.


Location

Homethrive is a remote-first culture, with headquarters in Chicago, IL.


Job Overview

We are seeking a Senior Data Science Engineer to join our innovative health tech startup. This individual will play a critical role in developing and deploying cutting-edge AI solutions using Large Language Models (LLMs), vector databases, Retrieval-Augmented Generation (RAG), and robust productionization pipelines. The ideal candidate has a strong background in data science, machine learning engineering, and AI-driven chatbot systems with a focus on guardrails and safety mechanisms.


This role will directly impact how we leverage AI to innovative caregiving support, and ensure compliance with strict healthcare regulations. 


Key Responsibilities


AI & LLM Development:

  • Build and fine-tune large-scale LLM-based models tailored to healthcare use cases.
  • Develop efficient RAG pipelines to ensure real-time, accurate retrieval of domain-specific data. 

 Vector Database Management:

  • Architect and optimize vector search systems to manage embeddings and support intelligent queries.
  • Integrate vector databases like Pinecone, Weaviate, or FAISS with healthcare datasets.

 AI Productionization:

  • Design scalable solutions for deploying AI systems in production environments.
  • Monitor and enhance model performance, latency, and robustness post-deployment.

 Chatbot Development & Guardrails:

  • Develop AI-powered chatbots to improve patient and provider interactions.
  • Implement guardrails (e.g., ethical AI, hallucination mitigation, compliance protocols) to ensure safety and reliability. 

 Data Engineering & Compliance:

  • Work closely with data engineers to preprocess, clean, and secure sensitive healthcare data.
  • Ensure AI solutions adhere to healthcare standards like HIPAA and SOC2.

 Collaboration:

  • Collaborate with product, engineering, and clinical teams to align AI capabilities with business objectives.
  • Actively participate in team discussions to identify opportunities for AI innovation.
Requirements

Required Qualifications

  • Educational Background:
    • Bachelor’s or Master’s in Computer Science, Data Science, Machine Learning, or related field. PhD is a plus.
  • Technical Skills:
    • Proficient in Python and ML frameworks (e.g., TensorFlow, PyTorch).
    • Hands-on experience with LLMs like GPT, BERT, or similar.
    • Expertise in vector database technologies (e.g., Pinecone, Milvus, or Weaviate).
    • Familiarity with RAG workflows and conversational AI development.
  • Experience:
    • 7+ years in data science, machine learning, or AI engineering.
    • Demonstrated success in deploying AI models in production.

    Strong knowledge of designing and implementing guardrails for AI


    Preferred Qualifications

  • Knowledge of scalable infrastructure (e.g., AWS Lambda, Docker, ECS).
  • Experience with natural language processing in clinical or healthcare domains.
  • Strong communication skills to explain complex AI concepts to non-technical stakeholders.

 

EEO

Homethrive is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.


Homethrive provides equal employment opportunities to all employees and applicants without regard to race, color, religion, sex (including sex stereotyping), national origin, ancestry, citizenship status, pregnancy (which included pregnancy, childbirth, and medical conditions related to pregnancy, childbirth, or breastfeeding), physical disability, mental disability, age, military status or status as a Vietnam-era or special disabled veteran, marital status, registered domestic partner status, gender, gender identity, gender expression, medical condition (including, but not limited to, cancer-related or HIV/AIDS-related), genetic information, sexual orientation, or any other status protected by applicable federal, state, and local laws.