Our mission is "One Human, One Doctor". We are creating superhuman doctors, because access to doctors is a basic human right.
Start with making doctors superhuman. Our vision is to eliminate doctor distractions and help them navigate the best treatments for their patients.
You will own the architecture, development, and continuous improvement of Sully.ai’s NLP models powering the Receptionist and Assistant agents. Working cross‑functionally with Product, Clinical, and Reliability Engineering, you’ll translate clinical workflows into robust conversational AI solutions that meet HIPAA‑level security and compliance requirements.
Architect NLP Pipelines. Design end‑to‑end pipelines for intent detection, entity extraction, and dialogue management using Hugging Face Transformers.
Fine‑Tune Transformer Models. Adapt state‑of‑the‑art architectures (e.g., BERT, GPT) on domain‑specific data to optimize receptionist and assistant workflows, leveraging prompt engineering and RAG techniques.
Define Evaluation Frameworks. Establish benchmarks (F1‑score, ROUGE, MMLU) and A/B test protocols to measure dialogue accuracy, user satisfaction, and model latency.
Deploy & Scale. Containerize models with Docker, serve via FastAPI, and orchestrate on Kubernetes for high availability and observability.
Lead MLOps Best Practices. Build CI/CD pipelines for model training, testing, and versioning; integrate monitoring and alerting for data drift and performance regressions.
Collaborate & Mentor. Partner with Product and Clinical teams to curate training data, refine user flows, and onboard new engineers into our NLP practice.
5+ years of software engineering experience, with 3+ years focused on ML/NLP in production settings.
Proficiency in Python and deep learning frameworks (PyTorch or TensorFlow), with hands‑on experience using Hugging Face Transformers.
Demonstrated success in fine‑tuning and deploying transformer‑based models for conversational AI or related NLP applications.
Experience building and scaling RESTful services with FastAPI, containerizing with Docker, and managing Kubernetes deployments.
Strong analytical skills and familiarity with evaluation metrics for NLP systems (F1, ROUGE, MMLU).
Excellent communication and collaboration skills in fast‑paced, cross‑functional teams.
Languages & Frameworks: Python, PyTorch/TensorFlow, Hugging Face Transformers.
APIs & Services: FastAPI, Docker, Kubernetes, CI/CD (GitHub Actions, Jenkins).
Cloud & Data: AWS/GCP/Azure, SQL/NoSQL databases.
AI & MLOps: Prompt engineering, RAG, model versioning, monitoring & alerting.
Prior experience in healthcare technology or familiarity with FHIR and HIPAA compliance.
Contributions to open‑source NLP projects or publications in top‑tier conferences.
Experience with retrieval‑augmented generation (RAG) and prompt‑tuning techniques
Why Join Sully.ai ?
🔥 Shape the Future of Healthcare: Build category-defining partnerships that enable doctors to focus on saving lives.
📈 Early-Stage Impact: Join early and play a critical role in shaping our partnership roadmap and overall company growth.
🌎 Remote-First Culture: Work with a talented, mission-driven team in a flexible, remote environment.
💰 Competitive Compensation: Enjoy a competitive salary, equity, and the opportunity to make a real difference.
🏆 Solve Scalability Challenges: Tackle complex challenges in a rapidly growing company, driving impactful change in healthcare.
Sully.ai is an equal opportunity employer. In addition to EEO being the law, it is a policy that is fully consistent with our principles. All qualified applicants will receive consideration for employment without regard to status as a protected veteran or a qualified individual with a disability, or other protected status such as race, religion, color, national origin, sex, sexual orientation, gender identity, genetic information, pregnancy or age. Sully.ai prohibits any form of workplace harassment.
fulltimeUS - Remote / Remote (US)Full stack
fulltimeNorth Carolina
fulltimeUS - Bay Area / Mountain View, CA, US / San Francisco, CA, US$150K - $175K6+ years
fulltimeUS - Remote / Remote (US)Full stack
fulltimeUS - Remote / Remote (US)$200K - $250K6+ years
fulltimeUS - Remote / Remote (US)
fulltimeUS - Remote / Remote (US)Full stack
fulltimeUS - Remote / Remote (US)Full stack
fulltimeUS - Remote / Remote (US)Full stack
fulltimeUS - Remote / Remote (US)Full stack
fulltimeUS - Remote / Remote (US)
contractUS - Remote / Mountain View, CA, US / Remote (US)QA engineer$140K - $165K6+ years
fulltimeUS - Remote / Remote (Mountain View, CA, US)Devops$170K - $200K6+ years
fulltimeUS - Remote / Remote (Mountain View, CA, US)$150K - $175K6+ years
fulltimeMountain View, CA, US / Remote (San Francisco, CA, US; Santa Clara, CA, US; Sunnyvale, CA, US; Palo Alto, CA, US; Oakland, CA, US; San Jose, CA, US; Seattle, WA, US; Los Angeles, CA, US; San Diego, CA, US; Las Vegas, NV, US; Phoenix, AZ, US; OR, US; US)Full stack$150K - $300K3+ years
fulltimeSanta Clara, CA, US / Remote (San Francisco, CA, US; US)$120K - $150K6+ years
fulltimeUS - Remote / Remote (US)$150K - $200K6+ years
fulltimeUS - Remote / Remote (US)$180K - $230K6+ years
fulltimeUS - Remote / Remote (US)Full stack