MorphLLM is building Fast Apply models - get changes from Claude/Gemini into your code FAST
Morph builds the fastest LLM code editing inference engine in the world — we hit 10,500 tok/sec PER REQUEST, all on Nvidia hardware.
Our stack powers high-throughput AI workflows for vibe coding apps, devtools, PR bots, and IDEs.
We're hiring a founding engineer to push the limits of performance, safety, and scalability across our inference, retrieval, and diffing pipelines.
Apply:
Describe the machine learning project you're most proud of. Please go into extreme technical detail. We’re familiar with all the libraries.
Describe what you were or are deeply obsessed about (anything)
Nvidia, CUDA, FastAPI
ML algorithms
internSan Francisco, CA, USMachine learning$6K - $10K / monthlyJunior and above
fulltimeSan Francisco, CA, USMachine learning$100K - $150K1.00% - 5.00%3+ years
fulltimeSan Francisco, CA, US$75K - $110K1.00%1+ years