Founding AI Engineer at ZeroEntropy (W25)$120K - $200K • 0.50% - 1.50%
High accuracy search API over unstructured data.
About ZeroEntropy
ZeroEntropy is building next-generation technology for information retrieval over complex unstructured data.
We are a team of engineers and scientists from Berkeley, CMU, Ecole Polytechnique, and have become experts at information retrieval technologies.
About the role
ZeroEntropy is building the next-generation retrieval engine for AI systems. We’re rethinking search from the ground up: faster, more accurate, and built to serve as infrastructure for the next decade of AI.
As a Founding AI Engineer, you’ll work across research and engineering to design, train, and optimize machine learning systems that push the limits of what’s possible in performance-critical environments.
This is a hands-on role for someone who thrives in ambiguity, understands both the math and the machine, and wants to build core technology, not just use it. You’ll join an early, elite team where your ideas will directly shape the product and architecture.
This job is for you if:
- You’ve trained and deployed large models in production and debugged the weird edge cases.
- You’ve implemented research papers from scratch and made them faster, cleaner, and more accurate.
- You care as much about the quality of the data pipeline as the model itself.
- You’re comfortable designing experiments, interpreting noisy graphs, and making decisions under uncertainty.
- You love making beautiful, clean, type-safe code, with the goal of pure functional programming using algebraic data types, and can drop down to C++/CUDA when performance demands it.
- You understand distributed systems and what it takes to scale training and inference pipelines in the real world.
- You want to build a system from the ground up with minimal abstraction and maximum control.
Requirements:
- Deep experience with ML frameworks, experiment tracking, and distributed training.
- Strong foundation in math and CS fundamentals: linear algebra, probability, optimization, algorithms, data structures, and time complexity.
- Experience building and scaling robust data and training pipelines.
- Proficient in Python, with bonus points for C++, Rust, CUDA, or other performance-oriented tools.
- Comfortable with Linux, containers, and working close to the metal when needed.
- Bonus:
- Experience with model compression, quantization, or inference optimization.
- Background in information retrieval, NLP, or LLM internals.
- Familiarity with type-safe functional programming languages (e.g. OCaml, Haskell, SML).
About the role
- Based in San Francisco or willing to move there.
- Very competitive compensation, equity, and benefits.
Next Steps:
- In a quick sentence, write the most impressive thing you've ever done—feel free to brag!
- Sign up at https://dashboard.zeroentropy.dev/, and let us know how you would build this API from scratch in detail. We are not looking for GPT answers, we are looking for thoughtful responses on how you would build a state-of-the-art search engine to understand how you think and problem solve.
- Submit your response along with your resume as a PDF when applying.
Technology
We are looking for a highly skilled AI Developer to join us on our mission to build the future of search.
As the founding engineer, you'll shape the foundation of a high-impact technology with a scrappy but rigorous, and product-led growth mentality. You will have the autonomy to build, iterate quickly, experiment, and research at the forefront of AI technologies. Your curiosity to learn and growth mindset will help you excel as a hands-on builder who loves to roll up your sleeves and think out of the box to innovate on a core technology needed by all developers building in AI.
This job is for you if:
- You love to disassemble a C++/Rust program and trace through every single nanosecond of CPU usage in order to squeeze the very last drop of performance out of the machine.
- You know and understand Linux syscalls at a deep level, you look through glibc code, you can debug CUDA code.
- You understand theoretical foundations in mathematics and computer science, including combinatorics/number theory/abstract algebra.
- You love making beautiful, clean, type-safe code, with the goal of pure functional programming using algebraic data types; only mutating state where efficiency requires it.
Requirements:
- Extensive experience and proficiency in Rust/C/C++, Python, SML/OCaml, Assembly.
- Proven experience managing infrastructure with strong Linux, Bash, and SQL skills.
- Deep understanding of the tech stack down to the metal, including experience with assembly, EVM, graphics shaders, CUDA, or kernel work.
- Solid grasp of computer science fundamentals like algorithms, data structures, and time complexity, but also knowledge of functional programming concepts like map, filter, and reduce.
- Familiarity with software design principles and best practices.
- Experience or willingness to learn about scalability technologies like AWS/Azure, Docker, and Kubernetes.
- Desire to work with type-safe languages like Rust/Swift (algebraic data types, functional programming, etc).
What you will work on:
- Develop and implement robust features across our platform for high performance and responsiveness.
- Manage and optimize infrastructure using Linux, Bash, and SQL to ensure system scalability and reliability.
- Work autonomously on complex projects involving compilers, OS-level work, assembly, and more.
- Contribute to all phases of the development lifecycle, from research to deployment.
- Creating state-of-the-art ML models, including LLMs for information retrieval, rerankers, embedding models, etc.
- Creating scalable and low-latency infrastructure for a state-of-the-art search engine / database using Rust and k8s.
- Talking to customers to understand their needs, and improve on the technology to meet them.