AI Engineer at Zep AI (W24)
$180K - $230K  •  1.00%
The Memory Foundation For Your AI Stack
San Francisco, CA, US / Remote (US)
Full-time
3+ years
About Zep AI

Zep is building the long-term memory layer for the LLM application stack. Zep's open source project has seen 2K+ GitHub stars and 400K+ installs. Zep is in use at both enterprises and startups, such as Mattel, WebMD, and Athena Intelligence.

About the role
Skills: Google Cloud, Machine Learning, Microsoft Azure, Amazon Web Services (AWS)

About the Role

Zep is building the long-term memory layer for the LLM application stack. We have a large and active open-source community and recently launched our cloud service. We’re seeking an experienced AI Engineer to join our startup. As a critical member of our small, fast-paced team, you will design, implement, and maintain our Go-based APIs.

We are a remote-first organization. Zep is funded by YC, Engineering Capital, and angels such as Guillermo Rauch (Vercel).

Key Responsibilities:

  • Lead the development, implementation, and optimization of LLM-based AI solutions
  • Design and execute training strategies for LLMs, including data preparation and model fine-tuning
  • Develop and implement rigorous testing and validation protocols for AI models
  • Collaborate with cross-functional teams to integrate AI solutions into our products
  • Stay current with the latest advancements in AI and LLM technologies
  • Make critical decisions regarding AI architecture, model selection, and implementation strategies
  • Manage end-to-end project lifecycles with minimal supervision
  • Contribute to the company's AI strategy and vision

Required Qualifications:

  • Bachelor's or Master's degree in Computer Science, AI, Machine Learning, or a related field
  • Minimum 12-months experience advanced work with LLMs: agentic applications, advanced RAG, structured output, and more
  • Minimum of 5 years of hands-on experience with machine learning, including training, testing, deploying, and validation
  • Strong programming skills in Python and familiarity with deep learning frameworks such as PyTorch or TensorFlow
  • Extensive knowledge of natural language processing techniques and architectures
  • Experience with cloud computing platforms (e.g., AWS, GCP, Azure) for AI model deployment
  • Comfortable with ambiguity and thrives in a fast-paced, evolving environment

Nice to Have

  • Contributions to open-source projects, particularly LLM-related
  • Experience in REST API development, NoSQL database design, and RDBMS design and optimizations
  • Experience with Go or TypeScript

Benefits

  • Directly impact the development of the future LLM application stack
  • Competitive salary and equity compensation
  • Flexible work hours and remote work options
  • Health, dental, and vision insurance
  • Opportunities for professional growth and development
  • Collaborative and inclusive work environment
Technology

Low-latency inference is key for our offering. We use custom, fine-tuned models running in our own VPCs to power features such as real-time classification, and fact and schema-ed data extraction.

Our API is built in Go. Our web app in TypeScript and Svelte, and we offer Python, TypeScript, and Go SDKs.

Other jobs at Zep AI

fulltimeSan Francisco, CA, US / Remote (US; CA)Backend$125K - $200K1.00%3+ years

fulltimeSan Francisco, CA, US / Remote (US)Machine learning$180K - $230K1.00%3+ years

fulltimeRemote (San Francisco, CA, US)Full stack$125K - $200K1.00%3+ years

Hundreds of YC startups are hiring on Work at a Startup.

Sign up to see more ›