AI Infrastructure Engineer at StackAI (W23)
AI Agents for the Enterprise
SF Office - 171 2nd, 4th floor
Full-time
US citizen/visa only
About StackAI

Stack AI is a no-code drag-and-drop tool to quickly design, test, and deploy AI workflows that leverage Large Language Models (LLMs), such as ChatGPT, to automate any business process.

Our core value is to make it extremely easy to build arbitrarily complex AI pipelines using a visual interface that allows you to connect different data sources with different AI models.

Our customers use Stack AI to build applications such as:

  • Chatbots and Assistants: AI agents that interact with users, answer questions, and complete tasks, using your internal data and APIs.
  • Document Processing: apps to answer questions, summarize, and extract insights from any document, no matter how long.
  • Answer Questions on Databases: connect GPT-like models to databases (such as Notion, Airtable, or Postgres) and ask questions about them.
  • Content Creation: generate tags, summaries, and transfer styles or formats between documents and data sources.
About the role

About the Role

We’re hiring an AI Infrastructure Engineer to shape and scale the backend systems that power our AI platform. As a Series A company, your work will be foundational, enabling safe, efficient, and reliable AI workflows from end to end.

What You’ll Do

  • Design and implement scalable backend architectures for AI workloads (inference, orchestration, monitoring).

  • Own distributed job orchestration with Temporal and related systems.

  • Improve data pipeline performance by designing smarter caching strategies (e.g., file deduplication, hot/cold storage, Redis caching layers) to reduce redundant compute and API calls.

  • Build observability, monitoring, retries, and fault tolerance into all workflows.

  • Manage infrastructure reliability, incident response, and performance.

  • Develop tooling and platform infrastructure to support rapid growth.

  • Partner with ML engineers to bring models to production at scale.

What We’re Looking For

  • 4+ years of backend engineering (Python is a must).

  • Strong background in distributed systems, job orchestration, and task queues.

  • Deep knowledge of concurrency, parallelism, and multithreading—including async/await, event loops, thread pools, synchronization primitives, deadlocks, and race conditions—is a must. You should know how to design systems that maximize throughput without sacrificing correctness or safety.

  • Hands-on experience with Temporal, Redis, Airflow, Celery, RabbitMQ (or similar).

  • Experience with LLM serving and routing fundamentals (rate limiting, streaming, load balancing, budgets).

  • Comfortable with containers & orchestration: Docker, Kubernetes.

  • Familiarity with cloud platforms (AWS/GCP) and IaC (Terraform).

  • Experience with multiple storage systems: S3, Postgres, MongoDB, Redis, and Elasticsearch.

  • Track record scaling systems in startups or fast-paced environments.

  • Understanding of deploying, monitoring, and optimizing AI/ML systems in production with strong CI/CD practices.

Why You’ll Love Working Here

  • Play a foundational role at a fast-growing Series A startup that is shaping the future of AI in enterprise workflows.

  • Collaborate across Product, ML, and Platform teams, being the bridge between AI logic and scalable execution.

  • Build infrastructure that enables real value for large enterprises: low-code, secure, and scalable AI workflows.

  • Join a company that’s scaling thoughtfully and values developer experience.

Technology

Our tech stack includes:

  • Frontend: Next.js + Tailwind (Typescript)
  • Backend: FastAPI + Supabase (Python)
  • Databases: PostgreSQL + MongoDB

And we have internally built a super easy-to-use Machine Learning framework tailored to using Large Language Models in a flow-like sequence (akin to Pytorch + Langchain if you are familiar with those). It allows you to seamlessly integrate new functionality into the code base and we are also discussing whether to open-source it since it feels like magic!

Other jobs at StackAI

fulltimeSan Francisco, CA, US / New York, NY, US / RemoteBackend$150K - $250KAny (new grads ok)

fulltimeNew York, NY, US$100K - $170K3+ years

fulltimeSan Francisco, CA, US / New York, NY, US / RemoteFrontend$150K - $250K3+ years

fulltimeNYC Office - 1239 Broadway, Suite 1000 / Remote (US)Full stack

fulltimeNew York, NY, US / San Francisco, CA, US$75K - $120K1+ years

fulltimeNew York, NY, US / San Francisco, CA, US$100K - $200K1+ years

fulltimeSan Francisco, CA, US / New York, NY, US / RemoteMachine learning$150K - $250K3+ years

fulltimeSan Francisco, CA, US / New York, NY, USFull stack$120K - $160KAny (new grads ok)

fulltimeNew York, NY, US / San Francisco, CA, US$170K - $300K3+ years

fulltimeNYC Office - 1239 Broadway, Suite 1000 / Remote (US)

fulltimeSF Office - 171 2nd, 4th floorFull stack

fulltimeNew York, NY, US / San Francisco, CA, US / RemoteDevops$100K - $160K1+ years

fulltimeSF Office - 171 2nd, 4th floor

fulltimeSan Francisco, CA, US / RemoteFrontend$125K - $250K3+ years

Hundreds of YC startups are hiring on Work at a Startup.

Sign up to see more ›