Software Engineer Intern at Bluejay (X25)
$5.2K - $10.4K / monthly
The world's first quality assurance agency for voice AI.
San Francisco, CA, US
Internship
US citizen/visa only
About Bluejay

Bluejay is the end-to-end testing platform for conversational AI, with a strong focus on voice. We serve customers ranging from very large enterprises to the hottest startups and are backed by Y Combinator and other prominent investors and angels. Faraz and Rohan, former AI engineers at Microsoft and AWS founded the company. During Y Combinator, we grew to over $100K in ARR in under a month, and we now have deals with multiple Fortune 500 companies.

At Bluejay, we believe that trust is not a feature — it’s the foundation. In the future, the AI agent companies that win will be those we can trust: safe, reliable, and aligned with human intent. Our mission is to engineer trust into every AI interaction—whether it’s a voice agent answering a call or a multi-modal system handling sensitive data.

We’re setting the gold standard for trustworthy AI agent testing through three guiding principles:

Simulation is the New Standard. If your AI agent hasn’t been tested in a simulated environment, it hasn’t really been tested. We rigorously pressure-test every scenario, failure condition, and edge case before your agent reaches the real world.

Safety Isn’t Optional. Security, compliance, adversarial testing, and red teaming aren’t just boxes to check—they’re pillars of responsible AI development. We make it easy to proactively evaluate agents for failure modes before they cause harm.

Trust Demands Accountability. In a world where AI agents make decisions for us, we need impartial systems to evaluate their behavior. That’s why Bluejay exists as an independent, third-party arbiter—a standard of truth that companies, regulators, and end users can rely on. We aim to be the scoreboard, not the player.

Stop vibe testing. Quality is engineered. Bluejay is here to build the future of trustworthy AI.

About the role

What You’ll Do

  • Build the brains. Architect and develop systems that simulate, analyze, and evaluate conversational AI agents, including voice and multimodal systems.
  • Scale for the real world. Design resilient, scalable infrastructure using AWS (Lambda, EC2, WebSockets, WebRTC) to handle thousands of real-time conversations.
  • Engineer trust. Develop algorithms and pipelines that surface insights, detect failure modes, and ensure agents are safe, reliable, and aligned with human intent.
  • Collaborate deeply. Work closely with the founders, customers, and the entire team to shape product direction and technical architecture.

Who You Are

  • A builder who has designed and shipped projects in AI/ML, especially in NLP, LLMs, or conversational AI.
  • Comfortable owning end-to-end systems—from rapid prototyping to production-ready deployments.
  • Excited by the challenge of building highly scalable, real-time systems.
  • A creative problem-solver who loves tackling ambiguous technical challenges.
  • Entrepreneurial, scrappy, and energized by working on a small, fast-moving team.
  • Passionate about building technology that’s trustworthy and safe.

Why Bluejay

  • Join a rocket ship backed by Y Combinator and top-tier investors.
  • Shape the core technology and culture of the company.
  • Work on hard, fascinating problems at the frontier of AI safety and reliability.
  • Collaborate with a talented team who are former AI engineers from Microsoft and AWS.
  • Competitive compensation.

Other jobs at Bluejay

fulltimeSan Francisco, CA, USFull stack$120K - $170K0.20% - 1.00%Any (new grads ok)

internSan Francisco, CA, USFull stack$5.2K - $10.4K / monthlyAny

Hundreds of YC startups are hiring on Work at a Startup.

Sign up to see more ›