Summer 2026 - Full-Stack AI Engineer Intern at Cloudglue (S24)
$6.5K - $8.5K / monthly
Developer APIs to let your AI/LLM understand videos and audio.
San Francisco, CA, US / Remote (US)
Internship
US citizen/visa only
About Cloudglue
About the role

About Cloudglue

Cloudglue is building foundational infrastructure that enables AI to understand videos for the first time. Our APIs enable developers to add video search, multi-video chat, and get structured data extraction from video content - reliably and at scale - in just a few lines of code.

We work at the bleeding edge of multimodal AI: computer vision, audio processing, video understanding, agentic retrieval, and AI-first UI/UX to enable video AI capabilities that don’t exist today.

We are building towards a future where AI agents can see, hear and understand video as natively as text, and reimagining how the world learns from and interacts with video content.

Team

Our team has shipped AI/ML powered products at Snapchat and Amazon that served billions of users, deployed optimized large-scale ML systems that cut millions in costs, and published frontier research at venues like ICCV, NeurIPS, CVPR, AWS re:Invent and DEF CON. At Cloudglue, you’ll join a small, fast-moving team where every engineer makes a direct impact on the product and company trajectory.

Velocity

In less than 1 year at Cloudglue, our nimble team of 3 has:

  • Published 5+ frontier papers in video AI at top-tier conferences
  • Outperformed Gemini on cost, fidelity, speed, and features, backed by our research
  • Built and productionized Cloudglue APIs (in use by companies today)
  • Signed our first paying customers, have hundreds of developers onboarded, and growing

Your Role

We’re looking for a driven, deeply curious student to join us as a Full-Stack AI Engineer Intern. This isn’t just a coding role, you will:

  • Work directly with our CTO (ex-Snap, ex-Amazon, Carnegie Mellon alum) on projects that push the boundaries of multimodal AI.
  • Ship features end-to-end across our stack (React/TypeScript frontend, Node/Python backend).
  • Integrate frontier video/audio AI models into production APIs.
  • Propose new features.
  • Collaborate directly with founders, customers, and researchers to drive real-world impact.

If you want a startup experience where you can wear multiple hats, have a voice, and make visible contributions at the bleeding edge of video AI, Cloudglue is here to provide that experience.

Responsibilities

  • Full-Stack Development: Build and ship features across frontend (React/TypeScript) and backend (Node, Python).
  • AI Integration: Deploy and optimize cutting-edge multimodal AI models for video/audio understanding.
  • Tool & UI Design: Create intuitive developer tools and UIs that bring video/audio insights to life.
  • Collaboration & Ownership: Contribute ideas, own projects, and work closely with founders in a fast-paced startup environment.

What We’re Looking For

Required Skills

  • Strong CS fundamentals (algorithms, data structures).
  • Database proficiency (SQL, query optimization).
  • Excellent communication and collaborative mindset.

Nice to Haves

  • Full-stack web experience (TypeScript/React, Next.js, Supabase, Vercel).
  • Python backend + familiarity with AI orchestration frameworks (LangGraph, LangChain, Temporal, etc.).
  • Experience with vector databases (Pinecone, Weaviate, Milvus, pgvector).
  • UI/UX instincts for building developer-facing tools.
  • Cloud deployment knowledge (AWS/GCP, Docker/Kubernetes).

Other jobs at Cloudglue

internSan Francisco, CA, US / Remote (US)Full stack$6.5K - $8.5K / monthlyAny

Hundreds of YC startups are hiring on Work at a Startup.

Sign up to see more ›