Every candidate route ships behind a canary and an eval bar. Regressions roll back in seconds, not sprints.
We started Floopy because our own AI bill scared us.
Floopy is the infrastructure layer between your agent and the providers — so the cheapest model that can do the job, does it. Every time. Without sacrificing what your users feel.
We were running AI features in production and the bill went from "line item" to "largest line item" in a few months. We tried what everyone tries: hand-written routing rules, token limits, aggressive caching, prompt compression. It worked, a little. It also broke constantly — the moment a new model shipped, our rules were stale. The moment a prompt changed, our caches were poisoned. The moment we relaxed anything, quality wobbled.
The pattern we wanted didn't exist as a product: route every call by a learned policy grounded in real user feedback, with canary-safe promotion and fast rollback. So we built it. Floopy is now an OpenAI-compatible AI gateway teams use to move traffic between models without moving their quality bar.
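The "learned policy grounded in real user feedback" idea can be sketched as a toy epsilon-greedy router. Everything below — model names, the reward signal, the exploration rate — is illustrative, not Floopy's actual engine, which the text describes only at a high level:

```python
import random

class ToyRouter:
    """Epsilon-greedy routing sketch: mostly send traffic to the model with
    the best observed reward, but keep exploring so new models get a shot."""

    def __init__(self, models, epsilon=0.1, seed=0):
        self.models = list(models)            # e.g. ["small", "large"]
        self.epsilon = epsilon                # fraction of calls that explore
        self.counts = {m: 0 for m in models}  # calls routed to each model
        self.rewards = {m: 0.0 for m in models}  # cumulative feedback
        self.rng = random.Random(seed)

    def route(self):
        # Explore until every model has been tried, then mostly exploit
        # the best average reward.
        if self.rng.random() < self.epsilon or not all(self.counts.values()):
            return self.rng.choice(self.models)
        return max(self.models, key=lambda m: self.rewards[m] / self.counts[m])

    def feedback(self, model, reward):
        # In practice the reward might blend user thumbs-ups with cost;
        # here it is just a number the caller supplies.
        self.counts[model] += 1
        self.rewards[model] += reward

# Usage: route a call, observe feedback, report it back.
router = ToyRouter(["small", "large"])
model = router.route()
router.feedback(model, reward=0.9)
```

A contextual bandit (the direction the v2.0 engine took, per the timeline below) extends this by conditioning the choice on per-request features instead of a single global average.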
We believe three things, strongly: optimization belongs in the request path, not in a dashboard; every routing decision should be explainable to a non-ML engineer on call at 3am; and teams should only pay for infrastructure that has measurably paid for itself.
Four rules we don't break.
These are the constraints we hold the product to. When something else has to give, these don't.
Floopy is a drop-in base URL for the OpenAI SDK you already ship. Ejecting is a one-line config change — point the baseURL back at OpenAI — not a migration project.
Every routed call has a reason string. Every promotion has a diff. Every rollback has a trace.
Measured baseline vs. post-routing cost is always visible in your dashboard. If we can't move your bill, you'll see it — and so will we.
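The drop-in rule above amounts to a one-line configuration change in the OpenAI SDK. The base URL is the one named in the timeline below; the API key names are placeholders:

```python
from openai import OpenAI

# Point the SDK at the gateway instead of api.openai.com.
client = OpenAI(
    base_url="https://api.floopy.ai/v1",  # Floopy's OpenAI-compatible endpoint
    api_key="YOUR_FLOOPY_KEY",            # placeholder
)

# Ejecting is the reverse one-line change — drop base_url and the SDK
# defaults back to api.openai.com:
# client = OpenAI(api_key="YOUR_OPENAI_KEY")
```

No prompts, request shapes, or response parsing change; only the endpoint the SDK talks to.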
Small by design.
Eleven people. Half infra, half ML, all shipping. We hire slowly and rarely — if you're reading this and it sounds like home, write us anyway.
Previously infra lead at Parallax. Built the original routing engine over one very long weekend.
Ex-Anthropic applied research. Spends most days in the bandit convergence code.
Previously staff at Stripe. Owns the SDK surface and refuses to break it.
Former Vertex research. Wrote the paper the eval harness is based on.
On call when the graphs are green; usually also on call when they're red.
Drove SOC 2 and ISO 27001. Previously at Cloudflare.
Runs the design system & docs. If something looks off, blame her but gently.
Closed deal #1. Still closes most of them.
A short history.
Two years in; roughly a billion calls routed in the last ninety days.
Internal prototype
Routing rules move from a cron job to an actual control plane. We stop breaking our own product.
OpenAI-compatible endpoint
We ship the first public api.floopy.ai/v1. One base-URL swap and routing is live — no SDK to adopt, no prompts to change.
Seed round · $8.4M
Led by Nimbus Capital, with angels from Anthropic, Anyscale, and Cloudflare.
v2.0 routing engine
Contextual bandits ship. Median convergence drops from 7 days to 48 hours.
~900 teams · ~1B calls/mo
Still small. Still hiring selectively. Still shipping every Tuesday.
Patient capital, picked carefully.
Want to work with us?
We're a small team building carefully. Send a note — the best conversations don't start from a listing.