Skip to content

Fireworks

Overview

Fireworks is an inference provider specializing in fast, cost-efficient serving of open-source models. Floopy proxies requests to Fireworks’ OpenAI-compatible API.

Supported Models

ModelContext WindowNotes
accounts/fireworks/models/llama-v3p3-70b-instruct128KLlama 3.3 70B
accounts/fireworks/models/llama4-maverick-instruct-basic128KLlama 4 Maverick
accounts/fireworks/models/qwen3-235b-a22b128KQwen 3 235B MoE
accounts/fireworks/models/deepseek-v3128KDeepSeek V3

Setup

  1. Go to Settings > Providers in the dashboard.
  2. Click Add provider and select Fireworks.
  3. Paste your Fireworks API key and click Save.

Usage

import OpenAI from "openai";
const client = new OpenAI({
baseURL: "https://api.floopy.ai/v1",
apiKey: process.env.FLOOPY_API_KEY,
});
const response = await client.chat.completions.create({
model: "accounts/fireworks/models/llama-v3p3-70b-instruct",
messages: [{ role: "user", content: "Explain quantum computing." }],
});

Provider-Specific Features

  • Fast inference — Fireworks specializes in optimized model serving with low latency.
  • Model naming — Models use the accounts/fireworks/models/model-name format.
  • Competitive pricing — Often lower per-token costs compared to other inference providers.