Simple, transparent, and affordable legal AI.

The Isaacus API has no upfront costs and no hidden fees. You pay only for what you use.

Pay as you go

API / Cloud

Self-serve, per-token pricing. You pay only for what you use.

No upfront costs and no hidden fees
$100 in free credits for new users
Per-token pricing — all amounts in USD

Get building See costs

Advanced

Enterprise

Private deployments, volume discounts, or an alternative pricing model.

Private air-gapped deployments
Finetuning on your own data
Azure on request

Per-token prices

Usage of our models is charged based on the number of tokens inputted into them. All amounts are in USD.

Model	Price
kanon-2-embedder	$0.35 / 1M tokens $0.00000035 / token
kanon-2-reranker	$0.35 / 1M tokens $0.00000035 / token
kanon-universal-classifier	$1.00 / 1M tokens $0.000001 / token
kanon-answer-extractor	$1.50 / 1M tokens $0.0000015 / token
kanon-2-enricher	$3.50 / 1M tokens $0.0000035 / token

These prices apply only to the cloud-hosted Isaacus API. The pricing for our air-gapped Amazon SageMaker models is publicly available on AWS Marketplace.

What each model does

Embedding

kanon-2-embedder

The most accurate legal embedding model on the Massive Legal Embedding Benchmark (MLEB).
Reranking

kanon-2-reranker

The most accurate legal reranker on Legal RAG Bench.
Universal classification

kanon-universal-classifier

Our most powerful universal classification model.
Extractive question answering

kanon-answer-extractor

Our base answer extractor, designed to balance precision with throughput.
Enrichment

kanon-2-enricher

The first enrichment and hierarchical graphitization model.
Open source

semchunk

Our semantic chunking algorithm is free and open-source. View on GitHub.

The fine print

Credits, and how we calculate the number of tokens you're charged for. Expand only what you need.

Free credits

Taxes and price changes

How charging works

Boilerplate tokens

The first difference between the number of tokens inputted into an API endpoint and the number of tokens inputted into a model is that boilerplate tokens can be added to inputs after they are received by the API endpoint.

Boilerplate tokens are typically, but not always, used to structure inputs into whatever format that the model expects. The table below shows the number of boilerplate tokens that are added to inputs for each of our models.

Model	Boilerplate tokens	Description
kanon-2-enricher	2	Inputs are formatted as `<\|startoftext\|>{text}<\|endoftext\|>`.
kanon-2-embedder	2 – 13	Queries use a retrieval query wrapper (13 tokens), documents a retrieval passage wrapper (12 tokens), and all other texts `<\|startoftext\|>{text}<\|endoftext\|>` (2 tokens).
kanon-2-reranker	3	Queries are formatted alongside texts as query + text pairs.
kanon-answer-extractor	3	Queries are formatted alongside texts as query + text pairs.
kanon-universal-classifier	3	Statements are formatted alongside texts as text + statement pairs.

Chunking

Isaacus Query Language (IQL)

Approximating costs

The following Python function can be used to approximate the number of tokens that will be inputted into a model typically within a margin of a couple dozen tokens though absolutely no warranties or guarantees are made as to its reliability.