Agent token-use optimiser

Use fewer tokens.

Tokenless is a six-phase pipeline that gives AI agents 60–92% token savings on a wide range of agentic tasks — without losing the information that matters. Free and open source.

SCAN → LOCATE → ZOOM → EXTRACT → ACT → DISCARD
How it works

Six phases. One goal.

Every call runs the same deterministic pipeline. Token budgets are enforced at each phase — no overruns, no surprises.

01 Scan (5% of budget): Detect the content type and build a ranked region index without loading the full document.
02 Locate (3% of budget): Rank regions by relevance to your query and deduplicate overlapping sections.
03 Zoom (35% of budget): Extract windowed content from the top-ranked regions only. The rest is never read.
04 Extract (12% of budget): Parse fields, bullets, and tables from the zoomed content into structured output.
05 Act (45% of budget): Your agent takes action using the compact result — not the raw document.
06 Discard (~0% of budget): Raw content is collapsed from session memory. Nothing lingers in context.
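The fixed per-phase split above can be sketched as a simple budget allocator. This is illustrative only — `allocateBudget` and `PHASE_SHARES` are not part of the tokenless API; the shares are taken from the phase list:

```typescript
// Hypothetical helper (not part of tokenless): splits a total token
// budget across the six phases using the fixed shares listed above.
const PHASE_SHARES: Record<string, number> = {
  scan: 0.05,
  locate: 0.03,
  zoom: 0.35,
  extract: 0.12,
  act: 0.45,
  discard: 0.0,
};

function allocateBudget(tokenBudget: number): Record<string, number> {
  const out: Record<string, number> = {};
  for (const [phase, share] of Object.entries(PHASE_SHARES)) {
    // Floor so a phase never exceeds its share of the budget.
    out[phase] = Math.floor(tokenBudget * share);
  }
  return out;
}
```

With a 4,000-token budget this yields 200 tokens for Scan, 120 for Locate, 1,400 for Zoom, 480 for Extract, and 1,800 for Act — enforcing the "no overruns" guarantee at each phase boundary.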
Benchmarks

Real numbers. Real documents.

60–92% token avoidance across all content types
10 content types auto-detected — no configuration required
100% recall across all benchmark fixtures — information preserved
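The avoidance rate reported here is presumably the share of raw input tokens that were never loaded into context. A minimal sketch of that computation — the exact internal formula in tokenless is an assumption, though the field names mirror the `result.economy` object shown under Usage:

```typescript
// Sketch: avoidance rate as (input tokens - tokens actually loaded) / input tokens.
// How tokenless computes this internally is an assumption for illustration.
function avoidanceRate(inputTokens: number, loadedTokens: number): number {
  if (inputTokens === 0) return 0; // avoid division by zero on empty input
  return (inputTokens - loadedTokens) / inputTokens;
}
```

For example, a 10,715-token document of which only 1,286 tokens are loaded gives 9,429 tokens avoided and a rate of about 0.88 — matching the sample numbers in the Usage section below.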
Installation

Up in sixty seconds.

# Install the package
npm install tokenless

# (Optional) Set up Claude Code integration
tokenless init --global --hooks
Usage
import { pipeline } from 'tokenless'

const result = await pipeline(content, query, {
  tokenBudget: 4000,
  topN: 3,
})

// result.economy.avoidanceRate  → 0.88
// result.bullets                → key facts
// result.fields                 → structured data
// result.economy.tokensAvoided  → e.g. 9,429
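A typical integration might fall back to the raw document when the pipeline cannot avoid enough tokens to be worthwhile. This is a sketch only: the 0.5 threshold, the fallback policy, and the `PipelineResult` shape are assumptions modeled on the fields shown above.

```typescript
// Assumed shape, modeled on the result fields shown in the Usage example.
interface PipelineResult {
  economy: { avoidanceRate: number; tokensAvoided: number };
  bullets: string[];
  fields: Record<string, string>;
}

// Hypothetical fallback policy: use the compact result only when it avoided
// a meaningful share of tokens; otherwise pass the raw document through.
function pickContext(result: PipelineResult, rawContent: string): string {
  if (result.economy.avoidanceRate >= 0.5) {
    return result.bullets.join('\n');
  }
  return rawContent;
}
```

The threshold is a per-application choice: a short document with low avoidance is often cheap enough to hand to the agent directly.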