Every time you use a cloud AI service, your data leaves your device. Your queries, your context, your conversation history — all transmitted to servers you don't control. For many use cases, this is an unacceptable tradeoff.

The Problem with Cloud AI

Cloud-based AI services have fundamental privacy limitations:

Even services that claim to not store data still require transmission — and you have to trust their word.

The Browser as a Privacy Boundary

Modern browsers provide a natural security sandbox:

When AI runs in the browser, your data never leaves your device. There's no server to trust, no privacy policy to read, no data processing agreement to sign.

How ReLU.chat Achieves Zero Data Collection

ReLU.chat is architecturally incapable of collecting your data:

  1. No server-side inference: All NLP runs in the browser
  2. No telemetry: Zero analytics, tracking pixels, or phone-home calls
  3. No accounts: No login, no email required, no user profiles
  4. No storage: No server-side database of conversations
  5. Open source: The code is public — verify the claims yourself
The only network request is the initial page load. After that, everything runs locally.

The Technical Stack

Making this work required solving several engineering challenges:

Model Size

The all-MiniLM-L6-v2 sentence transformer is ~90MB in its original form. Through ONNX quantization, we reduced it to ~22MB — small enough to load on a mobile connection in a few seconds.

Inference Speed

ONNX Runtime WebAssembly provides near-native inference speed. Our benchmark: ~15ms per embedding computation on a mid-range laptop. That's fast enough for real-time conversation.

Offline Support

A service worker caches all assets after the first load. The chatbot works offline — no internet required after initial setup.

Knowledge Base

Instead of training a massive model on the entire internet, we use curated knowledge fragments. This limits scope but guarantees accuracy — a worthwhile tradeoff for domain-specific chatbots.

When Cloud AI Makes Sense

Browser-based AI isn't always the right choice:

But for focused applications — customer support, documentation, education, FAQ — browser-based AI offers a better privacy/accuracy tradeoff.

The Future

WebGPU will bring GPU-accelerated inference to browsers, enabling larger models at faster speeds. WebNN provides a native neural network API. The browser is becoming a first-class AI platform.

ReLU.chat is an early example of what's possible. As browser capabilities grow, the range of viable on-device AI applications will expand dramatically.

Try It

Experience privacy-first AI: visit ReLU.chat. Open DevTools, watch the network tab — you'll see zero outgoing requests during conversation. Your data stays yours.

The code is open source at GitHub. Verify the claims yourself.