# AI Assistant
BlockVault includes an on-device AI assistant that runs entirely on the user's phone — no data is sent to external servers. It understands natural language and can interact with wallet features through tool calling.
## Model
| Property | Value |
|---|---|
| Model | Gemma 3 1B Instruction-Tuned |
| Training | SFT + GRPO fine-tuning on BlockVault-specific dataset |
| Quantization | INT4_BLOCK32 (CPU), FP16 (GPU) |
| Runtime (Android) | LiteRT-LM (.litertlm files) |
| Runtime (iOS) | MediaPipe Tasks GenAI (.task files) with Metal GPU |
## Performance
| Backend | Speed | Notes |
|---|---|---|
| GPU (OpenCL) | ~20 tok/s | Not compatible with INT4 quantized models |
| CPU (XNNPack) | ~10–15 tok/s | Default — more stable and compatible |
The model is downloaded on first use and cached in the app's documents directory. Downloads are resumable.
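One common way to implement resumable downloads is an HTTP `Range` request that skips the bytes already on disk. This is an illustrative sketch, not BlockVault's actual downloader; the helper name is hypothetical:

```python
import os

def resume_range_header(path: str, total_size: int) -> dict:
    """Build an HTTP Range header to resume a partial model download.

    Hypothetical helper: checks how much of the file is already cached
    and requests only the missing tail.
    """
    downloaded = os.path.getsize(path) if os.path.exists(path) else 0
    if downloaded >= total_size:
        return {}  # download already complete, nothing to fetch
    return {"Range": f"bytes={downloaded}-{total_size - 1}"}
```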
## Agent Architecture (ReAct)
The AI uses a ReAct (Reasoning + Acting) pattern implemented with LangGraph. Three specialized sub-agents work in sequence:
```
User message
      ↓
Classifier → "Can I answer directly, or do I need a tool?"
      ↓ (direct)             ↓ (tool_needed)
Final answer          Tool Selector → "Which tool(s)?"
                             ↓
                      Tool Caller → Execute tool(s) → Observe result
                             ↓
                      Final answer (with tool results)
```

### Sub-agents
| Agent | Purpose | Temperature |
|---|---|---|
| Classifier | Decide if the query needs a tool or can be answered directly | 0.1 |
| Tool Selector | Pick the right tool(s) — supports multi-tool chains | 0.3 |
| Tool Caller | Execute tools, parse results, handle follow-ups | 0.3 |
The Tool Caller supports up to 5 tool calls per conversation and can ask the user for missing information before executing.
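The flow above can be sketched as a plain loop. This is illustrative only — the real agent is built with LangGraph, and the callback names here (`classify`, `select_tools`, `call_tool`) are assumptions, not BlockVault's API:

```python
MAX_TOOL_CALLS = 5  # per-conversation cap described above

def run_agent(message, classify, select_tools, call_tool):
    """Minimal ReAct-style dispatch (sketch, not the LangGraph graph).

    classify(message)     -> "direct" or "tool_needed"
    select_tools(message) -> list of tool names
    call_tool(name)       -> observation string
    """
    if classify(message) == "direct":
        return {"observations": []}  # Classifier short-circuits to a final answer
    observations = []
    # Tool Caller executes at most MAX_TOOL_CALLS tools, observing each result.
    for name in select_tools(message)[:MAX_TOOL_CALLS]:
        observations.append(call_tool(name))
    return {"observations": observations}
```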
## Available Tools
The AI has access to 10 tools that bridge natural language to wallet operations:
| Tool | What it does |
|---|---|
| `get_balance` | Check crypto balance for an asset or address |
| `get_assets` | List available assets, filtered by blockchain or balance |
| `get_price` | Current market price for one or more symbols |
| `get_historical_price` | OHLCV candle data for trend analysis (1h, 4h, 1d, 1w intervals) |
| `get_transaction_history` | Past transactions for a blockchain |
| `transfer` | Navigate to the send screen with pre-filled recipient and amount |
| `estimate_fee` | Calculate transaction cost (slow/normal/fast) |
| `receive_crypto` | Show deposit address with QR code |
| `calculate_goal` | Investment target planning — how much is needed to reach a goal |
| `calculate_dca` | Dollar-cost averaging strategy calculation |
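A tool registry like the one above is often represented as data so the agent can check which parameters it still needs to ask for. The parameter names below are assumptions for illustration, not BlockVault's actual schemas:

```python
# Illustrative registry of the 10 tools; parameter names are assumptions.
TOOLS = {
    "get_balance": {"params": ["symbol"]},
    "get_assets": {"params": ["blockchain"]},
    "get_price": {"params": ["symbols"]},
    "get_historical_price": {"params": ["symbol", "interval"]},
    "get_transaction_history": {"params": ["blockchain"]},
    "transfer": {"params": ["recipient", "amount", "symbol"]},
    "estimate_fee": {"params": ["speed"]},
    "receive_crypto": {"params": ["symbol"]},
    "calculate_goal": {"params": ["target_usd", "symbol"]},
    "calculate_dca": {"params": ["amount_per_period", "periods"]},
}

def missing_params(tool: str, provided: dict) -> list:
    """Parameters the Tool Caller would still need to ask the user for."""
    return [p for p in TOOLS[tool]["params"] if p not in provided]
```

This is the kind of check behind the "can ask the user for missing information" behavior: if `transfer` arrives with only an amount, the agent knows to prompt for the recipient and asset.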
### Multi-tool chains
Tools can be chained in sequence for complex queries:
```
"How much more ETH do I need to reach $10,000?"
  → get_balance     (check current ETH)
  → get_price       (get current ETH price)
  → calculate_goal  (compute the gap)
```

### Wallet integration
Tools interact with real wallet data:
- `get_balance` queries the local database (`AssetsEntity` + `BalanceEntity`)
- `transfer` opens the send screen with pre-filled fields
- `receive_crypto` shows a modal with the wallet's QR code
- If the user doesn't specify an asset, the AI shows a selector modal to let them pick
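The `get_balance` → `get_price` → `calculate_goal` chain from the ETH example reduces to simple arithmetic. This sketch is illustrative (the function name and exact formula are assumptions):

```python
def goal_gap(balance: float, price: float, target_usd: float) -> float:
    """Extra units of an asset needed to reach a USD target.

    Mirrors the get_balance -> get_price -> calculate_goal chain;
    illustrative arithmetic, not BlockVault's exact implementation.
    """
    current_value = balance * price          # from get_balance + get_price
    shortfall = max(target_usd - current_value, 0.0)
    return shortfall / price                 # calculate_goal: the gap in asset units
```

For example, holding 2 ETH at $2,000 toward a $10,000 goal leaves a $6,000 shortfall, i.e. 3 more ETH.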
## Conversation Example
User: "What's my Bitcoin balance?"

```
┌─ Classifier ─────────────────────────┐
│ Needs tool: yes → tool_needed        │
└──────────────────────────────────────┘
                  ↓
┌─ Tool Selector ──────────────────────┐
│ Tool: get_balance                    │
│ Reason: user asks for balance        │
└──────────────────────────────────────┘
                  ↓
┌─ Tool Caller ────────────────────────┐
│ <tool_call>                          │
│ get_balance(symbol: "BTC")           │
│ </tool_call>                         │
│                                      │
│ Observation: { balance: "0.5423" }   │
│                                      │
│ Answer: You have 0.5423 BTC.         │
└──────────────────────────────────────┘
```

## Thinking Process
The model uses `<think>` tags for internal reasoning, visible to the user as an expandable "thinking" section:
```
<think>
The user wants to know their Bitcoin balance.
I need to call get_balance with symbol BTC.
</think>
```

Each ReAct iteration shows: thought → action → observation — giving users transparency into the AI's decision-making.
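Extracting these tagged spans from raw model output is a small parsing step. A minimal regex sketch (a production parser would also handle streaming and unclosed tags):

```python
import re

def extract_tagged(text: str, tag: str) -> list:
    """Return the contents of every <tag>...</tag> span in model output.

    Works for <think> and <tool_call> alike; illustrative sketch only.
    """
    return [m.strip()
            for m in re.findall(rf"<{tag}>(.*?)</{tag}>", text, flags=re.DOTALL)]
```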
## Training Pipeline
The model is fine-tuned on a custom dataset of roughly 3,500 examples covering:
| Conversation type | Description |
|---|---|
| Single tool | One tool call per conversation |
| Clarification | 6+ turns asking for missing parameters |
| Multi-tool | Sequential tools passing results between them |
| Dynamic | Model decides next tool based on previous results |
| Error handling | Tool fails, model explains the issue |
| Partial execution | Plans more tools than executed |
| Deflection | User changes topic mid-conversation |
Training uses two phases:
- SFT (Supervised Fine-Tuning) — learns the format and tool-calling syntax
- GRPO (Group Relative Policy Optimization) — optimizes tool selection accuracy, parameter validation, JSON formatting, and safety
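GRPO optimizes against a reward signal. The actual reward functions are not shown here; this toy scorer only illustrates the kind of criteria listed above (JSON formatting, tool selection, parameter structure), with made-up weights:

```python
import json

def tool_call_reward(raw: str, known_tools: set) -> float:
    """Toy reward for a generated tool call, in the spirit of the GRPO phase.

    Illustrative only: weights and the expected JSON shape
    ('{"tool": ..., "args": {...}}') are assumptions.
    """
    try:
        call = json.loads(raw)
    except json.JSONDecodeError:
        return 0.0                       # malformed JSON gets no reward
    score = 0.4                          # well-formed JSON
    if call.get("tool") in known_tools:
        score += 0.4                     # picked a real tool
    if isinstance(call.get("args"), dict):
        score += 0.2                     # structured arguments present
    return score
```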