PRIVATE · ON-DEVICE · ENCRYPTED

Your private AI.
Right on your phone.

Scan documents, ask questions, get answers — all powered by AI that runs entirely on your device. No cloud. No accounts. Works offline.

Free to start · Pro from $9.99/mo · Founding pricing for waitlist members

See it in action

Your documents.
Your device. Only.

Process sensitive documents, get sourced answers, and extract structured data — entirely on-device. Nothing ever touches a server.

Sourced answers with page-level attribution
Analyze contracts, extract clauses, summarize reports
Snap a receipt or document — instant AI processing
UnQuest

Summarize the key clauses in my NDA

Based on your NDA document, here are the key clauses:

Confidentiality Period: 3 years from disclosure date

Non-Compete: 12-month restriction within 50mi radius

IP Assignment: All work product belongs to Company

Source: NDA_Agreement.pdf, pages 2-4
Processed entirely on device
Ask about your documents...

Privacy First

Cloud AI sees everything.
We see nothing.

Cloud AI: Your data stored on their servers
UnQuest: Your data stays on your device. Always.

Cloud AI: API calls for every query
UnQuest: Zero network calls. Verifiably.

Cloud AI: Prompts logged and analyzed
UnQuest: Zero telemetry on your content

Cloud AI: Data used to train their models
UnQuest: Your data trains nothing but your own knowledge graph

Cloud AI: Trust their privacy policy
UnQuest: Trust the architecture: on-device, encrypted, auditable

How it works

Three steps. Zero cloud.

01

Download once. Runs offline.

Install UnQuest and a compact 2.8GB AI model. After that, zero internet needed. No accounts, no API keys, no cloud — it all runs on your phone.

02

Feed it your world

Drop PDFs, scan contracts with your camera, import notes. UnQuest chunks, embeds, and indexes everything locally into a private knowledge graph.

03

Put it to work

Analyze an NDA. Categorize a month of receipts. Track trends across lab reports. Sourced answers with page attribution — all in airplane mode.

Join the waitlist

Founding member pricing for waitlist · No credit card required

Capabilities

Everything runs on your device

No API keys. No accounts. No data leaving your phone. Intelligence that processes, understands, and reasons — right where your data lives.

Document Intelligence

Analyze a 30-page contract in under a minute. Extract clauses, summarize terms, flag risks — entirely on your phone, even in airplane mode.

Knowledge That Compounds

Every document you process makes UnQuest smarter. A persistent knowledge base and cross-conversation memory connect insights across all your files.

Camera to Knowledge

Photograph a receipt, scan a contract, snap a lab report. Instant OCR, indexing, and AI analysis — one tap from paper to searchable knowledge.

Sourced Answers

Ask anything about your documents. Get answers with exact page and paragraph attribution — so you can verify every claim, not just trust it.

On-Device Voice

Speak to your knowledge base. On-device speech-to-text (Parakeet) and natural text-to-speech — your voice is never streamed to a server.

Local Image Generation (soon)

On-device image generation with the Bonsai MLX model is in active development. When it lands, prompts and pixels stay on your phone — no cloud render.

Siri & Shortcuts

Ask UnQuest straight from Siri, and automate document workflows with the Shortcuts app via native App Intents.

Domain Workflows

Purpose-built intelligence for professionals: NDA clause extraction, expense categorization, medical trend tracking. Real work, not generic chat.

Zero-Knowledge Architecture

Zero network calls for inference, zero telemetry, zero third-party SDKs, AES-256 encryption at rest. Keys in the Secure Enclave. A guarantee, not just a policy.

Models

Run the best open models — locally.

Download Qwen, Gemma, Llama, DeepSeek, Phi and more, optimized for Apple Silicon. Complex work routes to your downloaded model; quick tasks go to Apple Intelligence — automatically.

Qwen 3.5

1.2B–9B · vision · default

Gemma 3 / 3n

1B–4B · multimodal

Llama 3.2

1B–3B · 128K context

DeepSeek R1

1.5B · reasoning

SmolLM3

3B · dual reasoning

Phi-4 Mini

3.8B · math & logic

IBM Granite

2B · tool-use

LiquidAI LFM2.5

1.2B · fastest

Bonsai (MLX)

image gen · soon

Apple Intelligence

iOS 26+ · no download

Qwen 3.5 4B

Default

State-of-the-art reasoning in 2.8GB. Handles contract analysis, document Q&A, and structured extraction at 15-25 tokens/sec.

Parameters

4B

Quantization

Q4_K_M

Size

2.8 GB

Context

262K native

Apple Intelligence

iOS 26+

Apple's built-in foundation model. No download, instant response. Handles classification, simple chat, and triage — freeing the big model for heavy lifting.

Download

None

Tasks

Chat, classify

Speed

Instant

Requires

iOS 26+

Smart Inference Routing

Simple chat & classification

Apple Intelligence

RAG, analysis & workflows

Your downloaded model

Document processing & OCR

llama.cpp / MLX
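
For the curious, the routing table above can be pictured as a simple lookup. This is only a minimal sketch: the task labels, the `route` function, and the fallback for devices without Apple Intelligence are illustrative assumptions, not UnQuest's actual code.

```python
from enum import Enum


class Backend(Enum):
    APPLE_INTELLIGENCE = "Apple Intelligence"
    DOWNLOADED_MODEL = "Your downloaded model"
    LOCAL_RUNTIME = "llama.cpp / MLX"


# Hypothetical task labels standing in for the three rows of the routing table.
ROUTES = {
    "chat": Backend.APPLE_INTELLIGENCE,
    "classification": Backend.APPLE_INTELLIGENCE,
    "rag": Backend.DOWNLOADED_MODEL,
    "analysis": Backend.DOWNLOADED_MODEL,
    "workflow": Backend.DOWNLOADED_MODEL,
    "document_processing": Backend.LOCAL_RUNTIME,
    "ocr": Backend.LOCAL_RUNTIME,
}


def route(task: str, apple_intelligence_available: bool = True) -> Backend:
    """Pick a backend for a task; raises KeyError for unknown task labels."""
    backend = ROUTES[task]
    # Assumption: without Apple Intelligence (it requires iOS 26+),
    # quick tasks fall back to the downloaded model.
    if backend is Backend.APPLE_INTELLIGENCE and not apple_intelligence_available:
        return Backend.DOWNLOADED_MODEL
    return backend


print(route("chat").value)
print(route("chat", apple_intelligence_available=False).value)
```

Every branch resolves to something that runs on the phone, which is the point of the table: routing changes which local engine answers, never where the data goes.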

Network calls: 0
Data uploaded: 0 bytes
Third-party SDKs: 0
Encryption: AES-256-GCM
Inference: 100% on-device
Parameters: 4 Billion
Works offline: Always
Cloud dependency: None

How we compare

Privacy and capability.
Not either/or.

Cloud AI has the features but leaks your data. Other local AI apps protect your data but only offer chat. UnQuest gives you both.

Capability · UnQuest · Cloud AI · Other local apps
Runs on your device
Zero network calls
Document processing (PDF, OCR)
Knowledge graph across documents
Sourced answers with page attribution
Domain workflows (legal, financial, medical)
AES-256 encryption at rest
Camera scan → instant analysis
Works in airplane mode
On-device voice (speech in & out)
Free, fully on-device tier

Cloud AI = ChatGPT, Claude, Gemini · Other local apps = PocketPal, Enclave, Private LLM, Locally AI

Compatibility

Built for Apple Silicon

Native Swift. Metal GPU acceleration. Purpose-built for on-device inference — not a web wrapper, not a port.

iOS

17.0+

RAM

8 GB minimum

Storage

~3 GB for model

Chip

A17 Pro / A18+

Device · Chip · RAM · Status
iPhone 17 Pro / Pro Max · A19 Pro · 12 GB · Best experience
iPhone 17 / Air · A19 · 8 GB · Full support
iPhone 16 Pro / Pro Max · A18 Pro · 8 GB · Recommended
iPhone 16 / Plus / 16e · A18 · 8 GB · Full support
iPhone 15 Pro / Pro Max · A17 Pro · 8 GB · Full support
iPhone 15 / Plus & earlier · A16 / older · 6 GB · Insufficient RAM
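
The requirements above boil down to a short checklist. Here is a minimal sketch of that check, using the stated minimums (8 GB RAM, iOS 17.0, about 3 GB of storage). The `Device` type, the `blockers` function, and every message except "Insufficient RAM" are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass

MIN_RAM_GB = 8        # "8 GB minimum"
MIN_IOS = (17, 0)     # "17.0+"
MODEL_STORAGE_GB = 3  # "~3 GB for model"


@dataclass
class Device:
    name: str
    ram_gb: int
    ios_version: tuple[int, int]  # (major, minor)
    free_storage_gb: float


def blockers(device: Device) -> list[str]:
    """Return the reasons a device cannot run the default model (empty if none)."""
    problems = []
    if device.ram_gb < MIN_RAM_GB:
        problems.append("Insufficient RAM")
    if device.ios_version < MIN_IOS:
        problems.append("iOS too old")  # hypothetical message
    if device.free_storage_gb < MODEL_STORAGE_GB:
        problems.append("Not enough free storage")  # hypothetical message
    return problems


print(blockers(Device("iPhone 15", 6, (18, 0), 64.0)))
print(blockers(Device("iPhone 16 Pro", 8, (18, 0), 64.0)))
```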

FAQ

Questions answered

Completely. Once you download the 2.8GB model, every computation runs on your device's GPU. Analyze contracts, process receipts, query your knowledge base — all in airplane mode. Zero network calls, zero telemetry, zero third-party SDKs. Verifiably private.

ChatGPT and Claude send your data to remote servers for processing. UnQuest runs entirely on your device — your prompts, documents, and responses never leave your phone. Beyond privacy, UnQuest also processes your documents, builds a knowledge graph, and gives sourced answers with page attribution. The trade-off: the on-device model is smaller (4B vs 100B+), so it excels at focused tasks like document Q&A, extraction, and analysis.

Most on-device AI apps are chat interfaces that run a model locally. UnQuest goes further: it ingests and indexes your documents, builds a persistent knowledge graph that connects insights across files, and runs purpose-built workflows for legal, financial, and medical use cases — all with the same on-device privacy guarantee.

Drop in a PDF or scan with your camera. UnQuest extracts text (OCR for scanned docs), chunks it intelligently, generates embeddings, and indexes everything into a private, searchable knowledge graph — on-device. Then ask questions and get sourced answers with exact page attribution. Run workflows: extract NDA clauses, categorize receipt line items, track trends across lab reports.
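
To make the chunk, embed, index, and answer flow concrete, here is a toy sketch in plain Python. It is an illustration under loud assumptions: the bag-of-words `embed` function stands in for a real embedding model, the in-memory list stands in for the knowledge graph, and all names (`Chunk`, `index_document`, `search`) are hypothetical rather than UnQuest's API.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Chunk:
    source: str      # file the chunk came from
    page: int        # 1-based page number, kept for attribution
    text: str
    vector: Counter  # toy embedding: word -> count


def embed(text: str) -> Counter:
    """Toy bag-of-words vector; a real app would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def index_document(source: str, pages: list[str], max_words: int = 40) -> list[Chunk]:
    """Split each page into fixed-size word chunks and embed each chunk."""
    chunks = []
    for page_number, page_text in enumerate(pages, start=1):
        words = page_text.split()
        for start in range(0, len(words), max_words):
            text = " ".join(words[start:start + max_words])
            chunks.append(Chunk(source, page_number, text, embed(text)))
    return chunks


def search(index: list[Chunk], question: str, top_k: int = 1) -> list[Chunk]:
    """Rank chunks by cosine similarity to the question."""
    query = embed(question)
    ranked = sorted(index, key=lambda c: cosine(query, c.vector), reverse=True)
    return ranked[:top_k]


pages = [
    "This agreement is made between the Company and the Recipient.",
    "The confidentiality period lasts three years from the disclosure date.",
    "All work product created under this agreement belongs to the Company.",
]
index = index_document("NDA_Agreement.pdf", pages)
best = search(index, "How long is the confidentiality period?")[0]
print(f"{best.text} (Source: {best.source}, page {best.page})")
```

Because each chunk carries its page number from the moment it is indexed, the answer can always point back to where it came from.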

All stored data (documents, conversations, embeddings, and the knowledge graph) is encrypted with AES-256-GCM. The encryption key is kept in Apple's Keychain, protected by the Secure Enclave. Even if someone extracts your phone's storage, the data is unreadable without biometric authentication.

Any iPhone with 8GB RAM and Apple Silicon — that's iPhone 15 Pro, all iPhone 16 models, and the entire iPhone 17 lineup. Standard iPhone 15 and older models don't have enough memory to run a 4B parameter model. iPad and Mac support is planned.

15-25 tokens per second on A18 Pro with Metal GPU acceleration. Responses start streaming immediately — you don't wait for the full answer. Apple Intelligence handles quick tasks (classification, triage) near-instantly, freeing the heavy model for deep analysis.
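
As a rough worked example of what that speed means in practice: the 300-token answer length below is an assumption for illustration, and the estimate covers generation only, not the time spent reading your prompt and documents.

```python
def generation_seconds(answer_tokens: int, tokens_per_second: float) -> float:
    """Time to stream a full answer at a steady generation rate."""
    return answer_tokens / tokens_per_second


answer_tokens = 300  # assumed answer length, for illustration only
slowest = generation_seconds(answer_tokens, 15)  # low end of the quoted range
fastest = generation_seconds(answer_tokens, 25)  # high end of the quoted range
print(f"{answer_tokens} tokens: {fastest:.0f}-{slowest:.0f} s")  # 300 tokens: 12-20 s
```

Since the response streams as it is generated, you start reading well before the last token arrives.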

Voice is fully on-device today: the Speech tab does speech-to-text (Parakeet, with Apple Speech as a fallback) and natural text-to-speech, so you can talk to your knowledge base hands-free. On-device image generation (Bonsai MLX) is in active development and ships in a later update. Either way, nothing leaves your phone.

Yes. UnQuest ships native App Intents, so you can ask it from Siri and build automations in the Shortcuts app — all running against on-device models.

There's a free tier with on-device chat, voice, and 3 document imports. Pro is $9.99/month or $79.99/year and unlocks unlimited documents, RAG with citations, the full model catalog, and image generation once it ships. Professional workflow packs (Legal, Financial, Medical) are one-time purchases: $9.99 each, or $19.99 for all three.

We will never route your content through external servers. That's an architectural guarantee, not just a policy. Future plans include encrypted iCloud sync (using your own private CloudKit database) for multi-device access, but inference and document processing will always remain on-device.

Pricing

Start free. Go Pro when you're ready.

A simple Pro subscription unlocks the full workspace. Workflow packs are one-time add-ons. No cloud costs — because there's no cloud.

Free during beta · Pricing takes effect at launch · Waitlist members get founding pricing

Free

$0

Get started with on-device AI chat, voice, and basic document processing.

  • Unlimited AI chat (on-device)
  • On-device voice mode (speech-to-text + TTS)
  • 3 document imports
  • Basic summarization
  • 10 saved conversations
  • Works 100% offline
Join waitlist
Popular

Pro

$9.99/month

or $79.99/year — save 33%

The full private AI workspace. Cancel anytime — your data always stays on-device.

  • Everything in Free
  • Unlimited documents
  • Ask questions about your docs (RAG + citations)
  • Knowledge base + semantic search across files
  • Cross-conversation memory
  • Full on-device model catalog
  • Siri & Shortcuts
  • AES-256 encryption at rest
Lock in founding pricing

Workflow Packs

$9.99 each · $19.99 for all three

Purpose-built intelligence for professionals. One-time purchase, add the packs you need.

  • Legal: NDA analysis, clause extraction
  • Financial: receipts, expense reports, tax docs
  • Medical: lab reports, trend tracking
  • Structured data extraction
  • Domain-specific prompts
Join waitlist

Your AI should be
yours alone

Private on-device intelligence is coming soon. Join the waitlist for early access and launch pricing.

Free to start · Pro from $9.99/mo · Waitlist members get founding pricing

No spam. We'll email you once when it's ready.