Journal
Engineering notes, product stories, and ideas from the team.
Designing Flux Engine: one runtime for every product
A peek at the architecture choices behind our shared on-device inference runtime — and the constraints that shaped them.
WhisperFlux preview: speech, speakers, summaries — all local
Our flagship enters internal beta. A walkthrough of the streaming pipeline and what we learned shipping ASR on real phones.
Squeezing a 7B model onto your phone: a quantization field guide
Q4_K_M, AWQ, GPTQ, SmoothQuant — what actually matters when you only have 4 GB of RAM and a 4 W power budget.
Private by architecture, not by promise
A privacy policy is a promise. An app that has no servers is a fact. Here is the difference, in code.
Speaker diarization on a phone: a deep dive
How we run end-to-end speaker diarization in real time on a 4 W power budget — without uploading any audio.
VisionFlux: a roadmap for local visual understanding
Our second product takes shape. What local vision-language models can do today — and where we are betting they go next.
TranslateFlux: building a private, offline universal translator
Notes on translation latency, quality, and the engineering tricks that let a small model feel competitive with a much larger cloud one.
NoteFlux: design philosophy for an AI notebook that respects you
NoteFlux is a notebook with a quiet local model living inside. Here is what we want it to feel like — and what we are explicitly not building.
CodeFlux: an offline pair programmer that respects your repo
A 7B-class code model running locally with project-aware retrieval. Why a smaller model with the right context beats a larger one without.
No account, by design
You will never sign up for an OmniFlux app. We argue why "no account" is a feature, not a missing one — and how it changes everything downstream.
Building for airplane mode
A simple design rule shapes everything we ship: every feature has to work without a network. Here is what that does to a product.