Qwen3.6 | byteiota

Tag: Qwen3.6

Bonsai 27B — a 27-billion parameter AI model running on iPhone, neural network visualization

News

Bonsai 27B Runs on iPhone: The On-Device AI Tradeoffs

PrismML shrinks a 27B model to 3.9GB for iPhone. Benchmarks look solid, but developer reports ...

7 days ago

Speed performance chart showing Qwen3.6 27B token generation improvement with MTP enabled in llama.cpp

Industry Analysis

Qwen3.6 MTP in llama.cpp: 27B Model Now 1.7x Faster

llama.cpp MTP support turns Qwen3.6 27B into a 65 t/s machine on RTX 3090 — ...

June 30, 2026