Ie á”
(ÆÆ ÌÆÆ
ÆÆ ÆÓ
jÆÆÆÞ ÆÆ ÆÆÆä1ÆÆè ÆÆ ;‹
’„‹ ÆÆ=ÆÆ Æ BÆÆÆÆÆÆ ÆW ÆÆÆÆÆì»Í~
îÍ ?RÆAÆÆÆÆM ÍÆ ÆÆ ‹Ò ÆÆ ÆÆ ÆÆÆ‰ ñ ¯ç–
«¤’ Æ å„ «ÆÆÆ Æ ÆÆÆÆÆÆÆÆÆ ÆÖ ÆÆÆ®õwÆ ÆP í–¯ª
íÉ% + ¼º ÛÞÆÆ Y üÆÆÆå‚ Æ ÆÆÆÆ ;Ø ú ïº:
oÑQÆÆÆÆÆØÉÝÅBÆÆÆÆÇ-BEE-ÆÆÆÆÆÆÆÑ xŸÁÆÆz®ç†
¾Ÿ}sÆÆÞâÆÆÆÆÆÆÆÊÆ-AGENT-7Ëà ’ÆðF
˜ ÇÆÆÆÆÆØZ ¾ÆÑÆQ˧¼÷
ÆÆÆÆ ôXÆÆã“sÇÆÆÑ
ÆÆÆ #ÆÄiSœÆÆ ÆÆÆ
ÆÆ ÆÆTòHÆÆÆÆÆÆÆÆXiÆÆÆ ÆÆ
ÆÆ ÆÆÍ l=¡5ÇÇS9¾yï× —Æ FÆ
ÆÆ ÆÆ ¨ÆÆÆÆÆÆÆÆÆÆÆÆ} ÆÆ ÆÆ
Æ4 3Ʋ »ÇÇoàÇSL…¹ ÇÇ ÆÆ Ʀ
ÆÆ GÆÆÆÆÆÆÆÆÆÆÒ7 ËÆ
Ư ÆÆæëÚšYJQÄÇ Æ
1Æ çØÆÆÆÆÆÆÆÇ Æ6
ÆÆ §ËÆÆÆé Ƴ
–ÆÆ Æ
bee
a minimal coding agent harness
"I'm not a bot. I'm a bee. There's a difference."
Install
curl -fsSL https://raw.githubusercontent.com/elhenro/bee/main/install.sh | sh
or: go install github.com/elhenro/bee/cmd/bee@latest
then run bee, type /model, choose oMLX, Ollama, OpenRouter, etc. and pick a model. Local or hosted, your choice.
What is it?
Local LLMs
bee is built to work well with Ollama and oMLX so you can run locally on your own hardware and keep full control. No API keys, no rate limits, no data leaving your machine.
On macOS, oMLX works best. Native Apple Silicon acceleration with prompt caching keeps things fast and memory-efficient.
I run Huihui-Qwen3.6-35B-A3B-Claude-4.7-Opus-abliterated-mlx-8bit (34.32 GB) very reliably on a MacBook M3 Max 64 GB, with Qwen3-VL-4B-Instruct-MLX-4bit (2.90 GB) for vision support. bee handles vision automatically for models that only do text.
Settings for the Huihui-Qwen model: temperature 0.7, top-p 0.85, top-k 20, KV-cache quantization 8-bit.
Other models that work well: