LLM Usage Notes


   ┌──────────────────────────────────────────────────────────────────────────┐
   │      ▓▓▓▒▒▒ PROVENANCE NODE 001-1 // ZONE: HMN_REMIXABLE ▒▒▒▓▓▓          │
   ├──────────────────────────────────────────────────────────────────────────┤
   │ SRC_AUTH : HMN                    │ SIG_STAT : UNSIGNED                  │
   │ AUTH_ID  : NICK PORCINO           │ TYPE     : SURVEY                    │
   │ DATE     : 20260619               │ TYPE     : SURVEY                    │
   ├──────────────────────────────────────────────────────────────────────────┤
   │ [ RIGHTS & PERMISSIONS MATRIX ]                                          │
   │ INDEX_ALLOW : YES                 │ CORP_TRAIN: NO                       │
   │ DERIV_ALLOW : YES                 │ GOV_SPDX  : ResponsibleSrc           │
   └──────────────────────────────────────────────────────────────────────────┘

Introduction

The following is a brief accounting of models found to be useful, with characterizing notes based on experience using them. Scripts to download and test most of these models may be found here:

https://codeberg.org/meshula/LabLlama/src/branch/dev/agentic/scripts

Models available via those scripts that did not prove useful enough to warrant notes are omitted below. If sufficient interest in this list develops, future work will include proper benchmarking with a system such as Terminal-Bench or the Intelligence per Watt benchmark linked under Further Reading.
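Because every record below uses the same ten tags, the notes are straightforward to load for side-by-side comparison. The following is a minimal Python sketch, not part of the LabLlama scripts: the names `TAGS`, `parse_record`, and `sample` are hypothetical, and PROMPT/GEN is kept as raw text because its units are not stated in these notes. It matches each line against the known tags, so it works whether or not whitespace separates a tag from its value.

```python
# Hypothetical helper for loading the tag/value records in these notes.
# Tags are listed in the order the records use them.
TAGS = ["MODEL", "DATE", "PROC", "SRV", "PROMPT/GEN", "THINKING",
        "COHERENCE", "PLANNING", "CODING", "SYCOPHANCY"]


def parse_record(text: str) -> dict[str, str]:
    """Parse one record into a dict keyed by tag.

    Lines that do not start with a known tag (such as the TAG/VALUE
    header row) are ignored.
    """
    record = {}
    for line in text.splitlines():
        line = line.strip()
        for tag in TAGS:
            if line.startswith(tag):
                # Everything after the tag is the value, with padding removed.
                record[tag] = line[len(tag):].strip()
                break
    return record


sample = """\
TAG         VALUE
MODEL       Qwen3.6 27B NVFP4
DATE        20260618
PROC        DGX SPARK 128GB
SRV         UNKNOWN
PROMPT/GEN  N/A
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    EXCELLENT
CODING      UNTESTED
SYCOPHANCY  NOT OBNOXIOUS
"""

record = parse_record(sample)
print(record["MODEL"])       # Qwen3.6 27B NVFP4
print(record["SYCOPHANCY"])  # NOT OBNOXIOUS
```

Parsed records are plain dicts, so they can be filtered with ordinary comparisons, for example keeping only the models whose CODING value starts with GOOD.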

QWEN

Qwen3.6 27B NVFP4

TAG         VALUE
MODEL       Qwen3.6 27B NVFP4
DATE        20260618
PROC        DGX SPARK 128GB
SRV         UNKNOWN
PROMPT/GEN  N/A
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    EXCELLENT
CODING      UNTESTED
SYCOPHANCY  NOT OBNOXIOUS

Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-fp16

TAG         VALUE
MODEL       Qwen3.6-27B-AEON-Ultimate-Uncensored-BF16-mlx-fp16
DATE        20260610
PROC        M5 MAX 128GB
SRV         OMLX
PROMPT/GEN  N/10
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    EXCELLENT
CODING      GOOD, SOME HALLUCINATION
SYCOPHANCY  VERY LOW

Qwen3-Coder-Next-Q6_K

TAG         VALUE
MODEL       Qwen3-Coder-Next-Q6_K
DATE        20260524
PROC        M3 MAX 128GB
SRV         LLAMA-SERVE
PROMPT/GEN  30/10
THINKING    N/A
COHERENCE   GOOD
PLANNING    GOOD
CODING      GOOD, HALLUCINATION
SYCOPHANCY  LOW

Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF

TAG         VALUE
MODEL       Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-GGUF
DATE        20260524
PROC        M3 MAX 128GB
SRV         LLAMA-SERVE
PROMPT/GEN  30/10
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    EXCELLENT
CODING      GOOD
SYCOPHANCY  LOW

GEMMA

unsloth/gemma-4-26B-A4B-it-qat-GGUF:UD-Q4_K_XL

TAG         VALUE
MODEL       unsloth/gemma-4-26B-A4B-it-qat-GGUF:UD-Q4_K_XL
DATE        20260618
PROC        M1 ULTRA 64GB
SRV         LLAMA-SERVE --spec-type draft-mtp --spec-draft-n-max 2
PROMPT/GEN  800/70
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    UNTESTED
CODING      UNTESTED
SYCOPHANCY  VERY LOW

gemma-4-26B-A4B-it-unsloth-mlx-oQ4-fp16

TAG         VALUE
MODEL       gemma-4-26B-A4B-it-unsloth-mlx-oQ4-fp16
DATE        20260610
PROC        M5 MAX 128GB
SRV         OMLX
PROMPT/GEN  N/30
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    EXCELLENT
CODING      GOOD, SOME HALLUCINATION
SYCOPHANCY  MODERATE

GPT-OSS

gpt-oss-120b-mxfp4

TAG         VALUE
MODEL       gpt-oss-120b-mxfp4
DATE        20260524
PROC        M3 MAX 128GB
SRV         LLAMA-SERVE
PROMPT/GEN  30/10
THINKING    EXCELLENT
COHERENCE   EXCELLENT
PLANNING    EXCELLENT
CODING      GOOD
SYCOPHANCY  MODERATE

OTHER

Codestral-22B-v0.1.Q4_K_M

TAG         VALUE
MODEL       Codestral-22B-v0.1.Q4_K_M
DATE        20260524
PROC        M3 MAX 128GB
SRV         LLAMA-SERVE
PROMPT/GEN  N/30
THINKING    N/A
COHERENCE   GOOD
PLANNING    MODERATE
CODING      MODERATE, HALLUCINATION
SYCOPHANCY  LOW

Mixtral-8x7B-Instruct-v0.1.Q4_K_M

TAG         VALUE
MODEL       Mixtral-8x7B-Instruct-v0.1.Q4_K_M
DATE        20260524
PROC        M3 MAX 128GB
SRV         LLAMA-SERVE
PROMPT/GEN  N/30
THINKING    N/A
COHERENCE   POOR
PLANNING    N/A
CODING      HIGH HALLUCINATION
SYCOPHANCY  MARCH HARE

Further Reading

https://scalingintelligence.stanford.edu/pubs/ipw/