GRM-2.6-Opus

Chat with GRM-2.6-Opus on ZeroGPU

Chat with GRM-2.6-Opus in a ZeroGPU Space, optimized with text-only chat, NF4 4-bit loading, bounded context, streaming output, and thinking parsing. Model: OrionLLM/GRM-2.6-Opus