Sphax 18 minutes ago

That is some insane value. I've been using GLM Coding Plan Max with GLM 5.1 for a while and i've tested DeepSeek V4 Pro maybe for 3 weeks now and I found it to be better than GLM 5.1 for complex coding tasks. I've used 65m tokens and with that price it cost me $1.5, that's really cheap.

belinder 9 minutes ago

Anyone using deepseek through a gateway (not sure if right term) so there's no data retention? At work we're going through a few hundred million tokens a day in our app (using anthropic models), and we're looking for something significantly cheaper

  • bel8 4 minutes ago

    opencode allegedly has contractual no-data-retention policies with their providers.

    I recall reading about that in an issue or in their Discord server.

    But I would contact them formally to verify that.

cold_harbor 5 minutes ago

their MLA architecture cuts KV cache by ~5-13x vs standard attention. that's why inference is actually cheaper to run, not just a price war to gain market share.

bel8 14 minutes ago

Great! I have been using DeepSeek 4 Flash high for everything lately.

First accessible model with useable 1 million context window for me.

Havoc 30 minutes ago

Neat. I like DS for secondary checks on code. Sometimes spots things other models don't

kingjimmy 12 minutes ago

is this the Huawei chip difference?