I use MiniMax locally; their cloud sucks. They charge 2x for Highspeed, but it doesn't even hit the speeds advertised for standard. I'm using their cloud right now until the m2.7 weights drop, and it's been disappointing: 34 t/s on standard and 44 t/s on Highspeed, when it should be 50/100.
Interesting. Do you use it via minimax io or Alibaba? They changed the 500 thing too (it was a different usage maximum 3 weeks ago; more, but I can't remember the details).
I really wonder why your token speed is low, I've never experienced that lmao.
I like to run local Qwen quant versions for different tasks, but only the big ones, of course.
Qwen 3.5 27B is probably the best choice for local; it will be slow, but it's a solid model with much lower memory demands than most.
and 1M context window
27B is only 262K context window.
80b has 1M :)
Are you talking about Qwen3-Coder-Next 80B? It's only 262K too, but either way these start to get context rot after 100K.
Btw, since you're also deeper into it: I experimented a bit with memory files (like Claude's). Any experience with them? So far, with stuff like opencode + plugins like rp1 or others, it works well for staying up to date.
Check out mem0
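The memory-file idea itself is simple enough to sketch without any framework: append notes to a file during a session, then prepend its contents to the next prompt. This is just an illustration of the pattern, not mem0's actual API, and the `AGENT_MEMORY.md` filename is hypothetical:

```python
from pathlib import Path

MEMORY_FILE = Path("AGENT_MEMORY.md")  # hypothetical filename

def remember(note: str) -> None:
    """Append one bullet to the memory file."""
    with MEMORY_FILE.open("a", encoding="utf-8") as f:
        f.write(f"- {note}\n")

def recall() -> str:
    """Return all accumulated notes, to prepend to the next prompt."""
    return MEMORY_FILE.read_text(encoding="utf-8") if MEMORY_FILE.exists() else ""
```

Tools like mem0 layer retrieval and summarization on top of this, so the context doesn't just grow unboundedly.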
I use it via MiniMax direct. I signed up when I heard they were open-sourcing the m2.7 weights, something that wasn't looking likely.
The token speed is low due to demand. They promise 100+ tokens/sec on high speed, but it's barely faster than standard. You do get a lot of usage compared to others, though, and it's a good model. I'm waiting for the m2.7 weights to drop so I can run it on my RTX 6000 Pros. Should be any day now.
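Throughput claims like these are easy to sanity-check yourself: count tokens as they come off the stream and divide by wall-clock time. A minimal sketch; `fake_stream` is a hypothetical stand-in for a real streaming API response:

```python
import time

def tokens_per_sec(stream) -> float:
    """Count tokens from an iterable and divide by elapsed wall-clock time."""
    start = time.perf_counter()
    count = sum(1 for _ in stream)
    elapsed = time.perf_counter() - start
    return count / elapsed if elapsed > 0 else float("inf")

def fake_stream(n: int, delay: float = 0.001):
    """Stand-in for an API token stream: yields n tokens, one per `delay` seconds."""
    for _ in range(n):
        time.sleep(delay)
        yield "tok"
```

With a real client you would pass the streaming response iterator instead of `fake_stream`.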