I really want to use tools like Claude Code or self-written agents (LangGraph) with top-notch models without relying on US clouds or buying 5-10k€ of hardware.

I had a look at Scaleway, IONOS and OVHcloud, but they all offer only the same outdated medium-sized models like gpt-oss-120b, llama-3.3:70b, Qwen3:32b. Scaleway at least recently added Qwen3.5:397b.

The best I could find so far seems to be Nebius. The problem here: they do offer MiniMax-M2.5, GLM-5 and Qwen3.5, but again only on US servers. Only older versions like MiniMax-M2.1 or GLM-4.7 are EU-hosted.

Is there just no infrastructure available in the EU to host such models at scale? I really hope that is not the reason.

What do you use? Please help!

Edit: I just found out that Nebius requires a Google, GitHub, or Microsoft account to sign up. Sad.

  • fediuser76@piefed.world
    1 day ago

    I use Mammouth.ai. It is similar to Regolo in that it is a privacy-friendly interface between you and a service via their API (for instance Mistral, Gemini, OpenAI, etc.). Not as good as running your own, but the next best thing IMHO. What I like about Mammouth is that you get access to many different services for a standard monthly fee.