I joined Lemmy back in 2020 and have been using it as @[email protected] until somewhere in 2023 when I switched to lemmy.world. I’m interested in systemd/Linux, FOSS, and Selfhosting.

  • 29 Posts
  • 14 Comments
Joined 3 years ago
cake
Cake day: June 10th, 2023

help-circle








  • Maybe not as good as Claude, but they are good enough, and open-source, and free. The US market is going to learn the hard way why open-source curbstomps greedy bullshit.

    Correct, the American frontier models Claude Opus, GPT 5.4, GPT 5.5, and Gemini 3 Pro still score better (while costing significantly more), but the runner ups are all Chinese models.

    Then it’ll just come from Germany or France or elsewhere. It doesn’t take millions of dollars to train a good model, despite these US companies pretending that it does.

    Well, it does. Deepseek-R1 cost $6 million and that was considered to be very cheap. Europe only really has Mistral’s models, Proton’s Lumo and several models that focus on transparency, ethically sourced training data, and supposedly better local language support (OpenEuroLLM, GPT-NL), but they’re by far not as good as other models and I don’t expect them to be for quite some time.


  • I’ve said this before. The Chinese models are significantly better and will outcompete the models from the US, it was just a matter of people realizing that.

    My other prediction, being that they will lobby for tariffs or banning Chinese models outright also seems to be coming true.

    A Dark-Money Campaign Is Paying Influencers to Frame Chinese AI as a Threat

    Build American AI, a nonprofit linked to a super PAC bankrolled by executives at OpenAI and Andreessen Horowitz, is funding a campaign to spread pro-AI messaging and stoke fears about China.

    I do wonder how Europe is going to react. Will they just focus on their home grown Mistral or will they consider Chinese open weight models? I feel like the EU is quite wary of anything Chinese and that many people won’t fully comprehend the actual security risk and that they will initially dismiss are avoid them, but they can’t ignore it forever. Qwen 3.6 35B which can be ran at home is already leaving Mistral’s latest models in the dust.



  • We are obviously looking at things like Mythos, which is more sophisticated at finding vulnerabilities. In the next week or so, we will be changing our tack on coding the open and making our code public until we’re on top of that risk.

    Most of our repos, unless they’re essential, will be removed for security reasons.

    Security by obscurity because security vulnerabilities don’t exist if you can’t see them




  • Companies like Claude with their AI subscriptions might be losing money, but I doubt paid-per-token AI API usage is not making them money. There are several companies like e.g. DeepInfra and Fireworks that have sprung up to sell specifically that. I don’t think simply multiplying API cost with expected usage is sufficient to estimate how much will be charged however, because I suspect that OpenAI and Claude currently have a significant profit margin since they seem to be the defacto duopoly in the US.

    The chart above shows that quite clearly, the vertical axis is the combined score on various benchmarks. The horizontal axis shows the price. OpenAI and Claude do score higher, but the price difference is enormous, even if it wasn’t a log scale (70$ vs 1.3k$ for similar results!). The competition of these companies could drastically reduce the margins of US companies,

    I therefore think the pricing will depend on whether the large US AI companies manage to lobby the government to enact laws to cripple the competition of Chinese companies under the guise of security.