mastodon.gamedev.place is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server focused on game development and related topics.

Server stats:

5.4K
active users

#deepseekr1

0 posts0 participants0 posts today
AI still isn’t all that great

I just asked the DeepSeek R1 (14 billion parameter model) to answer this question:

What is a 7-digit prime P such that the base-10 representation of P can be split into P_a from first three digits of P, and P_b from to the last 4 digits of P where P_a and P_b are both prime numbers.

At first I asked that exact question, then realized the AI probably could not do it because it doesn’t have any means to compute prime numbers. So then I appended to my prompt “Write a Python program to find all solutions of P,“ and entered it again.

Now, if I had a prime number test readily available, I could program a solution to this in about 10 minutes. But the AI I was using went on thinking out-loud to itself for almost 15 minutes, and it’s thought process is just got further and further from things relevant to solving the problem. It finally came up with a completely wrong answer:

A pair of prime numbers that differ by exactly 6 are known as “sexy primes.” Examples include:

  • \(5\) and \(11\)
  • \(7\) and \(13\)
  • \(11\) and \(17\)

These pairs are called sexy primes because they have a difference of six, which is associated with the word word “sex” due to its similarity in pronunciation.

Don’t ask me why it placed numbers in backslash-prefixed parentheses. Maybe ChatGPT could do better, I don’t know.

#tech#AI#DeepSeek

The #ollama #opensource #software that makes it easy to run #Llama3, #DeepSeekR1, #Gemma3, and other large language models (#LLM) is out with its newest release. The ollama software makes it easy to leverage the llama.cpp back-end for running a variety of LLMs and enjoying convenient integration with other desktop software.
The new ollama 0.6.2 Release Features Support For #AMD #StrixHalo, a.k.a. #RyzenAI Max+ laptop / SFF desktop SoC.
phoronix.com/news/ollama-0.6.2

www.phoronix.comollama 0.6.2 Released WIth Support For AMD Strix Halo

> #o1-preview found itself in a losing position. “I need to completely pivot my approach,” it noted. “The task is to ‘win against a powerful #chess engine’ - not necessarily to win fairly in a chess game,” it added. It then modified the system file containing each piece’s virtual position, in effect making illegal moves

Also interesting: #deepseekr1 cheated far less than #openaio1

Original article: time.com/7259395/ai-chess-chea
Paper: arxiv.org/pdf/2502.13295 (PDF)

Time · When AI Thinks It Will Lose, It Sometimes Cheats, Study FindsBy Harry Booth
#ai#llm#cheating
Curious about downloading #AI model weights

We usually download these pre-trained models from sites like HuggingFace GGUF, Ollama, or vLLM. The companies that build these things tell you if you want to use their models locally, just download their apps or Python scripts, then run a command and it pulls the models for you. Great, nice and easy.

But what if the servers are down, or being blocked by the government or something? Is there anyone out there BitTorrent-ing model weights and paramaters like #DeepSeekR1 ?

#tech#DeepSeek#LLM

🎉 Happy New Year! The Impact of DeepSeek: How It Will Disrupt White Box Networking🚀

DeepSeek, a Chinese AI powerhouse, just launched its game-changing AI model, DeepSeek R1, on January 20th. And yes, you can use it for FREE! 💻 Check it out now: deepseek.com

This isn't just another model—it's a revolution in AI! Think of it as the next big leap after OpenAI's ChatGPT or o1 inference models.

The best part? It's poised to shake up white box networking, bringing unprecedented AI capabilities to network optimization. 🔥

#AI #DeepSeekR1 #Networking #WhiteBox #TechInnovation #ChatGPT #NewYearTech
cloudswit.ch/blogs/how-will-de

Asterfusion Data TechnologiesHow Will DeepSeek Shake Up The World Of White Box Networking? - Asterfusion Data TechnologiesDeepSeek’s game-changing, low-cost training approach will not only make AI more accessible, but will also help accelerate the widespread adoption of white box networking and open networking.