DeepSeek’s distilled new R1 AI model can run on a single GPU

Posted by nyheder »

DeepSeek’s updated R1 reasoning AI model might be getting the bulk of the AI community’s attention this week. But the Chinese AI lab also released a smaller, “distilled” version of its new R1, DeepSeek-R1-0528-Qwen3-8B, that DeepSeek claims beats comparably sized models on certain benchmarks. The smaller updated R1, which was built using the Qwen3-8B model Alibaba […]

Source: https://techcrunch.com/2025/05/29/deeps ... ingle-gpu/