https://cerebras.ai/blog/introducing-cerebras-inference-ai-at-instant-speed
Cerebras inference delivers 450 tokens per second for Llama 3.1 70B, which is 20x faster than NVIDIA GPU-based hyperscale clouds.