Story

Why isn't everyone using Cerebras?

tghack Friday, November 14, 2025

I work at a mid-sized startup dealing with latency issues in customer-facing flows that use LLMs. Using OSS-120B seems preferable to 5-mini or Anthropic models in many cases when we need speed, intelligence, and cost control. Is there some catch here beyond needing to acquire higher rate limits?

3 1

Read on Hacker News Comments 1