NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
From 300KB to 69KB per Token: How LLM Architectures Solve the KV Cache Problem (news.future-shock.ai)
az09mugen 30 minutes ago [-]
Unrelated, but 69KB is how much RAM Voyager 1 has.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 18:39:18 GMT+0000 (Coordinated Universal Time) with Vercel.