- Home
- Top
- New
- Best
- Show
- Ask
- Jobs
- Search

Download on App Store

Get the latest tech news delivered straight to your iPhone or Mac with push notifications and widgets

Contact
© 2025 Zumi Studios

- Home
- Top
- New
- Best
- Show
- Ask
- Jobs
- Search

Download on App Store

Get the latest tech news delivered straight to your iPhone or Mac with push notifications and widgets

Contact
© 2025 Zumi Studios

Story

Post-transformer inference: 224× compression of Llama-70B with improved accuracy

anima-core Wednesday, December 10, 2025

25 9

zenodo.org

Visit article Read on Hacker News Comments 9