vb
reach-vb.hf.co
did:plc:nuqpydkh6dnkabwxo4tcdsj5
yo! nvidia finally released the weights for Hymba-1.5B - outperforms Qwen and SmolLM2 w/ 6-12x less training
trained ONLY on 1.5T tokens
> massive reductions in KV cache size and improved throughput
> combines Mamba and Attention in a hybrid parallel architecture with a 5:1 ratio and meta-tokens
2024-11-26T19:34:12.570Z