This is a heavily interactive web application, and JavaScript is required. Simple HTML interfaces are possible, but that is not what this is.
Post
Abdoulaye Diack
diack.bsky.social
did:plc:x6m5zvil3cirjfu3co4uf2gb
PaliGemma 2 mix is out! This model can now handles short/long captioning, OCR, image Q&A, object detection, and segmentation. Available in 3B, 10B, and 28B parameter sizes and 224px/448px resolutions. Frameworks: Hugging Face Transformers, Keras, PyTorch, JAX, and Gemma.cpp.
goo.gle/4i1jOOU
https://goo.gle/4i1jOOU
2025-02-19T17:51:20.153Z