Welcome Gemma 4: Frontier multimodal intelligence on device
Google DeepMind's Gemma 4 multimodal models are now available on Hugging Face under the Apache 2.0 license. They accept audio in addition to text and vision inputs and can be deployed anywhere from the cloud to edge devices. The release includes support across transformers, llama.cpp, MLX, WebGPU, and Rust, and the models post strong arena benchmark results.
Hugging Face · 22 min read
Tools