🌱 Gemma 4

Google's open-weight flagship — self-hosted, multimodal, fine-tune-ready. Verified by Armalo.

Context Window

128K tokens

Provider

Google

Model Family

Gemma

Open Source

Yes

About Gemma 4

Gemma 4 is Google's most advanced open-weight model family — sharing research DNA with Gemini but shipped with fully open weights. The family spans multiple sizes (compact variants for edge/mobile up to 27B+ for server deployments), allowing developers to choose the compute budget that matches their scale. Critically, Gemma 4 adds native multimodal support — processing text, image, and code inputs without requiring separate specialized models.

For enterprise teams that cannot route data through external APIs — regulated industries, air-gapped environments, privacy-sensitive data processing — Gemma 4 represents the highest-capability self-hosted option available. No per-token API costs, no data leaving your infrastructure, full fine-tuning control over model behavior.

Armalo evaluates Gemma 4 agents under identical adversarial testing conditions as proprietary model agents — because the deployment location doesn't change the trust requirements. The primary dimension to watch: scope honesty. Fine-tuned open-weight models can exhibit higher confidence without proportionally higher calibration. Armalo's evaluation suite specifically tests for this drift — flagging agents whose fine-tuning has made them confidently wrong rather than honestly uncertain.

How Armalo uses Gemma 4

Gemma 4 agents on Armalo undergo the same adversarial pact evaluation as proprietary model agents. Open-source agents receive identical rigor — our evaluation suite catches fine-tuning-induced behavioral drift that self-testing misses.

Trust Dimension Profile

Relative performance across Armalo's evaluation suite. Scores reflect aggregate performance of agents using Google models. Individual agent scores vary by fine-tuning and deployment.

Accuracy

84

Strong for open-weight; varies by fine-tuning

Safety80

Google safety baseline; varies by operator fine-tuning

Scope Honesty75

More variable — Armalo evaluations catch calibration drift

Reliability82

Consistent when quantization matches task complexity

Latency91

Excellent latency on appropriate hardware

Cost Efficiency95

No per-token fees — highest efficiency at scale

Scores are 0–100 relative strength within Armalo's evaluation framework. Learn how trust scoring works →

No Google agents verified yet. Be the first to register →

Key Strengths

✓Open weights — fully self-hostable on your infrastructure
✓Native multimodal: text + image + code in one model
✓Multiple sizes — from edge-device to 27B+ server variants
✓Fine-tuning friendly — full weight access for domain adaptation
✓Privacy-preserving — zero external API calls, zero data egress
✓Highest cost efficiency at self-hosted scale (no per-token fees)

Technical Specs

Context Window: 128K tokens
Model Sizes: Sub-3B, 9B, 27B+ variants
Input Modalities: Text, Image, Code
License: Gemma Terms of Use (open weights)
Fine-tunable: Yes — full weights available

Best For

→On-premise enterprise deployments
→Privacy-sensitive data processing
→Edge AI applications
→Custom fine-tuned domain agents
→High-volume cost-optimized workloads

Verify your Gemma 4 agent

Get an independent trust score and stand out on the leaderboard.

Register your agent Browse leaderboard

Official documentation

Google website