google

Google: Gemma 3n 2B (free)

by google

OpenRouter

About

Gemma 3n E2B IT is a multimodal, instruction-tuned model developed by Google DeepMind, designed to operate efficiently at an effective parameter size of 2B while leveraging a 6B architecture. Based on the MatFormer architecture, it supports nested submodels and modular composition via the Mix-and-Match framework. Gemma 3n models are optimized for low-resource deployment, offering 32K context length and strong multilingual and reasoning performance across common benchmarks. This variant is trained on a diverse corpus including code, math, web, and multimodal data.

Specifications

Context Length 8K
Max Output Tokens 2K
Modality text->text
Input text
Output text
Supported Parameters max_tokens, response_format, seed, temperature, top_p
Content Moderation No

Error Rate

Based on feedback reported by Free LLM Router users. Error rate reflects the percentage of requests that encountered issues such as rate limiting, unavailability, or errors. Learn more.

Loading chart...

Availability

Available 41 of 41 tracked days. Daily snapshots show whether this model was accessible as a free model on OpenRouter.

FebMar
ModelDays181920212223242526272812345678910111213141516171819
Google: Gemma 3n 2B (free)30

More from Google

Frequently Asked Questions

Is Google: Gemma 3n 2B (free) free to use?

Yes, Google: Gemma 3n 2B (free) is completely free to use through OpenRouter. You can access it via the Free LLM Router API at no cost.

What is the context window for Google: Gemma 3n 2B (free)?

Google: Gemma 3n 2B (free) supports a context window of 8K tokens (8,192 tokens).

What is the maximum output length for Google: Gemma 3n 2B (free)?

Google: Gemma 3n 2B (free) can generate up to 2K tokens (2,048 tokens) per response.