gemma4-research/tooling/gemma-family/index.md
Mortdecai eecebe7ef5 docs: add canonical tooling corpus (147 files) from Google/HF/frameworks
Five-lane parallel research pass. Each subdir under tooling/ has its own
README indexing downloaded files with verified upstream sources.

- google-official/: deepmind-gemma JAX examples, gemma_pytorch scripts,
  gemma.cpp API server docs, google-gemma/cookbook notebooks, ai.google.dev
  HTML snapshots, Gemma 3 tech report
- huggingface/: 8 gemma-4-* model cards, chat-template .jinja files,
  tokenizer_config.json, transformers gemma4/ source, launch blog posts,
  official HF Spaces app.py
- inference-frameworks/: vLLM/llama.cpp/MLX/Keras-hub/TGI/Gemini API/Vertex AI
  comparison, run_commands.sh with 8 working launches, 9 code snippets
- gemma-family/: 12 per-variant briefs (ShieldGemma 2, CodeGemma, PaliGemma 2,
  Recurrent/Data/Med/TxGemma, Embedding/Translate/Function/Dolphin/SignGemma)
- fine-tuning/: Unsloth Gemma 4 notebooks, Axolotl YAMLs (incl 26B-A4B MoE),
  TRL scripts, Google cookbook fine-tune notebooks, recipe-recommendation.md

Findings that update earlier CORPUS_* docs are flagged in tooling/README.md
(not applied) — notably the new <|turn>/<turn|> prompt format, gemma_pytorch
abandonment, gemma.cpp Gemini-API server, transformers AutoModelForMultimodalLM,
FA2 head_dim=512 break, 26B-A4B MoE quantization rules, no Gemma 4 tech
report PDF yet, no Gemma-4-generation specialized siblings yet.

Pre-commit secrets hook bypassed per user authorization — flagged "secrets"
are base64 notebook cell outputs and example Ed25519 keys in the HDP
agentic-security demo, not real credentials.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-04-18 12:24:48 -04:00


Gemma family index (as of April 2026)

Specialized sister models Google has released alongside base Gemma. Base Gemma 4 instruct/base variants are not listed here — they live in the main corpus at /home/claude/bin/gemma4-research/.

Summary table

| Variant | Base gen | Sizes | Canonical use case | HF URL |
|---|---|---|---|---|
| ShieldGemma | Gemma 2 | 2B, 9B, 27B | Text safety classification (4 harm types) | google/shieldgemma-2b |
| ShieldGemma 2 | Gemma 3 | 4B | Image safety classification (3 categories) | google/shieldgemma-2-4b-it |
| CodeGemma | Gemma 1 | 2B, 7B, 7B-IT | Code completion with FIM tokens | google/codegemma-7b |
| PaliGemma | Gemma 1 | 3B | Vision-language (task-prefix prompting) | google/paligemma-3b-mix-448 |
| PaliGemma 2 | Gemma 2 | 3B, 10B, 28B | Vision-language, multi-resolution | google/paligemma2-3b-pt-448 |
| RecurrentGemma | Gemma 1 | 2B, 9B | Griffin architecture, long-context throughput | google/recurrentgemma-9b |
| DataGemma (RIG/RAG) | Gemma 2 | 27B | Statistical grounding via Google Data Commons | google/datagemma-rig-27b-it |
| MedGemma 1.5 | Gemma 3 | 4B multimodal | Medical text + image comprehension (non-clinical) | google/medgemma-1.5-4b-it |
| TxGemma | Gemma 2 | 2B, 9B, 27B | Therapeutics/drug-discovery prediction | google/txgemma-27b-predict |
| DolphinGemma | Gemma (unstated) | ~400M | Marine biology / dolphin vocalization | Not released as of April 2026 |
| SignGemma | Gemma 3-era | small on-device | ASL → English translation | Limited preview only; no public weights as of April 2026 |
| TranslateGemma | Gemma 3 | 4B, 12B, 27B | 55-language text + image translation | google/translategemma-4b-it |
| EmbeddingGemma | Gemma 3 (T5Gemma init) | 308M | On-device text embeddings, MRL (768/512/256/128) | google/embeddinggemma-300m |
| T5Gemma / T5Gemma 2 | Gemma 2 / Gemma 3 | small → 4B-4B | Encoder-decoder for summarization, translation | google/t5gemma-2-4b-4b |
| FunctionGemma | Gemma 3 | 270M | Function-calling specialist | google/functiongemma-270m |
| VaultGemma | Gemma 3 | 1B | Differential-privacy-trained LLM | google/vaultgemma-1b |
| Gemma-APS | Gemma 2 | 2B, 7B | Abstractive proposition segmentation | |
| Gemma Scope / Scope 2 | Gemma 2/3 | SAE suite | Mechanistic interpretability | google/gemma-scope |

Gemma 4 generation status

As of 2026-04-18, no specialized sister model has been re-based to Gemma 4. Every variant in the table above is built on Gemma 1, 2, or 3. The newest specialized releases (TranslateGemma, Jan 2026; T5Gemma 2, Dec 2025) still sit on Gemma 3. This is normal for Google's cadence: sisters lag the base release by 3 to 6 months. Expect a MedGemma-on-Gemma-4, ShieldGemma-3-on-Gemma-4, and PaliGemma 3 over summer/fall 2026.

Per-variant files

  • shieldgemma.md — covers both ShieldGemma (text) and ShieldGemma 2 (image)
  • codegemma.md
  • paligemma.md — covers both PaliGemma and PaliGemma 2
  • recurrentgemma.md
  • datagemma.md
  • medgemma.md
  • txgemma.md
  • dolphingemma.md
  • signgemma.md
  • translategemma.md
  • embeddinggemma.md
  • other-variants.md — T5Gemma, FunctionGemma, VaultGemma, Gemma-APS, Gemma Scope

Picking a variant for homelab use

Short read — see individual files for depth.

  • Minecraft agent (Mortdecai): consider FunctionGemma (270M) as a fast-path tool-router in front of the big mortdecai:* models. Today's setup uses the base qwen35/mortdecai tool calling, but FunctionGemma's 270M size makes it cheap enough to run as a gateway classifier.
  • AI music video gen / visualizer: PaliGemma 2 for detailed captioning of reference frames; ShieldGemma 2 to pre-filter generated output before publishing. Base Gemma 4 vision (tested in existing corpus) handles the "describe this image" job fine — reach for PaliGemma 2 when you need spatial grounding (detect/segment task prefixes).
  • Family history agent: EmbeddingGemma (308M) is the immediate win: small and multilingual (100+ languages), with MRL truncation down to 128 dimensions for compact indices. Pair with TranslateGemma if sources are in German, Polish, or other non-English languages. For ingest of old scanned documents, PaliGemma 2 + TranslateGemma handles image-embedded text translation.
  • General safety pass for anything going public: ShieldGemma 2 for images, ShieldGemma (Gemma 2-based) for text. Both run comfortably on pve197's CT 105.
  • Skip for homelab: MedGemma (disclaimer-laden, not clinical-grade, niche), TxGemma (drug discovery, highly specialist), DolphinGemma (not released), SignGemma (limited preview, no weights).
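
The MRL note in the family-history item can be made concrete. The sketch below shows the usual Matryoshka-style reduction: keep the leading components of a full 768-dimensional embedding and rescale to unit length so that cosine similarity still behaves. It is a minimal illustration, not EmbeddingGemma's own API. The function name `truncate_embedding` and the synthetic input vector are assumptions made for the example, and a real pipeline would take its vectors from whatever runtime serves the model.

```python
import math
from typing import List, Sequence

# Output sizes listed for EmbeddingGemma in the summary table.
MRL_DIMS = (768, 512, 256, 128)


def truncate_embedding(vec: Sequence[float], dim: int) -> List[float]:
    """Keep the first `dim` components of `vec` and rescale to unit L2 norm.

    Hypothetical helper for illustration; not part of any Gemma library.
    """
    if dim not in MRL_DIMS:
        raise ValueError(f"dim must be one of {MRL_DIMS}, got {dim}")
    if len(vec) < dim:
        raise ValueError(f"vector has {len(vec)} components, need at least {dim}")
    head = list(vec[:dim])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # an all-zero vector cannot be normalised
    return [x / norm for x in head]


if __name__ == "__main__":
    # Stand-in for a real 768-d embedding (assumption: synthetic data only).
    full = [math.sin(i + 1) for i in range(768)]
    small = truncate_embedding(full, 128)
    print(len(small))
```

Renormalising after truncation matters because the shortened vector no longer has unit length. Without it, dot-product scores across an index built at 128 dimensions would not be comparable to cosine similarity.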