Zerogpu

Name: Zerogpu
Author: ZeroGPU

zerogpu/zerogpu-router·1 plugin

Offload classification, summarization, NER, PII detection, and short chat to ZeroGPU nano models so your agent stack stays fast and cheap.

Overview

zerogpu is a plugin marketplace for the Build phase that routes classification, summarization, extraction, and short chat to ZeroGPU small/nano models via 14 auto-invoked skills.

What is this marketplace?

14 auto-invoked skills for classification, summarization, extraction, and short chat
Routes NLP workloads to ZeroGPU small and nano models for cost optimization
Tagged flows: PII, NER, JSON extraction, follow-ups, and summarization
Productivity-category plugin sourced from agents/claude
Designed for automatic invocation rather than manual skill picking each turn
1 plugin (zerogpu-router)
14 auto-invoked skills stated in marketplace description
Plugin tags include nlp, classification, summarization, pii, ner, cost-optimization

Compatible agents: Claude Code, Cursor, Codex, any compatible agent

Community signal: 77 GitHub stars.

What problem does it solve?

Solo builders overpay and slow down agents by using large models for every label, summary, and PII scan.

Who is it for?

Agent founders optimizing inference cost on classification, summarization, NER, PII redaction, and lightweight chat sidecars.

Skip if: Projects with no external API allowance or teams that need exclusively on-device models with zero cloud routing.

What do I get? / Deliverables

After registration, repetitive NLP side tasks can auto-route to ZeroGPU small models while your main agent keeps frontier models for hard reasoning.

14 routed NLP skill hooks (classification, summarization, extraction, chat)
Cost-optimized path for PII, NER, and JSON extraction side tasks
Auto-invocation behavior for tagged productivity workflows

Plugins in this marketplace

1 plugin — install individually after you add the marketplace.

PluginVersion

Zerogpu RouterRoute classification, summarization, entity/PII/JSON extraction, follow-ups, and short chat to ZeroGPU small/nano models — 14 auto-invoked skills.—

Recommended Marketplaces

Ably Agent Skillsably/agent-skills

ably-agent-skills is the official Ably Claude Code marketplace that packages realtime messaging expertise into the ably …1 stars

Arm Referenceyerry262/arm-reference-mcp

arm-reference is a four-plugin Claude Code marketplace from yerry262 aimed at builders touching ARM—from bare-metal and …

Bgreenwell Pluginsbgreenwell/claude-plugins

bgreenwell-plugins is a two-plugin Claude Code marketplace by Brandon Greenwell aimed at builders who want another model…1 stars

Bingx Ai SkillsBingX-API/api-ai-skills

The bingx-ai-skills marketplace publishes BingX’s official Claude Code integration as one finance-category plugin that e…13 stars

Boar Networkboar-network/blockchain-mcp

Boar blockchain MCP is a plugin marketplace entry for solo builders who need on-chain facts inside an AI coding session …12 stars

Cc SwitchHuaer02/cc-switch

cc-switch is Huaer02’s Claude Code marketplace offering a single plugin that lets solo builders flip between API provide…

Journey fit

Primary fit

BuildIntegrations & version control

Routing inference to external small models is an integration decision made while wiring the agent product, before you optimize bills in production. zerogpu-router is a model-routing layer with fourteen auto-invoked skills—classic LLM vendor integration, not frontend polish.

How it compares

LLM routing integration marketplace, not a self-hosted weights repo or a design-system skill.

Common Questions / FAQ

Who is Zerogpu for?

Solo builders shipping Claude Code agents who want automatic routing of routine NLP tasks to ZeroGPU small and nano models.

When should I use Zerogpu?

Use it when you integrate cost-sensitive classification, summarization, entity or PII extraction, or short replies alongside your primary coding agent.

How do I add Zerogpu to my agent?

Add the Zerogpu Claude marketplace, enable the Zerogpu-router plugin from agents/claude, and configure ZeroGPU credentials per their docs.