
Grepai Embeddings Ollama
Point GrepAI at a local Ollama embedder so code search embeddings stay on your machine with no paid API.
Install
npx skills add https://github.com/yoanbernabeu/grepai-skills --skill grepai-embeddings-ollamaWhat is this skill?
- Documents Ollama advantages: privacy, no API cost, low latency, offline use, model choice
- Basic `embedder` block: provider ollama, model `nomic-embed-text`, localhost endpoint
- Supports custom endpoints including remote Ollama hosts on the LAN
- Install and serve flow: `ollama serve` and `ollama pull nomic-embed-text`
- Optional explicit embedding dimensions in config when models require them
Adoption & trust: 531 installs on skills.sh; 17 GitHub stars; 1/3 security scanners passed (skills.sh audits); trending (+100% hot-view momentum).
Recommended Skills
Microsoft Foundrymicrosoft/azure-skills
Azure Aimicrosoft/azure-skills
Azure Hosted Copilot Sdkmicrosoft/azure-skills
Lark Eventlarksuite/cli
Running Claude Code Via Litellm Copilotxixu-me/skills
Setup Matt Pocock Skillsmattpocock/skills
Journey fit
Primary fit
Embedding provider setup happens when you integrate local LLM tooling into the repo search stack during Build. Integrations fits wiring an external runtime (Ollama) into GrepAI via `embedder` config and health checks.
Common Questions / FAQ
Is Grepai Embeddings Ollama safe to install?
skills.sh reports 1 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Grepai Embeddings Ollama
# GrepAI Embeddings with Ollama This skill covers using Ollama as the embedding provider for GrepAI, enabling 100% private, local code search. ## When to Use This Skill - Setting up private, local embeddings - Choosing the right Ollama model - Optimizing Ollama performance - Troubleshooting Ollama connection issues ## Why Ollama? | Advantage | Description | |-----------|-------------| | 🔒 **Privacy** | Code never leaves your machine | | 💰 **Free** | No API costs or usage limits | | ⚡ **Speed** | No network latency | | 🔌 **Offline** | Works without internet | | 🔧 **Control** | Choose your model | ## Prerequisites 1. Ollama installed and running 2. An embedding model downloaded ```bash # Install Ollama brew install ollama # macOS # or curl -fsSL https://ollama.com/install.sh | sh # Linux # Start Ollama ollama serve # Download model ollama pull nomic-embed-text ``` ## Configuration ### Basic Configuration ```yaml # .grepai/config.yaml embedder: provider: ollama model: nomic-embed-text endpoint: http://localhost:11434 ``` ### With Custom Endpoint ```yaml embedder: provider: ollama model: nomic-embed-text endpoint: http://192.168.1.100:11434 # Remote Ollama server ``` ### With Explicit Dimensions ```yaml embedder: provider: ollama model: nomic-embed-text endpoint: http://localhost:11434 dimensions: 768 # Usually auto-detected ``` ## Available Models ### Recommended: nomic-embed-text ```bash ollama pull nomic-embed-text ``` | Property | Value | |----------|-------| | Dimensions | 768 | | Size | ~274 MB | | Speed | Fast | | Quality | Excellent for code | | Language | English-optimized | **Configuration:** ```yaml embedder: provider: ollama model: nomic-embed-text ``` ### Multilingual: nomic-embed-text-v2-moe ```bash ollama pull nomic-embed-text-v2-moe ``` | Property | Value | |----------|-------| | Dimensions | 768 | | Size | ~500 MB | | Speed | Medium | | Quality | Excellent | | Language | Multilingual | Best for codebases with non-English comments/documentation. **Configuration:** ```yaml embedder: provider: ollama model: nomic-embed-text-v2-moe ``` ### High Quality: bge-m3 ```bash ollama pull bge-m3 ``` | Property | Value | |----------|-------| | Dimensions | 1024 | | Size | ~1.2 GB | | Speed | Slower | | Quality | Very high | | Language | Multilingual | Best for large, complex codebases where accuracy is critical. **Configuration:** ```yaml embedder: provider: ollama model: bge-m3 dimensions: 1024 ``` ### Maximum Quality: mxbai-embed-large ```bash ollama pull mxbai-embed-large ``` | Property | Value | |----------|-------| | Dimensions | 1024 | | Size | ~670 MB | | Speed | Medium | | Quality | Highest | | Language | English | **Configuration:** ```yaml embedder: provider: ollama model: mxbai-embed-large dimensions: 1024 ``` ## Model Comparison | Model | Dims | Size | Speed | Quality | Use Case | |-------|------|------|-------|---------|----------| | `nomic-embed-text` | 768 | 274MB | ⚡⚡⚡ | ⭐⭐⭐ | General use | | `nomic-embed-text-v2-moe` | 768 | 500MB | ⚡⚡ | ⭐⭐⭐⭐ | Multilingual | | `bge-m3` | 1024 | 1.2GB | ⚡ | ⭐⭐⭐⭐⭐ | Large codebases | | `mxbai-embed-large` | 1024 | 670MB | ⚡⚡ | ⭐⭐⭐⭐⭐ | Maximum accuracy | ## Performance Optimization ### Memory Management Models load into RAM. Ensure sufficient memory: | Model | RAM Required | |-------|--------------| | `nomic-embed-text` | ~500 MB | | `nomic-embed-text-v2-moe` | ~800 MB | | `bge-m3` | ~1.5 GB | | `mxbai-embed-large` | ~1 GB | ### GPU Acceleration Ollama automatically uses: - **macOS:** Metal (Apple Silicon) - **Linux/Windows:** CUDA (NVIDIA GPUs) Check GPU usage: ```bash ollama ps ``` ### Keeping Model Loaded By default, Ollama unloads models after 5 minutes of inactivity. Keep loaded: ```bash # Keep model loaded indefinitely cur