
Retrieving Datacloud
Guide agents through retrieving and shaping Salesforce Data Cloud hybrid search indexes, DMO fields, chunking, and embedding configuration from structured metadata.
Install
npx skills add https://github.com/forcedotcom/sf-skills --skill retrieving-datacloudWhat is this skill?
- Hybrid search index JSON skeleton with source DMO, chunk DMO, and vector DMO developer names
- Chunking via passage_extraction with max_tokens 512 and strip_html true in examples
- Vector embedding model e5_large_v2 with dimension 1024 and max_token_limit 512
- Part of the Salesforce *-datacloud skill family with shared CREDITS and UPSTREAM docs
- Field-level chunkingConfiguration on structured Data Cloud DMO text fields
Adoption & trust: 611 installs on skills.sh; 513 GitHub stars; 3/3 security scanners passed (skills.sh audits).
Recommended Skills
Journey fit
Canonical shelf is Build because the skill encodes Data Cloud retrieval patterns—indexes, DMO developer names, chunk/vector DMOs—for implementing CRM data products. Integrations subphase matches Salesforce Data Cloud DMO wiring, hybrid searchType, passage_extraction chunking, and e5_large_v2 embedding blocks in the skill payload.
Common Questions / FAQ
Is Retrieving Datacloud safe to install?
skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Retrieving Datacloud
# Credits & Acknowledgments Primary contributor: **Gnanasekaran Thoppae** This skill is part of the `*-datacloud` family. Shared attribution, upstream source mapping, and maintenance notes live in: - [../orchestrating-datacloud/CREDITS.md](../orchestrating-datacloud/CREDITS.md) - [../orchestrating-datacloud/UPSTREAM.md](../orchestrating-datacloud/UPSTREAM.md) { "label": "<INDEX_NAME>", "developerName": "<INDEX_NAME>", "description": "Hybrid search index on a structured Data Cloud DMO", "sourceDmoDeveloperName": "<SOURCE_DMO>__dlm", "chunkDmoName": "<INDEX_NAME> chunk", "chunkDmoDeveloperName": "<INDEX_NAME>_chunk", "vectorDmoName": "<INDEX_NAME> index", "vectorDmoDeveloperName": "<INDEX_NAME>_index", "searchType": "HYBRID", "vectorEmbedding": { "vectorEmbeddingRelatedFields": [] }, "rankingConfigurations": [], "chunkingConfiguration": { "fieldLevelConfigurations": [ { "sourceDmoDeveloperName": "<SOURCE_DMO>__dlm", "sourceDmoFieldDeveloperName": "<TEXT_FIELD>__c", "config": { "id": "passage_extraction", "userValues": [ { "id": "max_tokens", "value": "512" }, { "id": "strip_html", "value": "true" } ] } } ] }, "vectorEmbeddingConfiguration": { "embeddingModel": { "id": "e5_large_v2", "userValues": [ { "id": "dimension", "value": "1024" }, { "id": "max_token_limit", "value": "512" } ] }, "index": { "id": "HNSW", "userValues": [] }, "similarityMetric": "COSINE" } } { "label": "My_kav", "developerName": "My_kav", "sourceDmoDeveloperName": "ssot__KnowledgeArticleVersion__dlm", "chunkDmoName": "My_kav chunk", "chunkDmoDeveloperName": "My_kav_chunk", "vectorDmoName": "My_kav index", "vectorDmoDeveloperName": "My_kav_index", "searchType": "VECTOR", "vectorEmbedding": { "vectorEmbeddingRelatedFields": [] }, "chunkingConfiguration": { "fieldLevelConfigurations": [ { "sourceDmoDeveloperName": "ssot__KnowledgeArticleVersion__dlm", "sourceDmoFieldDeveloperName": "ssot__Name__c", "config": { "id": "passage_extraction", "userValues": [ { "id": "strip_html", "value": "true" }, { "id": "max_tokens", "value": "512" } ] } } ] }, "vectorEmbeddingConfiguration": { "embeddingModel": { "id": "e5_large_v2", "userValues": [ { "id": "dimension", "value": "1024" }, { "id": "max_token_limit", "value": "512" } ] }, "index": { "id": "HNSW", "userValues": [] }, "similarityMetric": "COSINE" }, "rankingConfigurations": [] } # retrieving-datacloud Query and search workflows for Salesforce Data Cloud. ## Use this skill for - quick SQL counts - paginated SQL (`sqlv2`) - async query lifecycles - table describe - vector search - hybrid search with optional prefilter - search index inspection and lifecycle work ## Example requests ```text "Run a Data Cloud SQL query against unified profiles" "Describe this Data Cloud table before I write SQL" "Help me troubleshoot vector search in Data Cloud" "Run a hybrid search with a prefilter in Data Cloud" "Create and inspect a search index" ``` ## Common commands ```bash sf data360 query sql -o myorg --sql 'SELECT COUNT(*) FROM "ssot__Individual__dlm"' 2>/dev/null sf data360 query describe -o myorg --table ssot__Individual__dlm 2>/dev/null sf data360 search-index list -o myorg 2>/dev/null sf data360 query vector -o myorg --index Knowledge_Index --query "reset password" --limit 5 2>/dev/null sf data360 query hybrid -o myorg --index Knowledge_Index --query "reset password" --limit 5 2>/dev/null ``` ## Example payloads - [examples/search-indexes/vector-knowledge.json](examples/search-indexes/vector-knowledge.json) - [examples/search-indexes/hybrid-structured.json](examples/search-indexes/hybrid-structured.js