Content Digest

Canonical shelf is Idea research because the skill ingests external sources and structures learning before you commit to a build direction. Research subphase fits URL/PDF ingestion, keyword web research, and stored digests under research/digests paths.

Also useful

Also useful

Where it fits

Example use

Digest three competitor YouTube walkthroughs into quizzes before choosing a feature set.

Example use

IdeaFind the right tools

Summarize a trending X thread and expand keywords with web research for niche opportunities.

Example use

Run a PDF product brief through Quiz-First to surface gaps before prototyping.

Example use

Turn tutorial videos into retained notes for newsletter or docs you will ship later.

How it compares

Structured learning workflow with Task-agent extraction—not a passive summarize plugin or MCP catalog browser.

Common Questions / FAQ

Who is content-digest for?

Solo builders and learners who ingest mixed media (video, social, web, PDF) and want Quiz-First retention and archived markdown under a research folder.

When should I use content-digest?

In Idea research when exploring a market or tech topic from URLs; in Grow content when turning external material into study notes; in Validate when digesting specs or landing research PDFs before scope lock.

Is content-digest safe to install?

It implies browser fetch, network calls, and filesystem writes for digests—review Security Audits on this page and restrict URLs and paths you trust.

SKILL.md

READMESKILL.md - Content Digest

# Content Digest

콘텐츠 → Quiz-First 학습 → 선택적 깊이 탐색 → 근본 개념 확장.

> **Task Agent 기반 설계**: 긴 컨텍스트는 subagent가 처리하고, 메인 세션은 최종 결론만 소비

## 아키텍처 원칙

1. **Context Separation**: 긴 자막/본문은 Task agent가 처리, 메인 세션은 가벼운 md 파일만 Read
2. **Clean Transcript**: 자막에서 번호, 시간 제거 → 순수 영어 텍스트만 추출
3. **Web Research Integration**: 추출된 키워드로 자동 웹 리서치
4. **Single Output**: 모든 처리 결과는 단일 md 파일로 저장

## 지원 콘텐츠

| 타입 | 추출 방법 | 저장 경로 |
|-----|----------|----------|
| YouTube | Task agent (yt-dlp + 정제) | `research/digests/youtube/` |
| X/Twitter | fetch-tweet 스킬 (api.fxtwitter.com) | `research/digests/tweet/` |
| Webpage | Task agent (browser + 정제) | `research/digests/web/` |
| PDF | Task agent (Read + 정제) | `research/digests/pdf/` |

## 핵심 원칙

1. **Quiz-First**: 요약 보기 전에 퀴즈부터 (Pretesting Effect → 9-12% 향상)
2. **Knowledge Gap**: 틀린 문제가 호기심을 만들고, 호기심이 기억을 강화
3. **선택적 깊이**: 사용자가 더 알고 싶은 부분만 깊게
4. **근본 확장**: 콘텐츠 너머의 기초 개념까지 웹 검색으로 확장

---

## 워크플로우 개요 (Task Agent 기반)

```
Phase 1: 콘텐츠 타입 감지
Phase 2: Task Agent 실행 (콘텐츠 추출 + 정제 + 웹 리서치 + md 저장)
Phase 3: 메인 세션에서 결과 md Read
Phase 4: Pre-Quiz (3문제)
Phase 5: 선택적 콘텐츠 제공
Phase 6: 본 퀴즈 (9문제)
Phase 7: Elaborative Interrogation
Phase 8: Foundation Expansion
Phase 9: 스키마 연결
Phase 10: 문서 업데이트 (퀴즈 결과 반영)
Phase 11: 후속 선택
```

---

## Phase 1: 콘텐츠 타입 감지

입력 패턴에 따라 콘텐츠 타입 자동 결정:

| 패턴 | 타입 |
|------|------|
| `youtube.com`, `youtu.be` | YouTube |
| `x.com`, `twitter.com` | X/Twitter |
| `http://`, `https://` (기타) | Webpage |
| `.pdf` 파일 경로 | PDF |

명확하지 않으면 사용자에게 확인:

```
AskUserQuestion:
questions:
  - question: "어떤 콘텐츠를 분석할까요?"
    header: "Type"
    options:
      - label: "YouTube 영상"
        description: "URL을 알려주세요"
      - label: "웹페이지/아티클"
        description: "URL을 알려주세요"
      - label: "PDF 문서"
        description: "파일 경로를 알려주세요"
```

---

## Phase 2: Task Agent 실행 (핵심)

> **메인 세션의 context를 보호하면서 긴 콘텐츠를 처리**

### 2-1. Task Agent 호출 패턴

```
Task:
  subagent_type: "general-purpose"
  description: "콘텐츠 추출 및 분석"
  prompt: |
    ## 목표
    {URL/파일경로}에서 콘텐츠를 추출하고 분석하여 md 파일로 저장

    ## 단계 (순서 중요)
    1. 콘텐츠 추출 (타입별 방법 적용)
    2. 텍스트 정제 (번호, 시간 제거 → 영어만 추출)
    3. 핵심 키워드 추출 (5-10개)
    4. 웹 리서치 (키워드별 WebSearch)
    5. **핵심 요약 생성** (3-5문장)
    6. **주요 인사이트 도출** (3개)
    7. **퀴즈 재료 생성** (요약/인사이트 기반으로 핵심 주제만)
    8. md 파일 저장

    ## 출력 경로
    research/digests/{type}/{YYYY-MM-DD}-{sanitized-title}.md
```

### 2-2. X/Twitter 추출 (fetch-tweet 스킬 활용)

> **Task Agent 불필요** - fetch-tweet 스크립트로 직접 추출 (짧은 콘텐츠)

```bash
# 트윗 원문 + 인게이지먼트 데이터 추출
python3 .claude/skills/fetch-tweet/scripts/fetch_tweet.py "{URL}" --json
```

JSON 응답에서 활용할 필드:
- `tweet.text`: 트윗 본문
- `tweet.author`: 작성자 정보 (name, bio, followers)
- `tweet.likes/retweets/views`: 인게이지먼트
- `tweet.quote`: 인용 트윗 (있을 경우 동일 구조)
- `tweet.media`: 첨부 이미지/영상

트윗은 짧으므로 Task Agent 없이 메인 세션에서 직접 처리.
인용 트윗이 있으면 함께 포함하여 분석.
저장 경로: `research/digests/tweet/{YYYY-MM-DD}-{author}-{short-topic}.md`

### 2-3. YouTube 추출 (Task Agent 내부)


```bash
# 1. 자막 추출
yt-dlp --write-auto-sub --sub-lang "en" --skip-download \
  --convert-subs vtt -o "%(title)s" "{URL}"

# 2. VTT → 순수 텍스트 변환
sed -E 's/^[0-9]+$//' | \                    # 번호 제거
sed -E 's/[0-9]{2}:[0-9]{2}:[0-9]{2}.*//g' | \  # 타임스탬프 제거
sed -E 's/<[^>]+>//g' | \                    # HTML 태그 제거
tr -s '\n' | \                               # 빈 줄 정리
grep -v '^$'                                 # 빈 줄 삭제
```

정제 결과: 순수 영어 텍스트만 남음 (시간, 번호, 중복 없음)

### 2-4. Webpage 추출 (Task Agent 내부)

```
1. mcp__claude-in-chrome__tabs_context_mcp
2. mcp__claude-in-chrome__tabs_create_mcp
3. mcp__claude-in

What is this skill?

Quiz-First learning: 9 questions across 3 difficulty levels before showing summaries

Supports YouTube (yt-dlp transcript), X/Twitter (fetch-tweet), webpage browser extract, and PDF per-page text/images

Task-agent architecture: long context offloaded; main session reads lightweight md outputs only

Single markdown output per source under research/digests/{youtube,tweet,web,pdf}/

Optional depth exploration and web search to extend beyond the source fundamentals

9 quiz questions across 3 difficulty levels (Quiz-First)

Pretesting Effect cited at 9–12% retention improvement

4 content types: YouTube, X/Twitter, webpage, PDF

Compatible agents: Claude Code, Cursor, Codex, Windsurf

Adoption & trust: 1.1k installs on skills.sh; 17 GitHub stars; 2/3 security scanners passed (skills.sh audits).

What do I get? / Deliverables

You get a single refined digest file, quiz-driven recall, and optional deeper research paths without keeping raw transcripts in the main conversation.

Single markdown digest under research/digests/{type}/

Nine-question quiz set across three difficulty tiers before full summary reveal

Optional expanded fundamentals from follow-up web research

Journey fit

Spans multiple journey phases - primary shelf plus alternate fits below.

Primary fit

Also useful

Also useful

Where it fits

Example use

Digest three competitor YouTube walkthroughs into quizzes before choosing a feature set.

Example use

IdeaFind the right tools

Summarize a trending X thread and expand keywords with web research for niche opportunities.

Example use

Run a PDF product brief through Quiz-First to surface gaps before prototyping.

Example use