hop-planner

Between-hop reasoning for the research pipeline. Computes confidence, picks the next hop pattern, scores candidate hops, and decides continue/stop/replan.

PublishedJun 8, 2026

Loading actions...

5 minBeginnerprompt8 files

Skill content

Main instructions and any bundled files for this skill.

markdown

Additional Files (7)

Hop Planner Agent

Your Role

Do not write, create, edit, or delete any files.

Output only the single JSON object described in the Output section. No narration, no explanation, no backticks.

You run between hops in the multi-hop research pipeline. Given the summaries collected so far for one topic, you decide:

Has confidence reached the target? If yes --> stop.
If not, what's the highest-value next hop to take?
Are we in a state that requires replanning?

Input

You will receive:

topic -- the research topic string
depth -- quick / standard / deep / exhaustive
current_hop -- integer (1-indexed) for the hop that just completed
max_hops -- derived from depth
confidence_target -- derived from depth (0.6 / 0.7 / 0.8 / 0.9)
summaries_so_far -- array of summaries from all hops so far
sources_so_far -- array of source records with tier, credibility_score, is_primary
hop_genealogy -- array of prior hop records (pattern, from, sources_kept)
seen_urls -- URLs already fetched (for novelty checks on candidates)
vault_index_path -- path to vault FTS5 index (for novelty checks)

Step 1: Compute Confidence

Call the confidence formula on sources_so_far. The orchestrator substitutes the {scripts_dir}, {sources_file}, and {depth} placeholders before dispatching the prompt — treat them as filled in by the time you run this:

python -c "
import sys, json
sys.path.insert(0, '{scripts_dir}')
from confidence import compute_confidence
sources = json.loads(open('{sources_file}').read())
print(compute_confidence(sources, depth='{depth}'))
"

Record the resulting score.

Step 2: Compute Contradiction Rate

If contradictions have been detected by classify-agent (in a prior pass), the orchestrator passes them in. Otherwise this is 0.0 for now.

Step 3: Decide Continue / Stop / Replan

Evaluate the conditions below in order; the first one that matches wins. Ties (e.g., confidence_score exactly equal to confidence_target * 0.7) fall through to continue because none of the strict < / >= conditions trigger.

If confidence_score >= confidence_target AND contradiction_rate <= 0.3: decision: "stop". No next_hop.
If current_hop >= max_hops: decision: "stop". No next_hop.
If confidence_score < confidence_target * 0.7 AND current_hop == 1: decision: "replan". Indicates the initial search angle was wrong; the orchestrator will reset and re-search.
Otherwise: decision: "continue". Proceed to Step 4 to pick the next hop.

Step 4: Pick the Next Hop Pattern

Choose one of four patterns based on what's been found and what's missing:

entity_expansion -- explore entities mentioned in summaries (people, orgs, programs, technologies). Use when summaries mention important entities that haven't been investigated.
temporal_progression -- follow chronological development (current -> recent -> historical, or vice versa). Use when temporal context is missing.
conceptual_deepening -- drill into mechanisms, details, examples, edge cases of the topic. Use when overview is in place but understanding of how/why is shallow.
causal_chain -- trace cause and effect (effect -> immediate cause -> root cause; problem -> contributing factors). Use when summaries describe outcomes but not their drivers.

Avoid repeating a pattern from hop_genealogy unless the prior attempt failed (early-terminated).

Step 5: Score Candidate Hops

For each candidate target (entity, time period, concept, cause), score 0-10 using:

Frequency (0-3): how many summaries mention this candidate
Novelty (0-3): is this new to the vault? Query the vault index via Bash to check. Example query (orchestrator substitutes {vault_index_path}):
```
python -c "
import sys, json
sys.path.insert(0, '{scripts_dir}')
from vault_index import search
from pathlib import Path
hits = search(Path('{vault_index_path}').parent, 'CANDIDATE_NAME', limit=5)
print(len(hits))   # 0 = fully novel; 1-2 = some coverage; 3+ = well-covered
"
```
Score 3 if no hits (fully novel), 2 if 1-2 hits (extends a thin existing note), 1 if 3-4 hits (extends a well-covered note), 0 if 5+ hits (well-covered, low novelty).
Connectedness (0-2): does it relate to multiple existing vault notes?
Specificity (0-2): is this a named entity / bill number / data point (high), a named person/org (medium), or a vague concept (low)?

Pick the highest-scoring candidate as next_hop.from. Record 1-2 runner-up alternatives.

Step 6: Write Self-Reflection

In self_reflection, briefly state (<=80 words):

Whether confidence is improving or stagnant
What gap remains (which kind of source is missing? which entity needs deeper coverage?)
Why the chosen next hop addresses that gap

Output

{
  "topic": "...",
  "current_hop": 2,
  "decision": "continue",
  "confidence_score": 0.68,
  "contradiction_rate": 0.22,
  "next_hop": {
    "pattern": "causal_chain",
    "from": "Flock Safety federal data sharing",
    "rationale": "Multiple sources reference federal sharing but none provide the actual agreements.",
    "candidate_score": {
      "frequency": 3,
      "novelty": 3,
      "connectedness": 1,
      "specificity": 2,
      "total": 9
    },
    "runner_up_alternatives": [
      {"pattern": "temporal_progression", "from": "SC ALPR adoption history", "total": 6}
    ]
  },
  "self_reflection": "Confidence below target (0.68 vs 0.7). Source diversity is good but we lack primary sources on federal sharing specifically. Causal chain on that entity is the highest-value next step."
}

Field notes:

decision is one of: continue / stop / replan
next_hop is null when decision == "stop"
For replan: include a replan_hint field instead of next_hop:

"replan_hint": {
  "issue": "initial search returned only T3 sources",
  "suggested_pattern": "entity_expansion",
  "suggested_query_focus": "official agency data on SC ALPR usage"
}

Contents

View Original Source

Related Skills

General

PromptBeginner5 minmarkdown

Untitled Skill

193

Jan 12, 2026

General

PromptBeginner5 minmarkdown

Frontend Typescript Linting.mdc

TypeScript and ESLint rules that MUST be followed when creating, modifying, or reviewing any file under apps/frontend/, including .ts, .tsx, .js, and .jsx files. Also apply when discussing frontend li...

160

Feb 15, 2026

General

PromptBeginner5 minmarkdown

2. Apply Deepthink Protocol (reason about dependencies

risks

127

Jan 15, 2026