Papers
Topics
Authors
Recent
Search
2000 character limit reached

Town Hall Debate Prompting

Updated 12 March 2026
  • Town Hall Debate Prompting is a multi-agent framework that partitions LLMs into distinct personas for structured, multi-round debates.
  • It leverages adversarial critique, role-specific argumentation, and consensus voting to significantly improve reasoning and classification tasks.
  • The mechanism integrates dynamic persona creation, turn-based debate protocols, and explicit scoring metrics to simulate expert panels and legislative deliberations.

Town Hall Debate Prompting (THDP) is a multi-agent prompting framework in which LLMs are partitioned into distinct personas or agents that engage in structured, multi-round debate to solve reasoning, classification, or generative tasks. THDP harnesses argument diversity, explicit role-play, adversarial critique, and consensus-building mechanisms to simulate human expert panels, legislative bodies, or civic town halls, yielding significant empirical improvements over standard one-shot or Chain-of-Thought (CoT) prompting approaches.

1. Formal Definition and Debate Mechanics

THDP operates by instantiating NN expert personas within an LLM (or across multiple LLMs), each endowed with role-specific prior knowledge, reasoning styles, or argumentative goals. Initial problem input is encoded as a state vector s0s_0. For each round t=1,,Rt = 1, \dots, R, every persona ii generates an argument ai,t=pi(st1;ϕi)a_{i,t} = p_i(s_{t-1}; \phi_i), where ϕi\phi_i parameterizes the persona’s viewpoint or cognitive style, and pip_i is a persona-specific reasoning function. The collective state is updated as st=f({ai,t}i=1N)s_t = f(\{a_{i,t}\}_{i=1}^N), typically by aggregating the debate contributions. After RR rounds, each persona votes (or scores candidate solutions): vi=g(ϕi,sR)v_i = g(\phi_i, s_R). The final answer is chosen by majority or via aggregate scoring.

Structured turn-taking protocols govern the debate, typically comprising:

  • Opening statement phase: Each persona presents an initial argument without seeing peers’ responses.
  • Rebuttal and critique rounds: Personas critique peer arguments, surface flaws, defend their prior reasoning, and refine their positions.
  • Voting or consensus: Each persona states a preferred solution, followed by majority voting, scoring, or audience polling.

Distinct phases can be orchestrated by a moderator or system prompt, and all exchanges are typically formal, evidence-driven, and may involve explicit reasoning breakdowns (Chain-of-Thought).

2. Persona and Role Construction

Persona creation in THDP can be dynamic or preset:

  • Dynamic persona synthesis: The LLM is prompted to select NN diverse expert roles targeted to the problem context, such as Logic Specialist, Constraint Solver, Pattern Recognizer, Devil’s Advocate, or Consensus-Builder (Sandwar et al., 28 Jan 2025). Personas may differ by domain expertise, argumentation style, stance (Pro/Con/Neutral), or simulated demographic attributes (e.g., by HEXACO or political leaning) (Karanjai et al., 31 Mar 2025, Chan et al., 2024).
  • Preset roles and pragmatic action tags: In civic and policy simulation, roles are extracted and consolidated from real-world deliberative transcripts, including attributes such as goals, tone, policy positions, and fine-grained speech act tags (e.g., [propose_motion], [ask_question], [call_vote]) (Merrill et al., 21 Nov 2025). These tags are used at inference time to steer the turn-level pragmatics of debate.

Persona-aware prompting or persona knowledge-aligned prompt tuning further enables the injection of persona-encoded knowledge directly into the input sequence through soft tokens, audience role construction, or RAG-based in-context persona retrieval (Chan et al., 2024, Karanjai et al., 31 Mar 2025).

3. Protocol Design and Implementation Variants

THDP protocols are modular and extensible. Key operational paradigms include:

  • Debate-driven binary classification: Two agents debate opposing hypotheses (e.g., phishing/legitimate for email detection), with a third judge agent scoring arguments on coherence, evidence, and rebuttal, and issuing a final binary verdict (Nguyen et al., 27 Mar 2025).
  • Multi-speaker logical reasoning: 3–15 personas, each with an LLM-determined expert role, reason stepwise, rebut peers, and collectively solve complex MCQ or logic grid tasks, with best results at N=5N=5 (Sandwar et al., 28 Jan 2025).
  • Prompt and instruction evolution: Multiple prompt variants are "owned" by competing agents that defend, critique, and propose crossovers to synthesize improved prompts (DEEVO), with debate transcripts guiding genetic operations and Elo-based scoring tracking prompt quality across Town Hall sessions (Nair et al., 30 May 2025).
  • Action-aware civic simulation: Persona-tagged and action-attributed dialogue (e.g., city council, court or school board meetings) enables simulation of highly realistic, procedurally-constrained town halls, with significant gains in perplexity, speaker fidelity, and fool rates over prompt-only baselines (Merrill et al., 21 Nov 2025).
  • Audience and voting integration: Non-agent participants (judge panels or synthetic “audience”) can be polled per round, or after each phase, to drive scoring, aggregate consensus, or focus the debate via live question injection (Karanjai et al., 31 Mar 2025, Srivastava et al., 21 May 2025).

A stylized round structure is outlined below:

Phase Description Example Prompt Element
Opening Each persona issues initial argument “Please deliver a 2–3 sentence opening statement.”
Critique/Rebuttal Personas refute others, reinforce their case “Given all prior statements, respond to claims you dispute. You have 150 tokens.”
Audience Q&A Audience/member asks clarifying/focused questions “Audience member X: Please clarify your position on…”
Voting/Scoring Judges/personas/audience rate arguments, aggregate consensus “Rate each side; choose a final label: PHISHING or LEGITIMATE. Provide justification.”
Moderator Controls timing, turn order, and agenda “Enforce 2 min per turn. Maintain order: Moderator→Alice→Bob→…”

4. Empirical Results and Theoretical Insights

Application of THDP across varied settings has yielded consistent performance improvements relative to baselines:

  • On logic puzzles (ZebraGrid), THDP with N=5N=5 expert personas improves per-cell accuracy by 13 percentage points over one-shot CoT (from 36.0% to 49.0% for GPT-4o) and achieves double-digit gains on hard-puzzle accuracy (Sandwar et al., 28 Jan 2025).
  • Reflect–Critique–Refine (RCR) prompting, a staged subroutine in multi-agent debate, leads to 1.9–3.7% absolute gains on quantitative reasoning, with halved LLM sycophancy rates compared to naïve multistep prompting (Srivastava et al., 21 May 2025).
  • In phishing email detection, agent diversity (heterogeneous agent pairs such as GPT-4+LLaMA-2) outperforms homogeneous pairs, and debate structure alone achieves high accuracy without the need for explicit Chain-of-Thought or role anchoring (Nguyen et al., 27 Mar 2025).
  • In tuneable prompt optimization, debate-driven generation with Elo rating selection (DEEVO) outperforms both manual prompt engineering and existing automated methods on both open- and closed-ended tasks, while preserving prompt diversity (Nair et al., 30 May 2025).
  • Persona-aligned prompt tuning for argument quality and persuasion yields significant gains: macro F1 improvements of 7.5–9.4 points over strong tuning baselines on impact classification and 2.5 pp accuracy increase on persuasion prediction (Chan et al., 2024).

These studies demonstrate that increasing the number and diversity of debating personas systematically improves accuracy and reasoning depth up to a problem-dependent optimum (N=5N=5 for logic tasks, N=3N=3 for cost-aware RCR), but further increases can trigger off-topic or incoherent interactions, especially for smaller models (Sandwar et al., 28 Jan 2025, Srivastava et al., 21 May 2025).

5. Evaluation Metrics and Consensus Mechanisms

Debate quality and final result reliability are assessed using a range of metrics:

  • Standard task metrics: accuracy, macro/micro F1, cell-level and puzzle-level accuracy, win rate (quality judged by GPT-4 or similar), and controversy controllability (Sandwar et al., 28 Jan 2025, Li et al., 2024).
  • Consensus and voting: Majority rule (per persona or audience votes); score aggregation (e.g., Si=αcoherencei+βevidencei+γrebuttaliS_i = \alpha\cdot\text{coherence}_i + \beta\cdot\text{evidence}_i + \gamma\cdot\text{rebuttal}_i) (Nguyen et al., 27 Mar 2025).
  • Distributional metrics: Jensen–Shannon divergence between model-generated and human survey-response distributions (Karanjai et al., 31 Mar 2025).
  • Speaker realism: Classifier fool rate (CFR), speaker attribution accuracy (SAA), and human Turing-style paired identification accuracy (Merrill et al., 21 Nov 2025).

Debate convergence is typically signaled by stable majority agreement, maximum rounds reached, or a voting threshold. Some frameworks introduce vote-fraction–weighted policy rewards (e.g., rTownHall(x,yi)r_\text{TownHall}(x,y_i) incorporating audience vote fraction viv_i) (Srivastava et al., 21 May 2025).

6. Applications, Variations, and Limitations

THDP has been instantiated for:

Key limitations include possible convergence on incorrect consensus (debate ≠ ground truth), model drift to neutral positions unless stance is reinforced, token inefficiency from long debates, dominant speakers, and the need for sampling or clustering in high-traffic audience settings (Sandwar et al., 28 Jan 2025, Srivastava et al., 21 May 2025, Li et al., 2024).

7. Future Directions

Research proposes the following extensions and open questions:

Town Hall Debate Prompting thus provides a modular, extensible paradigm for advancing LLM reasoning, persuasiveness, and simulation fidelity, grounded in iterated multi-agent interaction and role diversity, and empirically validated across reasoning, deliberation, and prompt optimization tasks (Sandwar et al., 28 Jan 2025, Srivastava et al., 21 May 2025, Karanjai et al., 31 Mar 2025, Chan et al., 2024, Nair et al., 30 May 2025, Merrill et al., 21 Nov 2025, Nguyen et al., 27 Mar 2025, Li et al., 2024).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Town Hall Debate Prompting.