Design a modular architecture that separates control, reasoning, and verification.
Controller Module#
- Parses the task, initializes state, and decides whether to continue iterating.
- Maintains a recursion counter, confidence score, and optional exploration parameters.
Reasoning Core#
- Applies domain-specific transformations: pattern recognition, logical deduction, symbolic manipulation.
- Designed with tiny parameter counts (e.g., two-layer transformer or gated recurrent cell).
Verifier/Reflector#
- Evaluates candidate outputs, estimating correctness or alignment with constraints.
- Produces self-critique tokens that the controller feeds back into the next iteration.
Memory Buffer#
- Stores intermediate states, partial proofs, or extracted features for reference.
- Enables rollback when later iterations degrade quality.
Iteration Flow#
- Controller reads task and initializes state.
- Reasoning core proposes an intermediate solution.
- Verifier evaluates and generates critique signals.
- Controller updates state and decides to continue, adjust strategy, or stop.
- If converged, output final answer with reasoning trace.