Boolean Satisfiability
Build a circuit that decides satisfiability of a 3-CNF formula and emits a witness assignment if one exists.
Manhattan Reasoning runs the open-source Yosys toolchain on managed cloud FPGAs — synthesis, place-and-route, and on-device verification in seconds, not hours. Built for agents learning hardware design through reinforcement learning with verifiable rewards.
Reinforcement learning needs dense, fast feedback. A vendor cloud FPGA flow can take tens of minutes per build — fine for shipping a datacenter accelerator, fatal for an agent that needs thousands of reward signals an hour. We rebuilt the loop around lightweight open tooling.
Synthesis, place-and-route, and bitstream generation on managed FPGAs, returned as structured reward signals over a single API call.
Every submission is checked against a formal spec and on-device behavior — correctness, timing closure (Fmax), and resource usage (LUTs, FFs, DSPs).
No black-box vendor flow. The same Yosys/nextpnr pipeline runs locally and in the cloud, so rollouts are deterministic and auditable.
A graded benchmark of classic NP-complete problems, reframed as hardware-design tasks. Agents must reason from a formal specification to a synthesizable Verilog accelerator — then earn verifiable rewards for correctness, speed, and area.
Build a circuit that decides satisfiability of a 3-CNF formula and emits a witness assignment if one exists.
Given a weighted graph, produce hardware that finds a Hamiltonian tour of total cost ≤ K, or proves none exists.
Assign one of k colors to each vertex so no edge is monochromatic. Stream the adjacency matrix; emit a valid coloring.
Decide whether any subset of a multiset of integers sums exactly to T, and return the selecting bitmask.
Find a set of ≤ k vertices touching every edge. Reward scales with how tight a cover the agent can certify in hardware.
Maximize value under a weight budget. The agent designs a pipelined DP datapath and is scored on throughput.
Decide whether a cycle visits every vertex exactly once, returning the cycle order when one exists.
Find the largest fully-connected subgraph. The hardest family in the suite — reward rewards both correctness and clique size.
Pack items into the fewest fixed-capacity bins. Agents trade area for parallelism in their packing datapath.
Every rollout is a closed loop from policy to silicon and back — no human in the path, no ambiguous reward.
The policy reads a problem spec from the suite and proposes a synthesizable hardware module.
Yosys + nextpnr synthesize, place, route, and load the design onto a managed FPGA in seconds.
Graded instances run against a formal oracle: correctness, plus Fmax and resource usage.
A structured, multi-axis reward returns to the trainer — verifiable, reproducible, and fast.
Drop Manhattan Reasoning into any RL trainer. Submit a candidate design against a benchmark instance and get back a structured reward with full build telemetry — correctness, timing, and area. No FPGA hardware to manage, no vendor toolchain to license.
We're onboarding research labs and frontier teams to the private beta of our cloud FPGA platform and the NP-Complete Suite.
Request beta access →