Pyfagor - Visual Designer for GenAI Orchestration

Drag a node from the library

The left sidebar lists every block you can use: user input, retrieval, AI calls, tools, control flow, response. Drag any block onto the canvas - that's a node.

Cursor dragging a node from the library onto the canvas

Connect outputs to inputs

Each node has colored ports. Drag from an output port on one node to an input port on another to wire them up. The curve shows where data flows.

Tune each step in the inspector

Click a node to open the inspector on the right. Change the prompt, swap the model, set a retry, pick a doc source. The orchestration graph stays clean - config lives in the panel.

Inspector panel showing node configuration fields

Export the blueprint

Once the orchestration looks the way you want, select Export in the toolbar. You get a clean blueprint of your orchestration graph - ready to share, hand off to your backend, or implement directly.

Worked example · 6 min read

A grounded Q&A pipeline, designed on the canvas

A concrete example of how to use Pyfagor to design a simple orchestration layer - first the orchestration order in plain words, then how you lay it out node by node on the canvas.

The orchestration order

The user question never goes straight to the model. It enters the orchestration layer, and the layer owns every step from request to response. For a grounded Q&A over an internal handbook and a policies PDF, the sequence is:

The user interacts with the frontend and submits a question.
The orchestration layer receives the request as the single entry point.
The orchestration layer processes state - session, prior turns, routing decisions.
The orchestration layer builds context by retrieving and reranking chunks from both documents and merging them into one balanced prompt input.
The orchestration layer calls the model APIs, passing the question plus the built context - the docs are fed to the AI, not the other way around.
The orchestration layer validates the model output against the same context, checking every claim before it leaves the rail.
The orchestration layer returns the final, formatted response to the frontend.

That is the whole orchestration. The rest of this post shows how each step becomes a node on the Pyfagor canvas, and how the wires between them encode the order above.

The two-document Q&A blueprint on the Pyfagor canvas: a User Question node branches into two Document Source nodes, each feeding Search Documents, Rank Results, then Merge, Ask AI, Verify Answer, and Generate Response. — The full blueprint. Two parallel retrieval branches converge into a single answer pipeline.

Starting the orchestration graph: the question comes in

The entry node

Every orchestration graph needs an entry point. The User Question node is exactly that - it captures the user's raw, natural-language input and broadcasts it to whatever nodes are wired downstream.

Nothing else fires until something lands here. In our example, it sends the same query to two retrieval branches at once, which is what makes the rest of the pipeline parallel by default.

Two documents, two parallel branches

Sources feed parallel searches

The two Document Source nodes - one for handbook.pdf and one for policies.pdf - load, chunk, and index each file. They live side by side on the canvas because they have nothing to do with each other.

From each source, a Search Documents node takes the user's question and pulls the top-k chunks that look most relevant. Both searches happen at the same time - that parallelism is one of the first things you see clearly on the canvas that you'd otherwise have to reason about in code.

Each branch then runs through a Rank Results node. Vector search is good at lexical similarity, but it happily returns passages that sound related and aren't.

The reranker re-orders the chunks by true relevance to the question and trims the tail. This is the cheapest quality improvement in the whole pipeline, and almost always worth the extra hop.

The interesting decisions in a GenAI app aren't in the prompts. They're in how the pieces connect.

Bringing the branches back together

Two branches converge

The two ranked lists meet at a Merge node. It interleaves them, removes duplicates, and caps the result at a maximum number of chunks so the prompt stays inside the model's context window.

This is also where you'd enforce balance - making sure one document doesn't drown out the other. The merged context, together with the original question, is then passed to the Ask AI node, with a strict system prompt: answer only from the provided context, and cite the source for every claim.

Trust, but verify

Check, then ship

A model that's been told to ground its answer will still, sometimes, invent one. The Verify Answer node does a second pass: it checks that every claim in the draft answer can be traced back to a chunk in the merged context.

If something doesn't line up, you can loop back for another attempt, refuse to answer, or flag the response for review. This is the difference between a demo and something you'd actually put in front of users.

Finally, Generate Response takes the verified answer and wraps it in whatever shape your app needs - markdown for a chat UI, plain text for an email, a structured object for an API. It's deliberately the last node, so the formatting concern never leaks back into the model's reasoning.

What you actually get out

At runtime, the user sees a short, grounded answer with citations pointing back to the exact pages it came from.

At design time, you get a structured blueprint of the whole orchestration graph - every node, every connection, every config value - that your backend engineer can implement directly, or that you can drop into your own orchestration runtime.

Left: a chat panel showing the assistant answering with citations to handbook.pdf p.12 and policies.pdf p.4. Right: a JSON blueprint exported from the canvas listing the nodes, edges, and settings of the same orchestration graph. — Left: the user-facing answer with grounded citations. Right: the exported blueprint that produced it.

Try it in one click

Open the canvas and pick Two-Document Q&A from the starters menu. The exact blueprint described above will load on the canvas, fully wired and ready to edit. Swap the documents, change the model, add a third source - the orchestration stays visible the entire time.

Open the canvas

Design your AI backend
like a circuit, not a prompt.

How it Works

Drag a node from the library

Connect outputs to inputs

Tune each step in the inspector

Export the blueprint

A grounded Q&A pipeline, designed on the canvas

The orchestration order

Starting the orchestration graph: the question comes in

Two documents, two parallel branches

Bringing the branches back together

Trust, but verify

What you actually get out

Try it in one click

Design your AI backendlike a circuit, not a prompt.

How it Works

Drag a node from the library

Connect outputs to inputs

Tune each step in the inspector

Export the blueprint

A grounded Q&A pipeline, designed on the canvas

The orchestration order

Starting the orchestration graph: the question comes in

Two documents, two parallel branches

Bringing the branches back together

Trust, but verify

What you actually get out

Try it in one click

Design your AI backend
like a circuit, not a prompt.