Network Flow

Version:

Experience level: Beginner
Reasoning types: Prescriptive
Industry: Supply Chain & Logistics
Tags: AllocationLPNetwork Flow
Experience level: Beginner
Reasoning types: Prescriptive
Industry: Supply Chain & Logistics
Tags: AllocationLPNetwork Flow

What this template is for

Maximum flow is a classic optimization problem: given a directed network with capacities, find how much material (or traffic, bandwidth, product, etc.) you can push through the network. This template builds a simple max-flow linear program using prescriptive reasoning (optimization) that chooses a non-negative flow on each edge, respects edge capacities, and enforces flow conservation.

Prescriptive reasoning helps you:

Estimate throughput of a constrained network.
Identify bottlenecks by inspecting which edges saturate.
Compare designs by editing capacities and re-solving.

Who this is for

You want a small, end-to-end example of prescriptive reasoning (optimization) with RelationalAI.
You’re comfortable with basic Python and linear optimization concepts.

What you’ll build

A semantic model with an Edge concept loaded from CSV.
A continuous decision variable Edge.x_flow on each edge.
Capacity and conservation constraints defined with require(...).
A max-flow objective solved with the HiGHS backend.

What’s included

Model + solve script: network_flow.py
Sample data: data/edges.csv

Prerequisites

Access

A Snowflake account that has the RAI Native App installed.
A Snowflake user with permissions to access the RAI Native App.

Tools

Python >= 3.10

Quickstart

Follow these steps to run the template with the included sample data.

Download the ZIP file for this template and extract it:

curl -O https://private.relational.ai/templates/zips/v0.13/network_flow.zip
unzip network_flow.zip
cd network_flow

Create and activate a virtual environment

python -m venv .venv
source .venv/bin/activate
python -m pip install -U pip

Install dependencies

python -m pip install .

Configure Snowflake connection and RAI profile

rai init

Run the template

python network_flow.py

Expected output

Status: OPTIMAL
Maximum flow: 13

Edge flows:
 i  j  flow
 1  2   8.0
 1  3   5.0
 2  4   4.0
 2  5   4.0
 3  5   2.0
 3  6   3.0
 4  6   4.0
 5  6   6.0

Template structure

.
├─ README.md
├─ pyproject.toml
├─ network_flow.py     # main runner / entrypoint
└─ data/               # sample input data
  └─ edges.csv

Start here: network_flow.py

Sample data

Data files are in data/.

`edges.csv`

Each row represents a directed edge from node i to node j with capacity cap.

Column	Meaning
`i`	Source node ID
`j`	Target node ID
`cap`	Maximum flow capacity

Model overview

The model defines one concept (Edge) and one decision variable (Edge.x_flow).

`Edge`

Represents a directed, capacitated edge.

Property	Type	Identifying?	Notes
`i`	int	Yes	Loaded from `data/edges.csv`
`j`	int	Yes	Loaded from `data/edges.csv`
`cap`	float	No	Capacity for the edge
`flow`	float	No	Decision variable (continuous)

How it works

This section walks through the highlights in network_flow.py.

Import libraries and configure inputs

First, the script imports the Semantics and optimization APIs, configures the data directory, and sets a constant source node:

from pathlib import Path

import pandas
from pandas import read_csv

from relationalai.semantics import Model, data, per, require, select, sum
from relationalai.semantics.reasoners.optimization import Solver, SolverModel

# --------------------------------------------------
# Configure inputs
# --------------------------------------------------

DATA_DIR = Path(__file__).parent / "data"

# Disable pandas inference of string types. This ensures that string columns
# in the CSVs are loaded as object dtype. This is only required when using
# relationalai versions prior to v1.0.
pandas.options.future.infer_string = False

# Source node for the max-flow objective.
SOURCE_NODE = 1

Define concepts and load CSV data

Next, it creates a Model, defines the Edge concept, and loads data/edges.csv into that concept using data(...).into(...):

# --------------------------------------------------
# Define semantic model & load data
# --------------------------------------------------

# Create a Semantics model container.
model = Model("network_flow", config=globals().get("config", None), use_lqp=False)

# Edge concept: directed edges with endpoints (i, j) and capacity (cap).
Edge = model.Concept("Edge")

# Load edge data from CSV.
edges_csv = read_csv(DATA_DIR / "edges.csv")
data(edges_csv).into(Edge, keys=["i", "j"])

Define decision variables, constraints, and objective

Then it creates a SolverModel, declares Edge.x_flow and marks it as a decision variable with solve_for(...), and models the bounds and conservation constraints with require(...):

# --------------------------------------------------
# Model the decision problem
# --------------------------------------------------

Ei = Edge
Ej = Edge.ref()

# Create a continuous optimization model.
s = SolverModel(model, "cont")

# Edge.x_flow decision variable: flow on each edge.
Edge.x_flow = model.Property("{Edge} has {flow:float}")
s.solve_for(Edge.x_flow, name=["flow", Edge.i, Edge.j])

# Constraint: flow must be non-negative and cannot exceed edge capacity.
bounds = require(
   Edge.x_flow >= 0,
   Edge.x_flow <= Edge.cap
)
s.satisfy(bounds)

# Constraint: flow conservation at each node (inflow equals outflow).
flow_out = per(Ei.i).sum(Ei.flow)
flow_in = per(Ej.j).sum(Ej.flow)
balance = require(flow_in == flow_out).where(
   Ei.i == Ej.j
)
s.satisfy(balance)

# Objective: maximize total flow out of the source node.
total_flow = sum(Edge.x_flow).where(
   Edge.i == SOURCE_NODE
)
s.maximize(total_flow)

Solve and print results

Finally, it solves with Solver("highs") and prints the termination status, objective value, and a filtered flow table (only edges with Edge.x_flow > 0.001):

# --------------------------------------------------
# Solve and check solution
# --------------------------------------------------

solver = Solver("highs")
s.solve(solver, time_limit_sec=60)

print(f"Status: {s.termination_status}")
print(f"Maximum flow: {s.objective_value:.0f}")

flows = select(Edge.i, Edge.j, Edge.x_flow).where(Edge.x_flow > 0.001).to_df()

print("\nEdge flows:")
print(flows.to_string(index=False))

Troubleshooting

I get ModuleNotFoundError when running the script

Confirm you created and activated the virtual environment from the Quickstart.
Reinstall dependencies with python -m pip install ..
Verify you are running python network_flow.py from the network_flow/ folder.

The script fails while reading data/edges.csv

Confirm the file exists at data/edges.csv.
Verify the CSV includes headers i, j, and cap.
Check that cap values are numeric and non-negative.

My Edge flows table is empty

The output is filtered to Edge.x_flow > 0.001; small flows will not display.
If the maximum flow is 0, check that there are edges leaving SOURCE_NODE = 1 and that their capacities are positive.

I see an unexpected termination status (not OPTIMAL)

Try re-running; if you hit a time limit, consider increasing time_limit_sec.
Sanity-check the data for missing values (especially capacities).
If you modified the model, revert to the sample data to confirm the baseline still solves.