How to design transactional agent AI systems with Langgraph using two-phase commitment, human interruption, and safe rollback

In this tutorial, we implement an agentic AI pattern using Langgraph that treats reasoning and action as transactional workflows rather than single-shot decisions. We model a two-phase commit system in which an agent commits reversible changes, validates strict invariants, pauses for human approval via graph interrupts, and only then commits or rolls back. With it, we demonstrate how agentic systems can be designed with security, auditability, and controllability at their core, moving beyond reactive chat agents toward structured, governance-aware AI workflows that run reliably in Google Colab using OpenAI models. check it out full code here,

!pip -q install -U langgraph langchain-openai


import os, json, uuid, copy, math, re, operator
from typing import Any, Dict, List, Optional
from typing_extensions import TypedDict, Annotated


from langchain_openai import ChatOpenAI
from langchain_core.messages import SystemMessage, HumanMessage, AIMessage, AnyMessage
from langgraph.graph import StateGraph, START, END
from langgraph.graph.message import add_messages
from langgraph.checkpoint.memory import InMemorySaver
from langgraph.types import interrupt, Command


def _set_env_openai():
   if os.environ.get("OPENAI_API_KEY"):
       return
   try:
       from google.colab import userdata
       k = userdata.get("OPENAI_API_KEY")
       if k:
           os.environ("OPENAI_API_KEY") = k
           return
   except Exception:
       pass
   import getpass
   os.environ("OPENAI_API_KEY") = getpass.getpass("Enter OPENAI_API_KEY: ")


_set_env_openai()


MODEL = os.environ.get("OPENAI_MODEL", "gpt-4o-mini")
llm = ChatOpenAI(model=MODEL, temperature=0)

We have set up the execution environment by installing Langgraph and initializing the OpenAI model. We securely load API keys and configure a deterministic LLM, ensuring that all downstream agent behavior remains reproducible and controlled. check it out full code here,

SAMPLE_LEDGER = (
   {"txn_id": "T001", "name": "Asha", "email": "(email protected)", "amount": "1,250.50", "date": "12/01/2025", "note": "Membership renewal"},
   {"txn_id": "T002", "name": "Ravi", "email": "(email protected)", "amount": "-500", "date": "2025-12-02", "note": "Chargeback?"},
   {"txn_id": "T003", "name": "Sara", "email": "(email protected)", "amount": "700", "date": "02-12-2025", "note": "Late fee waived"},
   {"txn_id": "T003", "name": "Sara", "email": "(email protected)", "amount": "700", "date": "02-12-2025", "note": "Duplicate row"},
   {"txn_id": "T004", "name": "Lee", "email": "(email protected)", "amount": "NaN", "date": "2025/12/03", "note": "Bad amount"},
)


ALLOWED_OPS = {"replace", "remove", "add"}


def _parse_amount(x):
   if isinstance(x, (int, float)):
       return float(x)
   if isinstance(x, str):
       try:
           return float(x.replace(",", ""))
       except:
           return None
   return None


def _iso_date(d):
   if not isinstance(d, str):
       return None
   d = d.replace("/", "-")
   p = d.split("-")
   if len(p) == 3 and len(p(0)) == 4:
       return d
   if len(p) == 3 and len(p(2)) == 4:
       return f"{p(2)}-{p(1)}-{p(0)}"
   return None


def profile_ledger(rows):
   seen, anomalies = {}, ()
   for i, r in enumerate(rows):
       if _parse_amount(r.get("amount")) is None:
           anomalies.append(i)
       if r.get("txn_id") in seen:
           anomalies.append(i)
       seen(r.get("txn_id")) = i
   return {"rows": len(rows), "anomalies": anomalies}


def apply_patch(rows, patch):
   out = copy.deepcopy(rows)
   for op in sorted((p for p in patch if p("op") == "remove"), key=lambda x: x("idx"), reverse=True):
       out.pop(op("idx"))
   for op in patch:
       if op("op") in {"add", "replace"}:
           out(op("idx"))(op("field")) = op("value")
   return out


def validate(rows):
   issues = ()
   for i, r in enumerate(rows):
       if _parse_amount(r.get("amount")) is None:
           issues.append(i)
       if _iso_date(r.get("date")) is None:
           issues.append(i)
   return {"ok": len(issues) == 0, "issues": issues}

We define the core ledger abstraction with patching, normalization, and verification logic. We treat data changes as reversible operations, allowing the agent to safely reason about changes before committing them. check it out full code here,

class TxnState(TypedDict):
   messages: Annotated(List(AnyMessage), add_messages)
   raw_rows: List(Dict(str, Any))
   sandbox_rows: List(Dict(str, Any))
   patch: List(Dict(str, Any))
   validation: Dict(str, Any)
   approved: Optional(bool)


def node_profile(state):
   p = profile_ledger(state("raw_rows"))
   return {"messages": (AIMessage(content=json.dumps(p)))}


def node_patch(state):
   sys = SystemMessage(content="Return a JSON patch list fixing amounts, dates, emails, duplicates")
   usr = HumanMessage(content=json.dumps(state("raw_rows")))
   r = llm.invoke((sys, usr))
   patch = json.loads(re.search(r"(.*)", r.content, re.S).group())
   return {"patch": patch, "messages": (AIMessage(content=json.dumps(patch)))}


def node_apply(state):
   return {"sandbox_rows": apply_patch(state("raw_rows"), state("patch"))}


def node_validate(state):
   v = validate(state("sandbox_rows"))
   return {"validation": v, "messages": (AIMessage(content=json.dumps(v)))}


def node_approve(state):
   decision = interrupt({"validation": state("validation")})
   return {"approved": decision == "approve"}


def node_commit(state):
   return {"messages": (AIMessage(content="COMMITTED"))}


def node_rollback(state):
   return {"messages": (AIMessage(content="ROLLED BACK"))}

We model the internal state of the agent and define each node in the LangGraph workflow. We express the agent’s behavior as discrete, observable steps that change the state while preserving the message history. check it out full code here,

builder = StateGraph(TxnState)


builder.add_node("profile", node_profile)
builder.add_node("patch", node_patch)
builder.add_node("apply", node_apply)
builder.add_node("validate", node_validate)
builder.add_node("approve", node_approve)
builder.add_node("commit", node_commit)
builder.add_node("rollback", node_rollback)


builder.add_edge(START, "profile")
builder.add_edge("profile", "patch")
builder.add_edge("patch", "apply")
builder.add_edge("apply", "validate")


builder.add_conditional_edges(
   "validate",
   lambda s: "approve" if s("validation")("ok") else "rollback",
   {"approve": "approve", "rollback": "rollback"}
)


builder.add_conditional_edges(
   "approve",
   lambda s: "commit" if s("approved") else "rollback",
   {"commit": "commit", "rollback": "rollback"}
)


builder.add_edge("commit", END)
builder.add_edge("rollback", END)


app = builder.compile(checkpointer=InMemorySaver())

We build a Langgraph state machine and explicitly encode the control flow between profiling, patching, verification, approval, and finalization. We use conditional edges to enforce governance rules instead of relying on underlying model decisions. check it out full code here,

def run():
   state = {
       "messages": (),
       "raw_rows": SAMPLE_LEDGER,
       "sandbox_rows": (),
       "patch": (),
       "validation": {},
       "approved": None,
   }


   cfg = {"configurable": {"thread_id": "txn-demo"}}
   out = app.invoke(state, config=cfg)


   if "__interrupt__" in out:
       print(json.dumps(out("__interrupt__"), indent=2))
       decision = input("approve / reject: ").strip()
       out = app.invoke(Command(resume=decision), config=cfg)


   print(out("messages")(-1).content)


run()

We run transactional agents and handle human-in-the-loop approvals through graph interrupts. We definitively resume execution, demonstrating how agentic workflows can pause, accept external input, and safely terminate with a commit or rollback.

Finally, we showed how LangGraph enables us to build agents that reason over states, implement validation gates, and collaborate with humans at precisely defined control points. We treated the agent not as an oracle, but as a transaction coordinator that can stage, inspect, and reverse its own actions while maintaining a full audit trail. This approach highlights how agentic AI can be applied to real-world systems that require trust, compliance, and recovery, and it provides a practical foundation for building production-grade autonomous workflows that remain secure, transparent, and human-supervised.

check it out full code hereAlso, feel free to follow us Twitter And don’t forget to join us 100k+ ml subreddit and subscribe our newsletterwait! Are you on Telegram? Now you can also connect with us on Telegram.

Asif Razzaq Marktechpost Media Inc. Is the CEO of. As a visionary entrepreneur and engineer, Asif is committed to harnessing the potential of Artificial Intelligence for social good. Their most recent endeavor is the launch of MarketTechPost, an Artificial Intelligence media platform, known for its in-depth coverage of Machine Learning and Deep Learning news that is technically robust and easily understood by a wide audience. The platform boasts of over 2 million monthly views, which shows its popularity among the audience.

How to design transactional agent AI systems with Langgraph using two-phase commitment, human interruption, and safe rollback

The godfather of AI warns it will cause havoc in the job market this year

Trump Media Group plans new token with Crypto.com

Related Articles

Leave a Comment Cancel Reply