Design an Accounting AI Agent for Accuracy and Trust

Design an Accounting AI Agent for Accuracy and Trust

@Propio

@Propio

Led the 0 → 1 design of an accounting agent that reduced manual review time by 60%, increased categorization accuracy to 90%, and drove 10× ARR growth through human-AI collaboration in regulated workflows.

Led the 0 → 1 design of an accounting agent that reduced manual review time by 60%, increased categorization accuracy to 90%, and drove 10× ARR growth through human-AI collaboration in regulated workflows.

MY ROLE

Solo Product Designer

(First Design Hire)

Solo Product Designer

(First Design Hire)

TEAM

Founder, CTO, 5 Engineers, 2 Business Operations

Founder, CTO,

5 Engineers,

2 Business Operations

TIMELINE

Sep - Dec 2025 (4 months)

Sep - Dec 2025 (4 months)

KEY WORDS

B2B SaaS, Data-Heavy, Regulated Workflow, AI Agent, AI Collaboration

B2B SaaS, Data-Heavy, Regulated Workflow, AI Agent, AI Collaboration

Features at a glance

Designing AI to facilitate tedious processes within regulated accounting workflows

As the first and only product designer at Propio, I led the design from 0 to 1, shaping research, interaction models, and system behavior. Working closely with the founding team, I ensured the product delivered measurable impact while maintaining trust and control for accountants.

May 2025

Discrepancy Found

There’s $132 discrepancy between your bank and Propio ledger this month. Please review them and finalize reconciliation.

Start Review

Customer Revenue Scaled by

9.6x

within 6 months
Check Review Efficiency Raised By

60%

validated by 10+ user testings
Categorization Accuracy Boost By

~50%

through human-AI collaboration

What is Propio?

Propio use AI to automate transaction categorization for accountants.

We support accountants in reviewing and categorizing transactions, making up nearly 70% of their workload. Propio uses AI to assist with categorization, reducing manual effort while keeping accountants in control.

Who we’re building for?

We design for small accounting firms working in regulated environments.

Our users are small accounting firms (2–15 people) managing dozens of client businesses at once. Their workflows are structured, high-stakes, and constantly reviewed, where even minor mistakes carry financial risk.

High review frequency

High review frequency

Regulated Workflow

Regulated Workflow

Low Error Tolerance

Low Error Tolerance

Problem Space

However, AI miscategorized transactions, creating more manual work and breaking trust.

1

Low AI Categorization Accuracy

When the AI made mistakes, the same errors often repeated, leading to frustration and declining trust from our clients.

Confidence Level Overview

Confidence Level Overview

520

520

180

180

240

240

40

40

AI Accuracy

AI Accuracy

Improve

Improve

40%

40%

2

Manual Work Was Still Heavy

Edge cases like checks and uploaded receipts still required full manual review, and accountants felt like they were double-checking the AI’s work.

Research Findings

Research revealed a new direction for the product.

Based on 8 user interviews and internal design workshops, we found that AI must preserve user autonomy, reduce cognitive load, and improve contextual accuracy so accountants can collaborate with AI without disrupting their workflow.

User Interviews & Team Workshops

User Interviews & Team Workshops

New Product Direction

New Product Direction

?

🙆🏻‍♀️

User Autonomy

Contextual Accuracy

⬇️

Load Reduction

Question 1: Where should AI live in an accountant’s workflow?

Question 1: Where should AI live in an accountant’s workflow?

We tested different ways of integrating AI into the accounting workflow without disrupting it.

To embed AI without disrupting how accountants already work, I experimented with three different interaction models to find the balance between automation and user autonomy, ensuring AI could support the workflow while keeping accountants in control.

1

Chat-based AI

2

Secondary Workspace

3

On-Demand Agent

Iteration 1

Chat-based AI

✅ Yes

  • Familiar interaction pattern

  • Effective for simple, one step tasks


❌ But

  • Hard to review and edit previous inputs

  • Users lost orientation in longer conversations

Iteration 2

Secondary Workspace

✅ Yes

  • Separated conversation from execution

  • Easier to review and edit prior inputs


❌ But

  • Context switching increased cognitive load

  • Users had to re orient each time they returned

Iteration 3 - Final Solution

On-Demand Agent

✅ What worked

  • Conversation and execution remained connected

  • Easy to review and adjust inputs inline

  • Reduced workflow interruption

Why the On-demand Agent Work?

The solution works because it respects how accountants operate. By keeping AI on demand rather than intrusive, it reduces cognitive interruption while maintaining control over the workflow.

I like that I can choose when to bring AI in. It helps me without pulling me away from what I’m working on.

Why the On-demand Agent Work?

The solution works because it respects how accountants operate. By keeping AI on demand rather than intrusive, it reduces cognitive interruption while maintaining control over the workflow.

I like that I can choose when to bring AI in. It helps me without pulling me away from what I’m working on.

Question 2: How can AI become more accurate in categorization?

Question 2: How can AI become more accurate in categorization?

Why is AI categorization accuracy low?

Accountants manage multiple clients, each with their own categorization rules and business context. Without understanding those nuances, AI often makes decisions based only on surface-level patterns, leading to incorrect categories.

Method

Examples

Amazon - $1,200

Amazon - $1,200

Office Supplies

Office Supplies

Company A

Company A

Office Supplies

Office Supplies

Professional Service

Professional Service

Amazon - $1,200

Amazon - $1,200

Company B

Company B

For this client, Amazon purchases over $500 are Professional Service

For this client, Amazon purchases over $500 are Professional Service

🗂️

Reason 1

Each client had unique rules and financial logic. Without that context, AI relied on general patterns instead of client-specific knowledge.

🔄

Reason 2

AI did not learn effectively from past corrections, so the same mistakes were repeated.

We introduced Human-in-the-loop interaction model

To address the accuracy gap, I worked closely with engineering to rethink how AI handled business context. We introduced a human in the loop model that allowed accountants to provide client specific context before decisions were finalized. This shifted AI from guessing to collaborating, building contextual memory directly into the workflow.

Human-in-the-loop Method

Collaboration Principles

👀

Make AI decisions transparent

So accountants can review suggestions with confidence.

⚠️

Let AI ask for missing context

Humans cannot see what AI is missing, so it should ask instead of guessing.

🔄

Learn from every correction

Each adjustment improves future predictions.

Solution — A Collaborative Calibration System

We rollout a collaborative system by making AI confidence visible, guiding accountants through calibration, and embedding client specific rules into memory. tThe system can learn and improves over time without disrupting the workflow.

Transaction Dashboard

The dashboard shows how AI performs in categorizing transactions, including the distribution of confidence levels and the overall accuracy rate.

Calibration Space

This is where AI and accountants collaborate to calibrate categorization results and improve accuracy.

Comparison

After the human-AI collaboration process, the accuracy rate significantly increases. Accountants can revisit the dashboard anytime if the accuracy drops again.

Human-in-the-loop Impact

This collaborative calibration system enables AI to continuously learn from accountants, significantly increasing categorization accuracy.

AI Acceptance Rate

⬆️

Repeated Corrections

⬇️

I used to spend almost an hour correcting one client. Now I finish the whole process in 15 minutes 😲

Feedback from user testing

Question 3: How can an AI handle repetitive transactions?

Question 3: How can an AI handle repetitive transactions?

Where do repetitive transactions happen?

Accountants manually review the bank statement, locate each check, open the image, and categorize it one by one. It’s repetitive, detail-heavy work that takes time and energy, especially when done across multiple clients.

⚠️ Repetitive, Time-consuming

⚠️ Manual Efforts

1

Review Statement

Review Statement

Amazon.com

2

Find Check Transaction

Find Check Transaction

Amazon.com

Amazon.com

3

Look into Check Image

Look into Check Image

4

Manually Categorize

Manually Categorize

Payee: Best Buy

Amount: $83.00

→ Category: Office Supplies

Payee: Best Buy

Amount: $83.00

→ Category: Office Supplies

How can AI help in handling check transaction?

AI automates the process by fetching check transactions from bank statements, using OCR to extract key details from check images, and suggesting categories for review. By preparing everything upfront, accountant can focus on reviewing and making the final call.

Step 1

Prompt AI to fetch checks

Accountants can ask AI to gather all check transactions from the bank statement and bring them into a single review space.

Step 2

Review and approve AI suggestions

AI clip the check image and suggest a proper category. Accountants simply review, adjust, and approve, shifting from manual processing to decision-making.

Impact - Automated Check Handling

The accountant’s role shifts from manual operator to informed reviewer, reducing repetitive handling and accelerating check transaction review.

Time to review a check

60% ⬇️

I can see the check right in the system without downloading anything. That’s crazy!

Feedback from user testing

Impact - Automated Check Handling

The accountant’s role shifts from manual operator to informed reviewer, reducing repetitive handling and accelerating check transaction review.

Time to review a check

60% ⬇️

I can see the check right in the system without downloading anything. That’s crazy!

Feedback from user testing

Let’s Get in touch!

Eager to uncover market opportunities and delivering state-of-the-art products with seamless, intuitive user experiences!

peisyuan0910@gmail.com

@ 2025 | Pei Syuan Chou

Let’s Get in touch!

Eager to uncover market opportunities and delivering state-of-the-art products with seamless, intuitive user experiences!

peisyuan0910@gmail.com

@ 2025 | Pei Syuan Chou