RFQ Deviation Analyzer

What I learned building an AI-powered document analysis tool for industrial procurement

View the project on GitHub: fioreale/rfq-deviation-analyzer

What I Learned Building a Deviation Analysis Prototype for EPC Contractors

While researching how AI could solve real problems in traditional industries, I came across an interesting sector: industrial equipment procurement. EPC contractors and equipment vendors deal with hundreds of RFQ documents annually — each one containing complex technical specifications, engineering standards, and compliance requirements buried in 30-100 page PDFs.

I wanted to understand the problem deeply, so I spent a few days building a focused prototype: an AI-powered deviation analysis tool for industrial equipment RFQs.

Here’s what I learned — both technically and strategically.


The Problem: Procurement in Industrial Equipment

When a valve manufacturer receives an RFQ from an EPC contractor, they need to:

  1. Extract technical specifications (size, pressure rating, materials)
  2. Identify required certifications (API 6D, NACE MR0175, API 607)
  3. Compare everything against their standard product catalog
  4. Flag deviations that require custom engineering
  5. Generate clarification questions for ambiguous requirements
  6. Prepare an accurate quotation

Today this is done manually. An experienced engineer spends 2-4 hours per RFQ, with a high error rate on complex specifications. Missed deviations mean lost money or lost bids. One overlooked NACE compliance requirement can mean quoting the wrong materials, delivery delays with penalty clauses, or — worst case — safety issues and liability exposure.

The cost of a single error often exceeds the annual cost of software that could have caught it.


What I Built

An open-source prototype that analyzes RFQ documents and identifies specification deviations from standard product offerings, running entirely on local AI.

The processing flow is straightforward:

  1. Upload an RFQ PDF
  2. Extract text and technical parameters using a local LLM
  3. Compare extracted specs against YAML-defined standard offerings
  4. Highlight deviations categorized by severity
  5. Generate clarification questions for ambiguous specs
  6. Export a detailed Excel report

A key design choice: 100% local inference using Ollama. Documents never leave the user’s machine. Zero marginal cost per analysis. Works completely offline. For sensitive industrial documents containing proprietary specifications, this tradeoff matters.
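The extraction step above can be sketched against Ollama's local HTTP API. This is an illustrative sketch, not the prototype's actual code: the endpoint and request shape are Ollama's documented defaults, but the function names and the prompt wording are assumptions.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local endpoint


def build_extraction_prompt(rfq_text: str) -> str:
    """Ask the model for structured specs as strict JSON (illustrative wording)."""
    return (
        "Extract the technical parameters from this RFQ excerpt. "
        "Reply with JSON only, using keys like pressure_rating, material, "
        "test_standard, fire_safe.\n\n" + rfq_text
    )


def parse_spec_json(raw: str) -> dict:
    """Tolerate models that wrap the JSON object in surrounding prose."""
    start, end = raw.find("{"), raw.rfind("}")
    if start == -1 or end == -1:
        raise ValueError("no JSON object in model output")
    return json.loads(raw[start : end + 1])


def extract_specs(rfq_text: str, model: str = "llama3.3:latest") -> dict:
    """One round-trip to the local model; the document never leaves the machine."""
    payload = json.dumps({
        "model": model,
        "prompt": build_extraction_prompt(rfq_text),
        "stream": False,  # return one complete response instead of a token stream
    }).encode()
    req = urllib.request.Request(
        OLLAMA_URL, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    return parse_spec_json(body["response"])
```

Because the model runs locally, each call costs nothing beyond electricity, which is what makes bulk screening viable.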

Example Output

| Parameter       | RFQ Requirement  | Standard Spec | Deviation         | Severity |
|-----------------|------------------|---------------|-------------------|----------|
| Pressure Rating | PN63             | PN40          | Upgrade Required  | Major    |
| Material        | CF8M (ASTM A351) | CF8M          | Match             |          |
| Test Standard   | API 598          | API 598       | Match             |          |
| Fire Safe       | API 607          | Not Standard  | Addition Required | Major    |
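The check behind a table like this can be sketched in a few lines. The standard offering is shown inline as it might look after loading the YAML catalog, and the matching and severity rules are illustrative assumptions, not the prototype's actual logic.

```python
# Standard offering as it might look after loading the YAML catalog
# (shown inline to keep the sketch self-contained).
STANDARD = {
    "pressure_rating": "PN40",
    "material": "CF8M (ASTM A351)",
    "test_standard": "API 598",
    # fire_safe deliberately absent: not part of the standard offering
}

# Illustrative severity rules -- a real catalog would define these per parameter.
MAJOR = {"pressure_rating", "fire_safe"}


def severity(param: str) -> str:
    return "Major" if param in MAJOR else "Minor"


def matches(required: str, offered: str) -> bool:
    # Treat "CF8M (ASTM A351)" as matching a standard "CF8M".
    return required == offered or offered in required or required in offered


def find_deviations(rfq: dict, standard: dict = STANDARD) -> list[tuple]:
    """Return (parameter, deviation, severity) rows for every RFQ requirement."""
    rows = []
    for param, required in rfq.items():
        offered = standard.get(param)
        if offered is None:
            rows.append((param, "Addition Required", severity(param)))
        elif matches(required, offered):
            rows.append((param, "Match", ""))
        else:
            rows.append((param, "Deviation", severity(param)))
    return rows
```

Running it on the specs from the example above flags the PN63 pressure rating and the API 607 fire-safe requirement as major deviations while letting the material and test-standard rows pass as matches.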

Technical Challenges

PDF Table Extraction Is Harder Than It Looks

RFQ specifications live in tables, but PDFs don’t preserve structure well. The prototype uses PyMuPDF, which handles machine-generated PDFs adequately but struggles with merged cells, complex layouts, and multi-column formats. Scanned documents fail entirely without OCR.

AI-powered extraction (like GPT-4 Vision) can achieve near-perfect results on hard cases, but at $0.50-$2.00 per document versus $0.00 for the local approach. For high-value RFQs worth $100K+, that cost is trivial. For bulk screening, it adds up fast. This isn’t just a technical decision — it’s a product decision about where accuracy matters most.

Accuracy vs. Speed Is a Product Decision

More detailed LLM prompts yield higher accuracy but slower processing. The right balance depends on context: for $5M bids, engineers want maximum accuracy. For initial screening of 50 RFQs to prioritize the top 10, speed and cost matter more. A production system should offer both modes.
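A dual-mode design could be as simple as a pair of presets. The prototype doesn't ship these; the mode names, model tags, and the bid-size threshold below are all hypothetical, sketching how the tradeoff might be surfaced.

```python
# Hypothetical mode presets: the accuracy/speed tradeoff as configuration.
MODES = {
    # fast screening: smaller model, terse prompt, first pages only
    "screening": {"model": "llama3.2:3b", "prompt_detail": "terse", "max_pages": 10},
    # full analysis for high-value bids: larger model, exhaustive prompting
    "full": {"model": "llama3.3:latest", "prompt_detail": "exhaustive", "max_pages": None},
}


def settings_for(bid_value_usd: float) -> dict:
    """Pick a mode from bid size -- the $100K threshold is illustrative."""
    return MODES["full"] if bid_value_usd >= 100_000 else MODES["screening"]
```

The point is that the switch lives in product configuration, not buried in prompt engineering.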

Source Citation Is Non-Negotiable

Engineers won’t trust “AI says NACE required.” They need to see where it’s stated — the page number, the section, the exact text. Maintaining source metadata through the processing pipeline added complexity but proved essential. Without it, users verify everything manually and the tool provides no time savings.
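One way to keep provenance attached is to make it part of the extracted value's type, so no pipeline stage can drop it silently. This is a minimal sketch; the class name, fields, and the sample NACE clause are assumptions for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExtractedValue:
    """A spec value that never loses track of where it came from."""
    parameter: str
    value: str
    page: int     # 1-based page number in the source PDF
    snippet: str  # exact text the value was read from

    def citation(self) -> str:
        return f'p. {self.page}: "{self.snippet}"'


# A deviation report row carries the citation forward, so an engineer can
# open the PDF at the cited page and verify the claim directly.
nace = ExtractedValue(
    parameter="material_compliance",
    value="NACE MR0175",
    page=14,
    snippet="All wetted parts shall comply with NACE MR0175 / ISO 15156.",
)
```

Freezing the dataclass means downstream stages can aggregate and report these values but never mutate the evidence they point at.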


Why Local AI (Ollama + Llama)

| Consideration     | Local (Ollama)                               | Cloud APIs                  |
|-------------------|----------------------------------------------|-----------------------------|
| Cost per document | $0                                           | $0.10–$0.50                 |
| Privacy           | Documents stay on your machine               | Sent to third-party servers |
| Offline use       | Full functionality                           | Requires connectivity       |
| Setup effort      | Install Ollama + download model (5–40 GB)    | API key only                |
| Quality ceiling   | Approaching but not exceeding frontier models | Highest available accuracy  |

For industrial RFQ analysis with proprietary documents, privacy and zero marginal cost often outweigh the quality gap. But this is ultimately a business decision, not a technical one. A production system should probably support both — local inference for routine analysis, cloud APIs for critical high-value bids where maximum accuracy justifies the cost.


Beyond Code: Strategic Learnings

Real Value Is Risk Reduction, Not Time Savings

My initial hypothesis was “save proposal teams 30% of their time.” The actual insight is different: missing one deviation can cost more than a year of software. The reframe is important — from “process RFQs faster” to “catch what you’d miss.” Proposal teams under pressure make mistakes. AI doesn’t get tired or rushed. One prevented error means positive ROI.

This has pricing implications too. Value-based pricing tied to bid size may make more sense than per-document or per-seat models.

The Cold Start Problem Is the Biggest GTM Barrier

AI needs historical data — past proposals, product specs, customer preferences — to work well. But new customers don’t have digitized archives. Realistically, a system like this takes 3-6 months to become truly valuable as it builds a useful library of specs and learns from outcomes.

Pre-loading industry templates (API standards, common specifications) can provide immediate value and bridge the gap, but the cold start problem explains why validation periods for this kind of tool need to be measured in months, not weeks.

Integration Is Make-or-Break

To be truly valuable, a deviation analysis tool needs to connect to ERP systems for product specs and pricing, CRM for customer history, document management for templates, and email for receiving RFQs. An early-stage product can’t build 50 custom integrations — the strategy has to be: focus on 3 universal integrations, offer an API for everything else, and partner with ERP consultants for complex cases.

Vertical Specialization Creates Moats

Generic RFP tools struggle with engineering standards (ASME, API, NACE), technical specifications (pressure ratings, materials), and complex datasheets. Building AI-native for a narrow vertical like industrial equipment creates defensible positioning that horizontal tools retrofitting AI can’t easily replicate.


Prototype Limitations

This is an MVP that demonstrates core concepts, not a production system.

The gaps between the two aren't minor; they're fundamental requirements for production use. But the prototype successfully proves three things: local LLMs can extract structured data from technical documents, the privacy/cost tradeoff of local inference works for this use case, and the core deviation detection logic produces useful results.


Try It Yourself

The prototype is open-source. If you have Python 3.11+ and can install Ollama, you can be running your first analysis in about 15 minutes.

  1. Install Ollama and pull a model (ollama pull llama3.3:latest)
  2. Clone the repository
  3. Set up the environment and run streamlit run src/ui/app.py

Full installation instructions are in the README.


Reflections

A few days of building taught me more than I expected about this sector and about B2B AI in general.

Technically, working with LLMs requires thinking about accuracy, cost, and speed tradeoffs as product decisions — not just engineering problems.

Strategically (though this remains unvalidated), the real value in B2B may not be where you initially think. Time savings sound good in a pitch, but risk reduction may be what actually drives buying decisions.

About the sector: I now understand why industrial procurement has remained resistant to digitization. The workflows are intricate, the compliance requirements vary by geography and application, and one mistake can have real consequences. That complexity is also where AI could add the most value.

If you’re working on AI for B2B or industrial workflows or even public sector processes, I’d love to connect.


Stay Updated

Interested in future projects? Join a small list to hear about new explorations.


Built by Alessio Fiorentino · GitHub · Report an Issue