Every order contains sensitive information. Storing it creates GDPR risks. So we built an architecture that learns from structure and context, not from stored documents.
The three roads to AI-powered order processing
Building AI for order automation requires training data. The real question is where that data comes from and what happens to it.
In practice, we see three common approaches. Most solutions choose one of the first two. We deliberately chose the third.
Approach 1: OCR combined with machine learning
The first approach starts with OCR. A document is scanned, text is extracted and a model learns patterns over time.
It sounds logical. In reality it breaks quickly.
OCR struggles with handwriting, unusual layouts and low quality scans. The machine learning layer behind it requires large volumes of labelled data before it becomes reliable.
For B2B distributors dealing with hundreds of different order formats, from PDFs to spreadsheets to photos, this quickly becomes a maintenance challenge. Every new edge case requires retraining.
Approach 2: fine-tuning a large language model
Another approach is to train or fine-tune a large language model on internal order data.
The idea is that the model will eventually learn the company specific context.
But this comes with trade-offs. You need to store large volumes of raw order data, allocate significant compute resources and iterate for months before the model stabilises.
You are also tied to a specific model architecture. When a provider releases a new version, the fine-tuning process may have to start again.





