πŸŽ‰ Typeless is now Fabrx! Same great product, new name.
LogisticsΒ·10 min read

Bill of Lading Data Extraction API β€” Deploy in 60 Seconds, No Templates Required

Extract structured data from any bill of lading or shipping label with a live API endpoint in under 60 seconds. No templates, no model training, EU AI Act compliant on the free plan.

Every logistics operation runs on paper β€” or its digital equivalent. Bills of lading, shipping labels, freight manifests, carrier receipts: the data locked inside these documents drives every downstream decision, from inventory updates to customs filings to invoicing. And yet, for most teams, extracting that data still means manual entry, brittle OCR templates, or waiting weeks for a vendor to train a model on your specific carrier formats.

Fabrx takes a different approach. Describe the fields you want in plain language, and you get a live REST API endpoint in under 60 seconds β€” no templates, no training data, no vendor onboarding calls. This article walks through exactly how it works for bills of lading and shipping labels, what data you can extract, and why the compliance posture matters more than most logistics teams realize.

What Is Bill of Lading Data Extraction β€” and Why Manual OCR Falls Short

A bill of lading (BOL) is the foundational document of freight shipping. It serves simultaneously as a receipt of goods, a contract of carriage, and β€” in many jurisdictions β€” a title document. A standard BOL captures shipper and consignee details, shipment origin and destination, commodity descriptions, quantity and weight, freight charges, special handling instructions, and the carrier's signature.

The challenge is that no two carriers format a BOL the same way. A BOL from a regional LTL carrier looks nothing like one from an ocean freight forwarder, which looks nothing like a cross-border trucking document. There are hundreds of active templates in circulation, and they change whenever carriers update their systems or regulatory requirements shift.

Traditional OCR approaches fail here in predictable ways:

  • Template-based OCR breaks the moment a carrier updates their form layout. Your team discovers the failure when a data field comes back blank β€” or worse, populated with the wrong value.
  • Pre-trained fixed-schema models (like those offered by most OCR API vendors) extract the 20 or so fields the vendor decided were important. If you need a custom field β€” declared value in a specific currency, a carrier-specific reference number, a regulatory code β€” you're out of luck or back to building templates.
  • Manual data entry doesn't fail silently, but at 500–50,000 BOLs per month, the labor cost is unsustainable, and human error rates compound downstream.

The core problem isn't OCR accuracy β€” modern models read text from documents reliably. The problem is the gap between "reading text" and "delivering structured, field-mapped data that integrates cleanly with a WMS or ERP." That gap is where most logistics automation projects get stuck.

How Fabrx Works: Describe Your Schema, Get a Live API Endpoint

Fabrx replaces the template-building and model-training workflow with a conversational schema builder. Instead of configuring field coordinates or labeling training documents, you describe what you want to extract in plain language:

"Extract the shipper name, consignee name, shipper address, consignee address, BOL number, pro number, ship date, delivery date, commodity description, number of pieces, total weight in pounds, freight class, declared value, and any special handling instructions."

Fabrx maps that description to a typed JSON schema, shows you a preview of the output structure, and generates a live REST endpoint. The entire process β€” from description to deployable API β€” takes under 60 seconds.

Fabrx advantage: No competitor in the BOL OCR space makes an explicit time-to-live-API claim. Nanonets, Mindee, and Klippa all require template configuration, model training, or enterprise onboarding before you can make your first production API call. With Fabrx, you paste the endpoint into your integration and start processing documents immediately.

The schema you define is versioned. When a carrier updates their BOL format, or when regulations require a new field, you update the schema description β€” not a template, not a training dataset. Old API versions continue serving existing integrations while the new version deploys. For 3PL and freight tech developers maintaining integrations across dozens of carriers, this is a meaningful operational difference.

Under the hood, Fabrx routes your documents through the AI provider you select (or the default, if you haven't configured BYOK), applies your schema as structured extraction instructions, validates the output against your field types, and returns clean JSON. Every extraction is logged with field-level provenance so you can trace exactly where each value came from in the source document.

What Data You Can Extract from a Bill of Lading (and How to Customize It)

Most BOL extraction tools give you a fixed list of fields. Fabrx gives you any field that appears in the document, structured however your downstream systems expect it. Common BOL extraction schemas include:

  • Parties: Shipper name, shipper address, consignee name, consignee address, third-party billing party, notify party, broker name
  • Reference numbers: BOL number, PRO number, purchase order number, customer reference, shipment ID, seal numbers
  • Dates and routing: Ship date, estimated delivery date, origin terminal, destination terminal, carrier SCAC code, trailer number
  • Commodity detail: Description of goods, HS codes, NMFC codes, freight class, number of handling units, total pieces, total weight, dimensions
  • Financial: Declared value, freight charges, accessorial charges, prepaid vs. collect vs. third-party payment terms
  • Compliance fields: Hazmat indicators, UN numbers, emergency contact information, FDA registration numbers, USDA compliance codes
  • Carrier annotations: Exceptions noted at pickup, driver signature, delivery confirmation, shortage or damage notations

Beyond these standard fields, you can extract carrier-specific fields that appear nowhere in pre-trained models. If your carrier uses a non-standard "Route Code" or "Load Sequence" field, you describe it and Fabrx extracts it. If you need declared value normalized to USD regardless of the currency printed on the BOL, you specify the transformation in your schema description.

Fabrx advantage: Mindee's BOL OCR API extracts a fixed pre-trained schema. If you need a field that isn't in their list, you cannot add it without building a custom model β€” a project measured in weeks and labeled training data. Fabrx schema customization takes 30 seconds of typing.

Shipping Label Extraction: Same API, Same Schema Builder

Shipping labels β€” whether parcel labels from UPS, FedEx, DHL, or USPS, or carrier-specific LTL labels β€” present a different extraction challenge than BOLs. They're dense, often printed at low resolution, frequently photographed rather than scanned, and packed with barcodes, QR codes, and routing symbols alongside human-readable text.

The data most teams need from shipping labels:

  • Tracking number (and barcode format: Code 128, QR, Data Matrix, GS1-128)
  • Service level (Ground, 2-Day, Overnight, Freight)
  • Sender and recipient name, company, address, city, state, ZIP, country
  • Weight and dimensions
  • Reference fields (order number, invoice number, return authorization)
  • Zone, sort code, routing code
  • Special handling indicators (fragile, temperature-controlled, signature required)
  • Package count (e.g., "1 of 3", "2 of 3")

Because Fabrx uses the same conversational schema builder for shipping labels as for BOLs, you don't need a separate integration, a separate vendor, or a separate pricing tier. Define your shipping label schema, get an endpoint, and call it with the same authentication your BOL integration already uses.

Fabrx advantage: Most BOL OCR vendors treat shipping labels as a separate product category with separate pricing. Fabrx treats them as the same problem: a document with fields you want extracted. One schema builder, one API contract, one integration to maintain.

Deploy in Under 60 Seconds β€” No Templates, No Model Training

Here's what the deployment flow looks like in practice:

  1. Sign up and open the schema builder. No credit card required on the free plan. The schema builder is a chat interface β€” you describe your document and the fields you want.
  2. Describe your extraction schema. Type or paste a description of the fields you need. Fabrx parses your description into a typed JSON schema and shows you a preview. You can refine it conversationally: "make weight a number in pounds", "add a nested array for line items", "mark BOL number as required".
  3. Test against a sample document. Upload a BOL or shipping label. Fabrx runs the extraction and shows you the structured output alongside the source document. You can click any field value to see the exact text region it was drawn from.
  4. Copy your endpoint and API key. Your endpoint is live immediately. No deployment step, no approval queue, no waiting for a model to train.
  5. Integrate. POST your document to the endpoint. Receive structured JSON. Route it to your WMS, TMS, or ERP.

Your bill of lading extraction API β€” live in under 60 seconds.

No templates. No training data. EU AI Act compliant on the free plan.

Get started free β†’

Full Observability: Field-Level Data Lineage on Every Extraction

Logistics document automation isn't just about speed β€” it's about trust. When a BOL extraction feeds directly into a warehouse management system, an ERP financial module, or a customs filing, incorrect data has real consequences: misdirected shipments, compliance violations, incorrect invoicing, failed audits.

Fabrx provides field-level data lineage on every extraction. For each field in your output JSON, you can see:

  • The exact text region in the source document from which the value was extracted
  • The confidence level of the extraction
  • Whether the value required normalization (e.g., date format standardization, address parsing)
  • The model and schema version used for this extraction
  • A complete audit log of the extraction request and response

This observability layer matters for regulated supply chains β€” FDA-regulated goods, customs brokerage, hazardous materials β€” where you need to demonstrate not just that data was extracted, but where it came from and how it was processed. It also matters for exception handling: when an extraction returns a low-confidence value, your integration can route that document to a human reviewer rather than silently passing bad data downstream.

For logistics operations managers, this is the difference between "the AI extracted it" and "I can show you exactly where every number came from." That distinction matters in freight disputes, customs audits, and carrier claim processes.

EU AI Act Compliant, PII Detection Included β€” Even on the Free Plan

Bills of lading contain significant personal data: shipper and consignee names, company names, physical addresses, sometimes contact phone numbers and email addresses. Under GDPR, this is personal data. Under the EU AI Act β€” with high-risk provisions enforced from August 2026 β€” AI systems that process personal data in regulated contexts face additional obligations around transparency, human oversight, and audit trails.

Most BOL OCR vendors don't address this. Nanonets, Klippa, and KlearStack focus their compliance messaging on accuracy metrics, not data governance. Mindee's documentation doesn't mention the EU AI Act. For European logistics companies and any operation importing into the EU, this is a gap that becomes a liability in August 2026.

Compliance: Fabrx includes PII detection and redaction on all plans, including free. Every extraction request passes through PII classification before processing. Detected PII is flagged in the audit log, and you can configure field-level redaction policies so sensitive data never appears in your extraction logs. The entire platform is built to the EU AI Act's requirements for transparency, human oversight capability, and audit trail completeness β€” not bolted on as an enterprise add-on.

This has practical implications beyond regulatory compliance. Shipping documents from EU-based shippers or consignees are subject to GDPR's data minimization and purpose limitation principles. If your extraction pipeline stores raw document images and full extraction outputs indefinitely, you have a GDPR exposure. Fabrx's configurable retention policies and field-level redaction give you the tools to build a compliant pipeline without building a custom data governance layer on top of a third-party API.

For more on Fabrx's compliance architecture, see our detailed overview of GDPR and EU AI Act compliant document processing.

BYOK: Choose Your AI Provider, Keep Control of Your Data

Shipping documents contain competitively sensitive information: which customers you're shipping to, which suppliers you're receiving from, declared values that reveal pricing relationships, HS codes that reveal product compositions. For enterprise logistics operations, the question of which AI provider processes this data β€” and under what data processing terms β€” is not a technical footnote. It's a procurement and legal requirement.

Fabrx supports Bring Your Own Key (BYOK) across more than 100 AI providers. You can route BOL extractions through:

  • Your own Azure OpenAI deployment (with your data processing agreement already in place)
  • Anthropic Claude via your own API key (with your own DPA)
  • Google Vertex AI under your Google Cloud agreement
  • A self-hosted model running in your own infrastructure, with no data leaving your environment
  • Any of 100+ other providers supported through the Fabrx provider abstraction layer

When you use BYOK, Fabrx's schema builder and observability layer work exactly as described above β€” you get the schema builder, the structured output, the field lineage, and the compliance tooling β€” but document content flows through your chosen provider under your data processing terms. Fabrx never sees the document content.

Fabrx advantage: No competitor in the BOL OCR space offers BYOK with 100+ provider options. Klippa and Nanonets process all documents through their own infrastructure. Mindee's API sends documents to their servers with no BYOK option. For enterprise logistics operations with existing cloud provider DPAs, or for operations processing documents that reveal sensitive commercial relationships, BYOK is often a go/no-go requirement.

Integrate with Your WMS, TMS, or ERP in Minutes

A BOL extraction API is only useful if it connects to the systems that need the data. Fabrx delivers structured JSON, which means integration is a matter of HTTP β€” whatever your WMS, TMS, or ERP can receive a webhook or make an HTTP call, it can consume Fabrx output.

Common integration patterns:

  • Direct API call from your document intake pipeline: When a BOL arrives (email attachment, EDI portal upload, carrier API, scan station), your pipeline POSTs it to the Fabrx endpoint and receives JSON to insert into your WMS or TMS.
  • Webhook-triggered processing: Configure Fabrx to POST extraction results to your webhook endpoint as soon as processing completes. Your system doesn't need to poll.
  • Batch processing: Submit a queue of BOLs via the batch endpoint. Fabrx processes them in parallel and delivers results as they complete.
  • No-code automation: Connect Fabrx to Zapier, Make, or n8n for no-code routing β€” useful for smaller operations that don't have engineering resources for a full integration. See our guide on building a no-code document API for step-by-step instructions.

For scanned or photographed BOLs β€” common in dock receiving workflows β€” Fabrx handles the OCR layer automatically. You don't need a separate OCR preprocessing step. Upload the image or PDF; receive structured JSON. For details on how Fabrx processes scanned documents, see our overview of scanned document OCR to structured data.

Fabrx vs. Nanonets, Mindee, and Klippa β€” Side-by-Side Comparison

The BOL OCR market has several established players. Here's how Fabrx compares on the dimensions that matter most for logistics operations:

CapabilityFabrxNanonetsMindeeKlippa
Time to live API<60 secondsDays–weeks (template setup)Minutes (fixed schema)Weeks (enterprise sales)
Custom field extractionAny field, described in plain languageYes, with training dataNo (fixed pre-trained schema)Yes, with configuration
Schema versioningBuilt-inNoNoNo
Field-level data lineageYes, on all plansLimitedNoEnterprise only
EU AI Act complianceYes, all plans incl. freeNot addressedNot addressedNot addressed
PII detection & redactionAll plansEnterpriseNot availableEnterprise
BYOK (AI provider)100+ providersNoNoNo
Shipping label extractionSame API, same schemaSeparate productSeparate modelSeparate product

The competitive gap is clearest on the combination of developer experience (time to live API, no-code schema builder) and compliance posture (EU AI Act, PII detection, BYOK). No single competitor addresses all four dimensions. Nanonets and KlearStack have strong accuracy stories but weak compliance and slow deployment. Mindee is developer-friendly but locked to a fixed schema. Klippa has enterprise compliance features but no developer self-serve story. Fabrx is the only option that covers all four without requiring enterprise procurement.

Get Your First BOL API Endpoint Live Today

Bill of lading data extraction is a solved problem β€” the question is how much friction stands between your current workflow and a production-ready API. Template-based systems demand weeks of setup and break when carrier formats change. Pre-trained fixed-schema APIs give you someone else's field list. Custom model training takes labeled data and engineering time you probably don't have.

Fabrx's approach β€” describe your schema, get an endpoint, integrate β€” is measurably different from every competitor in this space. The 60-second deployment claim isn't marketing language; it's what the product delivers. The EU AI Act compliance posture isn't a roadmap item; it's available on the free plan today.

If you're processing BOLs or shipping labels at any volume β€” even a few hundred per month β€” the integration effort is worthwhile. At 500 BOLs per month, eliminating manual data entry pays for itself within the first billing cycle. At 5,000 per month, the operational leverage compounds across every downstream system that receives cleaner, faster, more reliable data.

The free plan includes 100 extractions per month, full EU AI Act compliance tooling, PII detection, and field-level data lineage. No template configuration, no model training, no sales call required.

Your bill of lading extraction API β€” live in under 60 seconds.

No templates. No training data. EU AI Act compliant on the free plan.

Get started free β†’

Your document extraction API β€” live in under 60 seconds.

No templates. No training data. EU AI Act compliant on the free plan.

Get started free β†’