Production-Grade PHP Driver's License OCR via REST API

Achieve 99.7%+ extraction accuracy and sub-1500ms latency without maintaining a single regex pattern.

Steve HarringtonUpdated 2026-01-16
A flow diagram showing a photo of a driver's license being sent to the StructOCR API and returning structured JSON data with fields like name, address, and document number.
Figure 1: StructOCR converts raw Driver's License images into validated JSON data.

Why Driver's License OCR is Difficult

Building in-house Driver's License OCR is a significant engineering challenge beyond generic text extraction. Open-source tools like Tesseract fail on low-quality, real-world images due to glare, shadows, and non-standard fonts. The core problem lies in parsing the PDF417 barcode, which contains critical data but is susceptible to distortion and damage. Furthermore, each jurisdiction has unique layouts and field names, requiring a complex and brittle web of RegEx patterns. Manually implementing checksum validation for MRZ or document numbers adds another layer of complexity, leading to high maintenance costs and inconsistent accuracy.

Enterprise-Grade Extraction with StructOCR

StructOCR bypasses the limitations of generic OCR engines. Our API leverages pre-trained Deep Learning models specifically designed for identity documents. Upon receiving an image, our system performs automatic pre-processing, including deskewing, denoising, and glare removal, before analysis. This ensures high accuracy even on suboptimal inputs. Unlike Tesseract, which returns unstructured lines of text, StructOCR provides a standardized JSON output with validated fields like `date_of_birth` and `document_number`. This eliminates the need for post-processing and manual data correction, reducing development time from months to hours.

Production Use Cases

  • Digital Onboarding (KYC): Reduce drop-off rates by pre-filling user data from Driver's Licenses in < 2 seconds.
  • Fraud Prevention: Detect tampered fonts or mismatched PDF417 checksums automatically.
  • Vehicle & Equipment Rental: Instantly verify driver age and extract vehicle class endorsements (A, B, C, M) to streamline rentals.

Implementation: Raw API Request

The following PHP code demonstrates a complete extraction flow using cURL. It handles image encoding, sets the required 'x-api-key' header, and parses the structured JSON response.

Prerequisite: PHP 7.4+ with cURL extension

CODE EXAMPLE
<?php

// 💰 Save 30%+ vs competitors. Get 200 free requests instantly:
// 👉 https://structocr.com/register

$apiKey = 'YOUR_API_KEY_HERE';
$apiUrl = 'https://api.structocr.com/v1/driver-license';
$imagePath = 'license.jpg';

// 1. Validate and Encode Image
if (!file_exists($imagePath)) {
    die('Error: File not found.');
}

$imageData = file_get_contents($imagePath);
$base64Image = base64_encode($imageData);

// 2. Prepare JSON Payload
$payload = json_encode(['img' => $base64Image]);

// 3. Initialize cURL
$ch = curl_init();

curl_setopt_array($ch, [
    CURLOPT_URL => $apiUrl,
    CURLOPT_RETURNTRANSFER => true,
    CURLOPT_POST => true,
    CURLOPT_POSTFIELDS => $payload,
    CURLOPT_HTTPHEADER => [
        'Content-Type: application/json',
        'x-api-key: ' . $apiKey // Required Authentication Header
    ]
]);

// 4. Execute and Parse
$response = curl_exec($ch);
$httpCode = curl_getinfo($ch, CURLINFO_HTTP_CODE);

if (curl_errno($ch)) {
    die('cURL Error: ' . curl_error($ch));
}
curl_close($ch);

// 5. Handle Response
$result = json_decode($response, true);

if ($httpCode === 200 && isset($result['success']) && $result['success']) {
    $data = $result['data'];
    echo "✅ Extraction Successful!\n";
    echo "Type: " . $data['type'] . "\n";
    echo "Name: " . $data['given_names'] . " " . $data['surname'] . "\n";
    echo "Doc Number: " . $data['document_number'] . "\n";
    echo "Region: " . $data['region'] . " (" . $data['country_code'] . ")\n";
    echo "Vehicle Class: " . $data['vehicle_class'] . "\n";
    echo "Expiry: " . $data['date_of_expiry'] . "\n";
} else {
    echo "❌ API Error (Code $httpCode):\n";
    // Print error message from API if available
    if (isset($result['error'])) {
        echo "Error: " . $result['error'] . "\n";
    } else {
        echo $response;
    }
}

?>

Technical Specs

  • Latency: < 5s (Average)
  • Uptime: 98.5% SLA
  • Security: AES-256 Encryption & SOC2 Compliant
  • Input: JPG, PNG, WebP (Base64 Encoded)
  • Max File Size: 4.5MB
  • Output: JSON (Structured Data)

Key Features

  • Global Coverage: Supports formats from USA, EU, and Asia.
  • Date Normalization: All dates automatically formatted to YYYY-MM-DD.
  • Vehicle Class Parsing: Extracts allowed vehicle categories (e.g., A, B, C).

Sample JSON Output

StructOCR returns a normalized JSON object, regardless of the input image angle or quality.

{
  "success": true,
  "data": {
    "type": "drivers_license",
    "country_code": "USA",
    "region": "CALIFORNIA",
    "document_number": "D1234567",
    "surname": "DRIVER",
    "given_names": "JANE MARIE",
    "date_of_birth": "1995-08-15",
    "date_of_expiry": "2025-08-15",
    "date_of_issue": "2020-08-15",
    "sex": "F",
    "address": "1234 ELM ST, SACRAMENTO, CA 95814",
    "vehicle_class": "C"
  }
}

Frequently Asked Questions

How does StructOCR compare to AWS Textract or Google Vision?

General-purpose OCR services like Textract return an array of raw text lines and coordinates. You are still responsible for parsing this data and mapping it to meaningful fields. StructOCR is a specialized API that performs this final step, returning a structured JSON object with labeled fields like `surname` and `date_of_birth`, saving you significant development effort.

Do you store the uploaded images?

No. Images are processed in-memory and permanently deleted immediately after the API response is generated. We do not persist any customer PII on our servers.

How do you handle blurry or low-quality images?

Our API includes a mandatory, automated pre-processing pipeline that performs image enhancement, including de-noising, sharpening, and perspective correction, before the OCR models are executed. This maximizes accuracy on real-world mobile captures.

More OCR Tutorials

Precise Data Extraction and Seamless Integration with AI-powered OCR API.

Empower your solutions with automated data extraction by integrating best-in class StructOCR via API seamlessly.

No credit card required • Full API access included