Indonesia KTP (e-KTP) OCR Python SDK

Instantly extract data from Indonesian IDs using our native Python library.

Steve HarringtonUpdated 2026-01-22
AI extracting data from a Indonesia ID card
StructOCR engine analyzing a Indonesian document in real-time.

Parsing KTP (e-KTP) Challenges

Indonesian KTPs present several OCR challenges. Firstly, inconsistencies in formatting and layout across different regions are common. Secondly, the presence of native Bahasa Indonesia text combined with varying print quality makes accurate text recognition difficult.

Why StructOCR for Indonesia

Our model is specifically trained on a large dataset of Indonesian KTPs, addressing layout variations and native text recognition. The Python SDK simplifies integration by providing a straightforward API, allowing you to process images with just a few lines of code.

Common Use Cases in Indonesia

  • Digital Onboarding: Verify users for Fintech apps in Indonesia.
  • Telecom Registration: Automate SIM card registration with KTP (e-KTP).
  • Hotel Check-in: Speed up guest registration workflows.

Python SDK Integration

Install the SDK via pip: `pip install structocr`. Then use the following code.

Prerequisite: Python 3.6+ and `structocr` library installed.

PYTHON EXAMPLE
from structocr import StructOCR

# 💰 Save 30%+ vs competitors. Get 200 free requests instantly:
# 👉 https://structocr.com/register
# Initialize with your API Key
client = StructOCR("YOUR_API_KEY_HERE")

def scan_indonesia_id():
    # Note: Supports JPG, PNG, WebP (Max 4.5MB)
    # Target: KTP (e-KTP)
    image_path = "indonesia_national_id.jpg"

    try:
        print(f"Scanning {image_path}...")
        
        # The SDK handles file upload and API communication
        # It automatically detects that this is a Indonesian document
        result = client.scan_national_id(image_path)

        # Check success flag (SDK returns a dict matching the JSON response)
        if result.get('success'):
            data = result['data']
            print("✅ Indonesia Extraction Successful!")
            
            # Basic Identity
            print(f"Region:      {data.get('country_code')} (Series: {data.get('card_series')})")
            print(f"Name:        {data.get('given_names')} {data.get('surname')}")
            print(f"ID Number:   {data.get('document_number')}")
            
            # Critical Field: Personal Identity Number (CNP/CPF/NIN)
            print(f"Personal #:  {data.get('personal_number')}")
            
            # Demographics
            print(f"DOB:         {data.get('date_of_birth')} ({data.get('sex')})")
            print(f"Address:     {data.get('address')}")
        else:
            print(f"❌ Extraction Failed: {result.get('error')}")

    except Exception as e:
        # Handle SDK or Network errors
        print(f"An error occurred: {e}")

if __name__ == "__main__":
    scan_indonesia_id()

Technical Specs

  • Latency: < 5s (Average)
  • Uptime: 98.5% SLA
  • Security: AES-256 Encryption & SOC2 Compliant
  • Input: JPG, PNG, WebP (Max 4.5MB)
  • Output: JSON (Structured Data)

Key Features

  • Native Script Support: Reads English and local characters.
  • Blur Detection: Automatically rejects blurry images.
  • Fraud Check: Validates KTP (e-KTP) number format.
  • Smart Crop: Removes background noise automatically.

JSON Response Example

The SDK returns a Python dictionary matching this JSON structure.

{
  "success": true,
  "data": {
    "type": "national_id",
    "country_code": "IDN",
    "nationality": "WNI",
    "document_number": "3171234567890001",
    "card_series": "",
    "personal_number": "3171234567890001",
    "surname": "SANTOSO",
    "given_names": "BUDI",
    "sex": "M",
    "date_of_birth": "1990-05-15",
    "place_of_birth": "JAKARTA",
    "address": "Jl. Jend. Sudirman No. 5, Jakarta Pusat",
    "date_of_issue": "2020-01-01",
    "date_of_expiry": "2030-01-01",
    "issuing_authority": "Kabupaten Bogor"
  }
}

Frequently Asked Questions

Does the Python SDK handle image uploads?

Yes, the SDK automatically handles base64 encoding and file uploads.

Is data stored?

No. Images are processed in-memory and deleted immediately.

How to handle errors?

The SDK result dictionary contains a 'success' boolean and an 'error' message if failed.

More OCR Tutorials

Precise Data Extraction and Seamless Integration with AI-powered OCR API.

Empower your solutions with automated data extraction by integrating best-in class StructOCR via API seamlessly.

No credit card required • Full API access included