Beta release – features may change without notice

Generate law-aware sample data in seconds

We built DataGen Pro to remove the busywork of crafting dummy data by hand. Each schema is inspired by compliance requirements so your QA and PoC flows can rely on realistic records. During beta, rate limits or breaking changes may occur.

10,000+ rows / request
8 compliance schemas
100% free

Compliance disclaimer

DataGen Pro provides regulation-inspired schemas, but no generated dataset is guaranteed to meet legal requirements. Validate with your compliance or legal team before production use.

health-lod.csv
payroll-basic.json
employee_id  height_cm  weight_kg  blood_pressure
EMP001       172.5      68.2       120/80
EMP002       165.8      55.7       118/75
EMP003       178.2      72.1       125/82

2,000 rows generated · 0.8s

Choose a schema

Pick from compliance-inspired templates crafted for QA and staging workloads

FHIR R4 Clinical Bundle

Synthetic FHIR R4 bundles spanning Patient, Observation, Encounter, and AuditEvent resources for interoperability testing.

FHIR · Healthcare
12 columns · ~0.9s / 1k bundles

PCI-DSS Advanced Card Log

Extended PCI-DSS logging with 3-D Secure outcomes, dispute lifecycle markers, and tokenization flags.

PCI-DSS · Fraud
28 columns · ~0.7s / 1k rows

ISO 20022 Payments

Cross-border pacs.* message stubs with compliance metadata for treasury and settlement testing.

ISO 20022 · Payments
18 columns · ~0.6s / 1k messages

Card Transaction (PCI-DSS)

Card authorization log with fields modeled on PCI-DSS v4.0.

PCI-DSS · Payments
20 columns · ~0.4s / 1k rows

Payroll (Basic)

Payroll statements referencing JIS Q 15001 and My Number guidelines.

Payroll · Personal data
26 columns · ~0.6s / 1k rows

eKYC Attributes

Identity verification profiles inspired by Japan's AML/KYC guidelines.

AML/KYC · Identity
26 columns · ~0.8s / 1k rows

Mandatory Medical Checkup (JP LOD)

Sample health exam dataset based on Japan's Industrial Safety and Health Act.

Occupational health · Compliance
26 columns · ~0.5s / 1k rows

Specific Health Checkup (JP Tokutei)

Extended metabolic screening dataset aligned with the Act on Assurance of Medical Care for Elderly People.

Metabolic screening · Health insurers
32 columns · ~0.7s / 1k rows

Why teams pick DataGen Pro

Compliance-aware dummy data in minutes, not days

Compliance-minded schemas

Blueprints shaped by occupational health, PCI-DSS, and identity regulations.

High throughput

Generate tens of thousands of rows in seconds to unblock QA runs.

Deep customization

Tune age spans, gender mix, abnormal ratios, approval rates, and more.

API-first

Drop into CI/CD and staging pipelines with a single POST endpoint.

Stateless delivery

Responses are generated in memory and streamed back; no generated records are stored server-side.

Free & open-source

Released under MIT License so you can adapt or self-host without friction.
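For the CI/CD use case mentioned under "API-first", a fixture fetched from the endpoint is worth a quick sanity check before tests consume it. The sketch below is one way to do that, assuming the CSV output is comma-delimited with a header row (the preview on this page does not show the delimiter). The function name `check_fixture` and the checks themselves are illustrative, not part of DataGen Pro.

```python
import csv
import io


def check_fixture(csv_text, expected_rows, required_columns):
    """Return a list of problems found in a generated CSV fixture (empty if none)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    problems = [
        f"missing column: {name}" for name in required_columns if name not in header
    ]
    rows = list(reader)
    if len(rows) != expected_rows:
        problems.append(f"expected {expected_rows} rows, got {len(rows)}")
    return problems


# Columns and values below are taken from the health-lod preview on this page;
# the comma delimiter is an assumption.
sample = (
    "employee_id,height_cm,weight_kg,blood_pressure\n"
    "EMP001,172.5,68.2,120/80\n"
    "EMP002,165.8,55.7,118/75\n"
    "EMP003,178.2,72.1,125/82\n"
)
print(check_fixture(sample, 3, ["employee_id", "blood_pressure"]))  # []
```

A pipeline step could fail the build whenever the returned list is non-empty.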

API reference

Integrate via REST with streaming CSV/JSON responses
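Because responses are described as streaming, a client can write them to disk chunk by chunk instead of buffering the whole body. This is a minimal sketch of that pattern; `save_stream` is a hypothetical helper, and the commented usage assumes the `requests` library and that the server really does deliver the body incrementally.

```python
import os
import tempfile


def save_stream(chunks, path):
    """Write an iterable of byte chunks to path and return the bytes written."""
    written = 0
    with open(path, "wb") as f:
        for chunk in chunks:
            if chunk:  # skip empty chunks
                f.write(chunk)
                written += len(chunk)
    return written


# With the requests library, the chunks could come from a streamed response:
#   with requests.post(url, json=payload, stream=True) as r:
#       r.raise_for_status()
#       save_stream(r.iter_content(chunk_size=8192), "health-lod.csv")

# Offline demonstration with hand-made chunks instead of a live response.
demo_chunks = [b"employee_id,height_cm\n", b"", b"EMP001,172.5\n"]
demo_path = os.path.join(tempfile.mkdtemp(), "demo.csv")
print(save_stream(demo_chunks, demo_path))
```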

POST /api/generate

Request example

{
  "schema": "health-lod",
  "num": 2000,
  "output": "csv",
  "options": {
    "genderRatio": { "male": 0.5, "female": 0.5 },
    "age": { "min": 25, "max": 60 },
    "departments": ["Sales", "Engineering"]
  }
}

Tip: "options" accepts schema-specific knobs such as abnormal ratios or fraud scoring.
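Since "options" varies by schema, it can help to assemble the request body in one place and catch obvious mistakes before sending. The sketch below mirrors the fields in the request example. The helper `build_payload`, the 10,000-row cap (taken from the pricing list) and the rule that `genderRatio` shares sum to 1 are client-side assumptions, not documented server behavior.

```python
import json
import math

MAX_ROWS = 10_000  # assumed per-request cap, from the pricing list


def build_payload(schema, num, output="csv", **options):
    """Assemble a request body shaped like the request example."""
    if output not in ("csv", "json"):
        raise ValueError("output should be 'csv' or 'json'")
    if not 1 <= num <= MAX_ROWS:
        raise ValueError(f"num should be between 1 and {MAX_ROWS}")
    ratio = options.get("genderRatio")
    # Assumption: ratio shares are fractions that add up to 1.
    if ratio is not None and not math.isclose(sum(ratio.values()), 1.0):
        raise ValueError("genderRatio shares should sum to 1")
    return {"schema": schema, "num": num, "output": output, "options": options}


payload = build_payload(
    "health-lod",
    2000,
    genderRatio={"male": 0.5, "female": 0.5},
    age={"min": 25, "max": 60},
    departments=["Sales", "Engineering"],
)
print(json.dumps(payload, indent=2))
```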

Response

200 · text/csv · application/json
Content-Disposition: attachment; filename="health-lod-2000.csv"
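The Content-Disposition header shown above carries a suggested filename, which a client can reuse when saving the download. This sketch parses it with Python's standard `email.message` module. The helper name and the fallback filename are hypothetical.

```python
from email.message import Message


def filename_from_disposition(header, default="datagen-output"):
    """Extract the filename parameter from a Content-Disposition header value."""
    if not header:
        return default
    msg = Message()
    msg["Content-Disposition"] = header
    return msg.get_filename() or default


print(filename_from_disposition('attachment; filename="health-lod-2000.csv"'))
# health-lod-2000.csv
```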
# curl: save 1,000 CSV rows to a file
curl -s -X POST https://datagen-pro.vercel.app/api/generate \
  -H 'Content-Type: application/json' \
  -d '{
    "schema": "health-lod",
    "num": 1000,
    "output": "csv",
    "options": {
      "genderRatio": { "male": 0.5, "female": 0.5 },
      "age": { "min": 25, "max": 60 },
      "departments": ["Sales", "Engineering"]
    }
  }' \
  --output health-lod-dummy-1000.csv

// JavaScript (fetch): request JSON output and parse it
const response = await fetch('https://datagen-pro.vercel.app/api/generate', {
  method: 'POST',
  headers: { 'Content-Type': 'application/json' },
  body: JSON.stringify({
    schema: 'health-lod',
    num: 1000,
    output: 'json',
    options: {
      followUpRatio: 0.1,
      abnormalRatio: 0.12
    }
  })
});

const data = await response.json();

# Python (requests): save the CSV response to disk
import requests

response = requests.post(
    'https://datagen-pro.vercel.app/api/generate',
    json={
        'schema': 'health-lod',
        'num': 1000,
        'output': 'csv',
        'options': {
            'followUpRatio': 0.1
        }
    }
)
response.raise_for_status()  # stop on HTTP errors instead of saving an error body

with open('health-lod-dummy-1000.csv', 'wb') as f:
    f.write(response.content)
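Datasets larger than one request allows have to be fetched in several calls. The sketch below only plans the request sizes. It assumes the 10,000-row figure in the pricing list is a hard per-request cap (the headline stat reads "10,000+", so the real limit may differ), and `split_into_batches` is a hypothetical helper.

```python
def split_into_batches(total_rows, per_request=10_000):
    """Split a row count into request sizes no larger than per_request."""
    if total_rows < 0 or per_request < 1:
        raise ValueError("total_rows must be >= 0 and per_request >= 1")
    full, remainder = divmod(total_rows, per_request)
    return [per_request] * full + ([remainder] if remainder else [])


print(split_into_batches(25_000))  # [10000, 10000, 5000]
```

Each size would then become the "num" field of its own POST. With CSV output, every response after the first presumably repeats the header row, which would need to be dropped when concatenating.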

Pricing

Free during beta. Open-source forever.

Forever Free
$0/mo
Unlimited schema usage
10,000 rows per request
No API throttling in beta
Commercial use permitted
MIT-licensed source
Community support