A lot of AI claims projects stall for a very boring reason:
The vendor asks, “Can you share a sample of your data?”
IT isn’t sure what’s available, Claims isn’t sure how clean it is, and Legal isn’t sure what can be shared.
This checklist is designed so that, in about a day, your team can get “AI ready enough” to have a strong, practical conversation with an AI vendor like Curacel’s AI claims infrastructure without a 6-month data project.
If you want a broader view of what this can unlock, you can also read our guide on Claims Management: AI-Powered Insurance and Automation.
You don’t need a perfect data warehouse. You just need clarity on what exists, where it lives, and who owns it.
Confirm which fields you can pull reliably:
This is the backbone for any AI-powered triage, analytics or fraud detection.
Next, note the policy attributes you can link to those claims:
You don’t need every clause of the wording; just the key fields your teams already use to make decisions.
Most of the “mess” in claims sits in documents and media. Make a quick inventory of where you store:
Also note: are these linked to claim IDs in your system, or scattered across drives and email?
This mix of structured (claims + policy) and unstructured (documents + images) is what an AI vendor will use to automate document handling, risk scoring and fraud flags.
Before anyone says “data lake”, do something much simpler: pull ~100 recent claims from one important product line (often motor or health) and inspect them.

Ask three basic questions:
Look for:
Capture a short list of “things we know are off”.
Check if the same thing appears under multiple names, for example:
Inconsistency doesn’t stop AI, but it does require quick mapping or normalisation. Knowing this upfront saves time later.
Scan for:
You’re not fixing it all now. You’re building a realistic picture of “what our data really looks like” for the vendor.
Now give your future partner enough signal to size effort, infrastructure and ROI.
For each major product line, estimate:
Even “about 1,500–2,000 motor claims a month” is useful.
Note:
This helps a vendor suggest where automation will have the biggest impact first.
Prepare a small, curated set of claims with simple labels:
For each, add 1–2 sentences:
This tiny “training set” is gold for designing triage flows, fraud scores and routing rules that match your real world. For a deeper dive into how this kind of pattern-spotting works in practice, see our article on Anomaly Detection in Claims: How AI is Transforming Insurance Operations.
The quickest way to slow down an AI project is to surprise Legal, Compliance or IT.
Capture the basics before you jump on vendor calls.
List the key roles:
Knowing these names early means you can loop them in properly once a proof-of-concept is on the table.
Clarify what’s acceptable in your organisation:
You’re not choosing tools yet, just setting guardrails.
Write a short one-pager covering:
Now, when a vendor proposes a data flow, you can immediately check: “Does this fit inside our rules?”

Copy this into Notion, Confluence or a shared doc and tick through as a team:
Datasets
Data Quality
Volumes & Variability
Examples
Governance & Access
If you can tick most of this, you’re already ahead of most AI initiatives.
When you show up with this prep:
Offer: Share a sample of this checklist and a few example claims with us, and in one call we’ll show you:
Book a demo and AI data readiness review →
Subsribe to our newsletter to receive weekly content