How Smaller Firms Can Achieve Data Lineage with Simple Agents

John O'ConnellAdvisor Perspectives welcomes guest contributions. The views presented here do not necessarily represent those of Advisor Perspectives.

The data lineage conversation in wealth management has been dominated by enterprise software vendors promising sophisticated tracking systems that cost six figures and require dedicated IT teams. But here's what the sales presentations miss: Smaller firms can achieve meaningful, defensible data lineage using tools they already own — if they understand what lineage actually means in practice.

Data lineage isn't about expensive technology. It's about answering three straightforward questions when a regulator, client, or internal auditor asks about AI-generated content:

  • What data went into this output?
  • Who used it, and when?
  • Where is the record stored?

If you can answer those questions reliably, you have functional lineage. Everything else is infrastructure theater.

The ChatGPT Session Tracking Foundation

The starting point for practical lineage is treating every business-related AI conversation as a documented process. This requires a structured prompt and response log stored in a secure, shared location such as SharePoint, Box, or Google Drive.

Each entry should capture four elements: the date and timestamp of the interaction, the user's name, the purpose or client context (using non-identifying descriptors), and the nature of the output generated. When that conversation influences client work, the final output gets saved directly to the relevant CRM record or document management system.

This creates what engineers call a manual lineage trail. In plain language, it means you can show exactly how advice or content was generated, who generated it, and when. The system isn't automated, but it's defensible — and that's what matters when compliance questions arise.