
Amperity launches ‘Chuck Data’: The first AI Agent built for customer data engineering in Databricks
Amperity, the AI-powered customer data cloud, today launched Chuck Data, the first AI Agent built specifically for customer data engineering. Chuck uses Amperity’s years of experience and patented identity resolution models, trained on billions of data sets across 400+ enterprise brands, as critical knowledge behind the AI. Chuck runs in the terminal and empowers engineers to quickly understand their data, tag it, and resolve customer identities in minutes – all from within their Databricks lakehouse.
As pressure mounts to deliver business-ready insights quickly, data engineers are hitting a wall: while infrastructure has modernised, the work of preparing customer data still relies on manual code and brittle rules-based systems. Chuck changes that by enabling data engineers to “vibe code” – using natural language prompts to delegate complex engineering tasks to an AI assistant.
Chuck connects directly to a user’s Databricks environment, leveraging native compute and large language model (LLM) endpoints to execute high-impact workflows like identity resolution, compliance tagging, and data profiling.
“Customer data engineering is full of repetitive, painful work, so we built Chuck to get rid of it,” said Derek Slager, Co-Founder and CTO at Amperity. “Chuck understands your data and helps you get stuff done faster, whether you’re stitching identities or tagging PII. No orchestration, no UI gymnastics—it’s just fast, contextual, and command-driven.”
Key Features of Chuck Data:
- Natural language command interface for customer data tasks
- Identity resolution via Amperity Stitch, running on Databricks compute
- PII tagging and customer profiling across Unity Catalog
- Compliance through accurate and easy PII tagging within Databricks
- Zero-copy architecture: Chuck never moves your data
Built for Databricks, Backed by Amperity’s Identity Expertise
Chuck runs entirely in your terminal, using your Databricks environment for compute, storage, and LLM execution. With a single install, engineers can run natural language commands that eliminate manual code and deliver accurate, scalable customer profiles.
A core capability of Chuck is running Amperity’s patented identity resolution algorithm – the same trusted Stitch technology used in its enterprise CDP. Users can run unlimited free Stitch on datasets up to 1 million records with a generous budget of credits for larger data sets included for free with the research preview program.
Paid plans unlock unlimited runs, access to Amperity’s stable ID algorithm, and enterprise support.
Experience Chuck Data Live at Databricks Data + AI Summit
Amperity will demo Chuck live at Booth #704 during the Databricks Data + AI Summit, June 9–12, 2025, in San Francisco. Stop by to see how Chuck:
- Resolves millions of identities in seconds
- Creates transparent customer ID graphs
- Streamlines data engineering workflows with AI
Want to see Chuck in action? Check out the demo at www.amperity.com/capabilities/chuck.