scian
·Scian Team
data-qualitygovernancecrm

RevOps Data Governance: A Practical Framework for Clean CRM Data

Every RevOps leader has the same nightmare: the CEO asks a simple question — "How many customers do we have?" — and three different reports give three different answers. The sales dashboard says 847. The CS platform says 912. The billing system says 803. None of them are right, and nobody can explain why.

This isn't a technology problem. You can buy the best CRM, the best BI tool, and the best data enrichment service, and your data will still be garbage if you don't have governance.

Data governance is the unsexy, foundational work of defining who owns what data, how it gets into your systems, what "good" looks like, and how you maintain it over time. It's the difference between a CRM that drives decisions and a CRM that everyone has learned to ignore.

Why CRM Data Degrades

CRM data doesn't start dirty. It becomes dirty through predictable, preventable mechanisms:

Human entry error. Reps type company names differently (Salesforce vs. salesforce.com vs. SFDC). They enter phone numbers without country codes. They put notes in the wrong fields. Manual data entry has an error rate of 1-5%, and with thousands of records, that adds up fast.

Data decay. B2B data decays at roughly 30% per year. People change jobs, companies get acquired, phone numbers change, addresses update. If you're not actively maintaining your data, nearly a third of it becomes inaccurate every year.

Integration drift. When your marketing automation, CRM, CS platform, and billing system disagree on the source of truth, data conflicts accumulate. A contact updated in HubSpot doesn't sync to Salesforce because of a mapping error. An account hierarchy in the billing system doesn't match CRM. These small discrepancies compound.

Scope creep. Over time, teams add fields, objects, and workflows without coordinating with each other. The result is a CRM cluttered with unused fields, redundant data points, and conflicting definitions. What does "Customer" mean — anyone with a signed contract, anyone who's ever paid you, or anyone in the CS platform?

No entry standards. Without validation rules and picklist enforcement, every rep creates data however they want. Free-text fields where structured data is needed. Missing required information. Inconsistent formatting. The absence of standards guarantees inconsistency.

The Data Ownership Model

The first step in data governance is answering the question: who is responsible for this data?

Three Roles of Data Ownership

Data Owner: The business leader accountable for the accuracy and completeness of a data domain. The VP of Sales owns opportunity and pipeline data. The VP of Marketing owns lead and campaign data. The VP of CS owns customer health and renewal data. The data owner defines what "good" looks like and is accountable for quality.

Data Steward: The operational person who implements and enforces governance policies. This is typically someone in RevOps. The data steward writes the validation rules, manages the enrichment processes, runs the hygiene cadences, and monitors data quality dashboards. They're the enforcer.

Data Contributor: Anyone who creates or modifies data — reps, SDRs, CSMs, marketing coordinators. Contributors are responsible for entering data accurately and completely, following the standards defined by the owner and enforced by the steward.

Mapping Ownership

Create a data domain ownership map:

Data DomainData OwnerData StewardPrimary System
Accounts/CompaniesVP SalesRevOps ManagerCRM
Contacts/PeopleVP MarketingMarketing OpsCRM
Opportunities/DealsVP SalesSales Ops AnalystCRM
Products/PricingVP ProductRevOps ManagerCPQ/CRM
Customer HealthVP CSCS OpsCS Platform
Campaign/AttributionVP MarketingMarketing OpsMAP/CRM
Billing/SubscriptionVP FinanceFinance OpsBilling System

This map eliminates ambiguity. When there's a data quality issue with opportunity data, everyone knows who's accountable and who fixes it.

Field-Level Governance

Not every field in your CRM deserves the same level of governance. Categorize fields into tiers:

Tier 1: Critical Fields (Strict Governance)

These fields drive reporting, routing, automation, and decision-making. Errors here have high business impact.

Examples: Account Name, Industry, Employee Count, ARR, Opportunity Stage, Close Date, Amount, Owner, Lead Source, Territory.

Governance rules:

  • Required on record creation (validation rules prevent saving without them)
  • Picklist values only (no free text)
  • Regular enrichment and validation (quarterly minimum)
  • Audit trail enabled
  • Change requires approval for certain fields (e.g., Opportunity Amount above threshold)

Tier 2: Important Fields (Moderate Governance)

These fields are used in segmentation, enrichment, and secondary reporting. Errors cause inefficiency but not critical failures.

Examples: Job Title, Department, Technology Stack, Company Revenue, Lead Score, MEDDIC/BANT qualification fields.

Governance rules:

  • Strongly encouraged but not hard-required on creation
  • Picklist values preferred, free text with format guidelines where needed
  • Enrichment backfill on a monthly cadence
  • Periodic audit (quarterly)

Tier 3: Nice-to-Have Fields (Light Governance)

These fields capture context but don't drive automation or critical reporting.

Examples: Competitor mentioned, Notes, Personal interests (for rapport building), Secondary phone number.

Governance rules:

  • Optional
  • Free text is acceptable
  • No active enrichment
  • Annual review to determine whether the field should be promoted, demoted, or deprecated

Field Deprecation

For every field you add, consider deprecating one. CRM field bloat is real — organizations with 500+ custom fields on the Account object are not uncommon, and most of those fields are unused. Run a field utilization audit annually: any field that's populated on less than 5% of records and hasn't been used in a report in 12 months is a candidate for deprecation.

Validation Rules That Work

Validation rules are your first line of defense against dirty data. But poorly designed validation rules create user frustration and workarounds (reps entering "TBD" or "xxx" to bypass them).

Principles for effective validation rules:

  1. Validate at the right moment. Don't require fields on record creation that the user can't possibly know yet. Require Account Industry when the Account is created, but require MEDDIC fields when the Opportunity moves past Stage 2, not at creation.

  2. Use conditional logic. Validation rules should be contextual. Require a Closed Lost Reason only when the stage is Closed Lost. Require a Next Steps field only on open opportunities with activity older than 14 days.

  3. Provide clear error messages. "Validation error on field X" is useless. "Please select an Industry before saving the Account record. This field is required for territory assignment and reporting" tells the user why the rule exists and how to fix it.

  4. Audit bypass rates. If reps are entering "N/A," "TBD," "Unknown," or "." to bypass validation rules, the rule is either poorly timed, poorly communicated, or requiring information they genuinely don't have. Fix the rule, not the symptom.

Duplicate Management

Duplicates are the most visible data quality problem and one of the hardest to solve permanently.

Prevention

Real-time matching on creation. When a user creates a new Account or Contact, check against existing records in real-time. Salesforce's native duplicate rules handle basic matching; tools like RingLead or Cloudingo provide more sophisticated fuzzy matching.

Web-to-lead deduplication. Marketing forms should check for existing records before creating new ones. This requires matching logic in your marketing automation platform (email match is the minimum; company name + email domain matching catches more).

Integration deduplication. When data flows from external systems (enrichment providers, event platforms, partner portals), run it through deduplication before it enters CRM. Never auto-create records from integrations without matching.

Resolution

Prevention won't catch everything. Run a deduplication process monthly:

  1. Export suspect duplicates using matching rules (exact email, fuzzy company name + state, phone number matching).
  2. Review matches and determine the surviving record (usually the record with the most recent activity or the most complete data).
  3. Merge records, preserving all activity history, opportunities, and case records on the surviving record.
  4. Document merges for audit trail purposes.

Ongoing Metrics

Track duplicate creation rate (new duplicates per month), existing duplicate percentage (total suspected duplicates as a percentage of total records), and merge volume. A healthy CRM has less than 3% suspected duplicates across Accounts and Contacts.

Data Enrichment Strategy

Enrichment fills in the gaps that human entry misses. Here's how to approach it:

Enrichment Providers

ZoomInfo is the dominant player for B2B contact and company data. Firmographic data (revenue, employee count, industry, technology stack) and contact data (email, phone, job title) are strong for North American markets.

Apollo offers similar data at a lower price point and is popular with mid-market companies. Data quality is slightly lower than ZoomInfo but adequate for most use cases.

Clearbit (now part of HubSpot) provides real-time enrichment and is well-integrated with HubSpot CRM. Good for HubSpot-native organizations.

6sense and Bombora provide intent data rather than firmographic enrichment — they tell you which accounts are actively researching topics related to your product. This is enrichment of a different kind, focused on buying signals rather than demographics.

Enrichment Cadence

On creation: Enrich new records as they enter CRM. This ensures baseline data quality from the start.

Monthly refresh: Re-enrich Tier 1 fields monthly to catch job changes, company updates, and data decay.

Quarterly deep refresh: Re-enrich all fields quarterly, including Tier 2 fields and technology stack data.

Trigger-based enrichment: Enrich records when they hit specific lifecycle stages (MQL, SQL, Opportunity Created) to ensure the data supporting sales engagement is current.

Enrichment Rules

Not all enriched data should overwrite existing data. Define rules:

  • Enriched data overwrites only if the existing field is blank or outdated (older than 6 months).
  • Enriched data never overwrites human-verified data (e.g., a rep confirmed the contact's direct phone number — the enrichment provider's data shouldn't overwrite it).
  • Log all enrichment-driven changes for audit purposes.

Data Hygiene Cadence

Data governance isn't a project. It's a process. Build these recurring activities into your RevOps calendar:

Weekly: Review and merge new duplicates. Address data quality alerts from validation rules (high bypass rates, incomplete required fields).

Monthly: Run enrichment refresh on Tier 1 fields. Audit lead routing accuracy (are leads going to the right owners?). Review and clean up bounced emails and invalid phone numbers.

Quarterly: Field utilization audit (are fields being used?). Health score back-testing (is data quality correlated with score accuracy?). Re-enrichment of all active records. Review and update validation rules based on user feedback and bypass rates.

Annually: Full data audit — record counts by object, data completeness percentage, duplicate rate, enrichment coverage. Review and deprecate unused fields. Update the governance policy document.

Compliance: GDPR, CCPA, and Beyond

Data governance isn't just about operational efficiency — it's about legal compliance.

Key requirements:

  • Right to deletion. You must be able to delete a person's data from all systems upon request. If your CRM, marketing automation, CS platform, and data warehouse all have separate copies of a contact record, you need a process to find and delete all instances.

  • Consent management. Track and honor communication preferences. If a contact opts out of marketing emails, that preference must be enforced across all systems that send email.

  • Data minimization. Collect only the data you need. Those 500 custom fields? Many of them may violate the principle of data minimization. If you can't articulate a business purpose for a field, you probably shouldn't be collecting it.

  • Data processing records. Document what data you collect, why, how long you retain it, and who has access. This is a regulatory requirement under GDPR and increasingly expected under CCPA and other frameworks.

RevOps's role: Work with legal to implement technical controls (consent fields, deletion workflows, retention policies) and maintain compliance documentation. Data governance and compliance governance are the same discipline.

Measuring Data Health

You need a data health score — a single metric (or small set of metrics) that tells you whether your CRM data is trustworthy.

Components of a data health score:

  • Completeness: Percentage of Tier 1 fields populated across all records. Target: 95%+.
  • Accuracy: Percentage of records where enriched data matches existing data (a proxy for accuracy). Measured during enrichment refreshes. Target: 90%+.
  • Uniqueness: Inverse of duplicate rate. Target: 97%+ unique records.
  • Timeliness: Percentage of records enriched or verified within the last 90 days. Target: 80%+.
  • Consistency: Percentage of records with standardized formatting (consistent naming, valid picklist values, proper data types). Target: 95%+.

Roll these into a composite score (0-100) and track it monthly. Report it to leadership alongside revenue metrics. Data quality is revenue quality — when the data is bad, the decisions are bad, and the revenue follows.

Clean CRM data isn't a destination. It's a practice. Build the framework, assign the ownership, automate what you can, and measure relentlessly. The companies that treat data as a strategic asset outperform those that treat it as an afterthought — not because they have better technology, but because they make better decisions.

Related Articles

Get your free CRM health score

Connect HubSpot. Get your data quality score in 24 hours. No commitment.

Start Free Assessment