K2view named a Visionary in Gartner’s latest Magic Quadrant for Data Integration 🎉

Read More
Start Free
Book a Demo
New! 2025 State of Test Data Management Survey 📊
Get the Survey Results arrow--cta

Test data management Salesforce style: Sandboxes – yes, quality test data – no

Amitai Richman

Amitai Richman,Product Marketing Director

In this article

    Get Gartner Report
    report

    Gartner® Report

    Market Guide for Test Data Management

    Get Gartner Report

    Table of Contents

    Test data management Salesforce style: Sandboxes – yes, quality test data – no
    8:32

    Improve test data management in Salesforce with precise subsetting, inflight data masking, and synthetic data generation – via a business entity approach. 

    The problem with test data at Salesforce 

    Salesforce doesn’t lack sandboxes. It lacks reliable, realistic, compliant test data delivered fast enough to keep up with development.

    Ask any Salesforce data architect, QA lead, or DevOps engineer, and you’ll most likely hear the same story:

    • Long waits for full-copy refreshes 

    • Broken relationships after seeding 

    • Inconsistent data masking 

    • Test failures no one can reproduce 

    This article will explain why Salesforce test data is uniquely challenging, what good Test Data Management (TDM) requires, and why traditional tools aren’t built for this job. It will also show how an entity-based test data management approach delivers complete, compliant test data in minutes to Salesforce, and any other platform. 

    The complaints nobody is addressing 

    The most common claims include: 

    • Full-copy refreshes take days or weeks and still require cleanup. 

    • Partial-copy sandboxes never have the data you need. 

    • Seeding tools often break object relationships. 

    • Masking is performed after data lands in sandboxes, creating compliance risks. 

    • Salesforce systems become cluttered with stale and corrupted data. 

    So, engineers tend to fabricate test data manually, resulting in slower testing, and more defects reaching production. 

    Why Salesforce data breaks test data approaches 

    Here’s why Salesforce data tends to throw a wrench into the works of many of the most well-known TDM solutions

    1.    Complex object hierarchies 

    Accounts, contacts, cases, opportunities, orders, entitlements, custom objects, and managed package objects are deeply connected. When one link is missing, tests break. 

    Sandboxes don’t reflect production – or reflect it too closely, because: 

         • Dev Pro sandboxes lack realistic data. 


        • Full Copies carry raw PII into non-production environments.  


    2.    Strict rules enforcement 

    Governance limits cause fragility, in the sense that:  

        •  Large seeding jobs time out or require complex retry logic. 

        •  Salesforce is rarely the only system involved. 

        •  Billing, ERP, service, order management, and data lakes all store-related data. 

    Test data must reflect the entire business process, not just Salesforce.  

    3.    Use of anti-patterns 

    Anti-patterns are workarounds, not solutions: 

    •  Cloning full-copy orgs to reset environments. 

    •  Restoring production backups into lower tiers. 

    •  Using Excel to create data manually. 

    What Salesforce teams need from TDM 

    High-quality test data requires more than just periodic refreshes. See the other requirements, and the reasons for them, in the followng table: 

    Requirement  Why it matters 
    Data subsetting  Enables system agility and tighter feedback loops 
    Referential integrity  Prevents broken relationships and missing parents 
    Inflight data masking  Ensures PII is never exposed in lower tiers 
    Synthetic data generation  Covers negative paths and rare edge cases 
    Repeatability for CI/CD  Enables real shift-left testing 
    Provisioning in minutes  Eliminates wait time and speeds delivery 

    These test data best practices also support a shift-left testing approach, modern releases, and automation. 

    Common TDM approaches and where they fail 

    Backup tools, sandboxes, and seeding utilities all have value, but none were designed with test data management in mind. 

    1. Backup tools (Own, Odaseva) 

      Great for disaster recovery – but not for test data management – backups recreate the past, while TDM provisions for the future. Using a backup tool for test data is like using a sledgehammer to hang a picture. It’s too heavy, too slow, and likely to damage the surrounding structure. And backup tools can’t generate targeted, masked, scenario-specific test data. 

    2. Salesforce native tools 

      Salesforce has the following built-in TDM capabilities: 

      •  Sandbox refreshes are slow, expensive, and reintroduce PII. 

      •  Scratch Orgs are great for configuration but not for realistic data. 

      • Data Loader cannot preserve complex relationships. 

    3. Automation frameworks 

      Test data automation frameworks automate deployments but still rely on incomplete and inconsistent data inputs. 

    4. Point seeding tools 

      Point seeding tools seed objects but have difficulty with deep relationships, managed packages, attachments, cross-system consistency, and consistent masking. Plus, broken or missing data makes automation unreliable. 

    Salesforce use cases that benefit most from TDM 

    Below are examples of real Salesforce use cases that would benefit greatly from the right test data preparation tools

    • Regression testing for seasonal releases: Validation cycles shrink from days to hours. 

    • UAT scenarios: Provision complete entities such as customers with cases, orders, contracts, and entitlements.

    • CI/CD and shift-left testing: Provision data automatically inside pipelines. 

    • End-to-end integrated testing: Business processes span Salesforce, billing, ERP, and service systems. Test data must reflect this reality. 

    • Migration and consolidation: Repeatable provisioning dramatically accelerates cutover rehearsal. 

    Critical capabilities for TDM of Salesforce data 

    Below are 3 most important test data management capabilities for Salesforce: 

    1. Data subsetting 

      Data subsetting provisions exactly what an environment needs. For example: Provisioning a Dev Pro sandbox with exactly 500 relevant customer entities, including cases, orders, subscriptions, and activities, in under 10 minutes. Subsetting reduces storage usage, stabilizes environments, and speeds up testing. 

    2. Inflight data masking 

      Dynamic data masking masks data before it reaches the sandbox, thus protecting names, contact information, addresses, custom fields, attachments, notes, and logs. But the referential integrity of the masking must be consistent across Salesforce and all other external systems. 

    3. Synthetic data generation 

      Synthetic data generation covers a wide variety of test scenarios, such as payment failures, escalations, order exceptions, negative test paths, and rare conditions – all without exposing real PII. Production data doesn’t for the most part. 

    Entity-based test data management at Salesforce 

    Where traditional tools treat Salesforce data as disconnected tables, the K2view entity-centric test data management approach treats them as complete entities. This matters because a customer is not a row or a column, but a complete dataset aggregated from multiple, disparate sources such as accounts, orders, invoices, and subscriptions.

    Patented Micro-Database™ technology packages everything related to one business entity (one customer, for example) into a complete, consistent, and compliant unit.

    Traditional subsetting is like copying a contact but losing the phone number and message history. K2view copies and stores every snippet of information associated with the business entity – eliminating the most common cause of Salesforce test data instability: missing context.

    So, if, today, a Salesforce team waits 1 week for a full-copy refresh and then spends an additional 2 days manually masking the test data, many tests would likely fail due to stale data and broken relationships.  

    With entity-based TDM, however, the timetable would look very different, hypothetically something like this: 

    • QA requests a "customer with cases and orders" slice. 

    • K2view retrieves and masks the data on the fly. 

    • The data is provisioned into a Dev Pro sandbox. 

    • Everything is complete and intact in under 10 minutes. 

    The outcomes would also be quite different: 

    • 80-90% faster environment setup 
    • Zero PII exposure in lower tiers 
    • Stable, repeatable data for automated testing 

     Therefore, a complete test data management Salesforce program should include: 

    1. Clear definitions of test data needs by business process 
    2. A mapped sandbox strategy aligned to testing requirements 
    3. Data masking policies that meet compliance standards 
    4. Entity-based subsetting rules 
    5. Automated provisioning within CI/CD pipelines 
    6. Synthetic data support for missing or rare scenarios 
    7. A method for validating cross-system referential integrity 

    K2view accelerates every step through automation, entity modeling, and in-flight data masking. 

    Learn how K2view Test Data Management tools 
    deliver test data of the highest quality in minutes. 

     

    Achieve better business outcomeswith the K2view Data Product Platform

    Solution Overview
    Get Gartner Report
    report

    Gartner® Report

    Market Guide for Test Data Management

    Get Gartner Report