Balanced, realistic synthetic data
All methods
of data generation
Supporting the 4 key synthetic
data generation techniques
Any use case
with 1 set of tools
Testing apps, training AI/ML models, sharing B2B data, and much more
Self-service
portal and APIs
Empowering data teams with
full control and automation
Generating synthetic data by business entities
Our patented approach makes all the difference
Auto-discover the business entity schemas (e.g., customer, device, loan, order, etc.) for which the synthetic data is needed.
Apply the appropriate data generation method(s) to the data model, to create the most complete, accurate, and compliant synthetic data possible.
Deliver the data to the target systems and manage access, reservation, versioning, rollback, and integration with CI/CD and ML pipelines.
Combining all 4 data generation methods
- 01 Generative AI
- 02 Rules Engine
- 03 Entity Cloning
- 04 Data Masking
01Generative AI
Generative AI is used when there's not enough production data to:
- Subset the source data needed to train the model
- Mask the training data to ensure compliance
- Train the GPT model to generate the synthetic data
- Apply business rules to increase accuracy
02Rules Engine
Rules engines, used for testing new application functionality, must:
- Generate data based on pre-defined business rules – on demand or via API
- Create business entities, such as customers, automatically
- Customize, test, and debug functions, without coding
- Define business rule parameters
03Entity Cloning
Entity cloning is used for performance and load testing to:
- Generate massive datasets on demand
- Select the most relevant business entity (e.g., a customer with the right criteria for a particular test case)
- Extract, mask, and clone the entity along with all its data
- Create unique identifiers for every cloned entity
04Data Masking
Data masking, which obscures sensitive data, must:
- Anonymize sensitive data in a very lifelike way
- Discover Personally Identifiable Information (PII) automatically
- Customize data masking functions
- Mask data inflight, as it’s extracted from the underlying source systems
Beyond synthetic data generation
Manage the synthetic data lifecycle
K2view has the only end-to-end synthetic data management solution, supporting data extraction, generation, pipelining, and operations.
-
Provision compliant data subsets, code-free
-
Mask and transform the data, in flight
-
Reserve data subsets for individual users
-
Version and roll back datasets on demand
-
Integrate data into CI/CD and ML pipelines via APIs