Create Synthetic Data
This feature generates structured datasets with configurable properties. It is useful for testing AI models, creating sample datasets, or augmenting existing data. Users define the columns and their constraints, and the tool generates realistic test cases that conform to them.
Steps to Create Synthetic Data
1. Access the Synthetic Data Creation Tool
- Navigate to the Datasets & Experiments section.
- Click on Add Dataset and select Create Synthetic Data from the options.
2. Define Dataset Properties
- Name: Enter a name for the synthetic dataset.
- Description (optional): Add a short description to specify the purpose of this dataset.
- Use Case (optional): Define the objective of this dataset (e.g., chatbot training, classification tasks).
- Pattern (optional): Define a data pattern if applicable.
3. Configure Columns
- Click Add Column to specify the structure of your dataset.
- Column Name: Enter a name for the data field.
- Column Type: Select the appropriate data type (e.g., text, number, boolean).
- Properties:
  - Set constraints like Minimum Length, Maximum Length, or Value Type (static/dynamic).
  - Add multiple properties as needed for data consistency.
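It can help to draft the column definitions before entering them in the form. The sketch below is only a planning aid and not part of the tool: the `ColumnSpec` class, its field names, and the validation rules are hypothetical, chosen to mirror the Column Name, Column Type, Minimum Length, Maximum Length, and Value Type fields described above.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumption: these sets mirror the examples named in this guide;
# the tool itself may offer more column types.
ALLOWED_TYPES = {"text", "number", "boolean"}
ALLOWED_VALUE_TYPES = {"static", "dynamic"}


@dataclass
class ColumnSpec:
    """Hypothetical record of one column as it would be entered in the form."""
    name: str
    column_type: str
    min_length: Optional[int] = None
    max_length: Optional[int] = None
    value_type: str = "dynamic"

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the spec looks consistent."""
        problems = []
        if not self.name.strip():
            problems.append("column name is empty")
        if self.column_type not in ALLOWED_TYPES:
            problems.append(f"unknown column type: {self.column_type}")
        if self.value_type not in ALLOWED_VALUE_TYPES:
            problems.append(f"unknown value type: {self.value_type}")
        if self.min_length is not None and self.min_length < 0:
            problems.append("min_length is negative")
        if (
            self.min_length is not None
            and self.max_length is not None
            and self.min_length > self.max_length
        ):
            problems.append("min_length exceeds max_length")
        return problems


question = ColumnSpec(name="question", column_type="text", min_length=5, max_length=50)
print(question.validate())
```

Checking the constraints on paper first catches contradictions, such as a minimum length larger than the maximum, before they reach the configuration screen.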
4. Specify Data Size
- Define the Number of Rows to generate.
- Ensure the dataset size meets your requirements.
5. Generate the Dataset
- Click Next to review your configuration.
- Click Create to generate the dataset.
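To make the relationship between the configuration and the resulting rows concrete, here is a minimal local sketch. It does not reproduce how the tool generates data: it fills columns with random placeholder values using only the Python standard library, and the dictionary keys (`name`, `type`, `min_length`, `max_length`, `min_value`, `max_value`, `value_type`) are assumptions made for illustration.

```python
import random
import string
from typing import Any, Dict, List


def make_value(column: Dict[str, Any], rng: random.Random) -> Any:
    """Produce one placeholder value that respects the column's constraints."""
    ctype = column["type"]
    if ctype == "text":
        lo = column.get("min_length", 1)
        hi = column.get("max_length", max(lo, 12))
        length = rng.randint(lo, hi)
        return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))
    if ctype == "number":
        return rng.randint(column.get("min_value", 0), column.get("max_value", 100))
    if ctype == "boolean":
        return rng.choice([True, False])
    raise ValueError(f"unsupported column type: {ctype}")


def generate_rows(columns: List[Dict[str, Any]], num_rows: int, seed: int = 0) -> List[Dict[str, Any]]:
    """Build num_rows rows; 'static' columns repeat one value, 'dynamic' ones vary."""
    rng = random.Random(seed)
    # A static column gets a single value that is reused in every row.
    static_values = {
        c["name"]: make_value(c, rng)
        for c in columns
        if c.get("value_type") == "static"
    }
    rows = []
    for _ in range(num_rows):
        row = {}
        for c in columns:
            name = c["name"]
            row[name] = static_values[name] if name in static_values else make_value(c, rng)
        rows.append(row)
    return rows


columns = [
    {"name": "prompt", "type": "text", "min_length": 5, "max_length": 10},
    {"name": "is_valid", "type": "boolean"},
    {"name": "source", "type": "text", "min_length": 3, "max_length": 3, "value_type": "static"},
]
for row in generate_rows(columns, num_rows=3, seed=1):
    print(row)
```

The sketch treats a static column as one value shared by all rows and a dynamic column as a value drawn per row, which is one plausible reading of the Value Type setting.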