A Data Augmentation Platform.
AI or Data Scientist.
- Collecting data takes time: AI is powerful when there are numerous labelled data to train the model; however, data collection is time-consuming.
- Imbalanced data: Imbalance is common and expected in real world, e.g. Medical diagnosis, Spam filtering, and Fraud detection.
- Concern of data privacy: The concerns of data leak and privacy are increasing. It’s getting harder and harder to collect data due to new regulations and guidelines, e.g. GDPR.
What it does
- Generate Synthetic Samples, i.e. pseudo data, for structured/tabular data.The data’s schema, data distribution, and relationship between columns of the generated data are as close to real data as possible.The difference of statistical properties between synthetic and real data is slight.
- A Data Augmentation Platform to provide the data augmentation service with only small amount of data.