A Tool I Built for Synthetic Datasets
I built this tool because finding datasets for NLP or tabular work is tough. It uses an AI API to generate synthetic data. You define columns with names, types, and prompts, then set the number of rows: 50,000 or more, as many as you need. It’s written in Python and has a basic interface. It’s on GitHub here: https://github.com/VoxDroid/Zylthra. I needed it for some work, and it does the job. If anyone tries it, let me know what’s off.
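
To give a rough idea of what "columns with names, types, and prompts" means, here is a minimal sketch. It is not Zylthra's actual code or API: `ColumnSpec`, `build_prompt`, `generate_rows`, and `stub_value` are hypothetical names made up for illustration, and the AI API call is replaced by a stub so the snippet runs without a key.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class ColumnSpec:
    """One column of the synthetic dataset (hypothetical structure)."""
    name: str
    dtype: str   # e.g. "string" or "integer"
    prompt: str  # instruction describing what values to generate


def build_prompt(columns: List[ColumnSpec], num_rows: int) -> str:
    """Combine the column definitions into a single text request."""
    lines = [f"Generate {num_rows} rows with these columns:"]
    for col in columns:
        lines.append(f"- {col.name} ({col.dtype}): {col.prompt}")
    return "\n".join(lines)


def generate_rows(
    columns: List[ColumnSpec],
    num_rows: int,
    generate_value: Callable[[ColumnSpec, int], str],
) -> List[Dict[str, str]]:
    """Build rows by calling generate_value(column, row_index) per cell.

    generate_value stands in for the AI API call, which is not shown here.
    """
    return [
        {col.name: generate_value(col, i) for col in columns}
        for i in range(num_rows)
    ]


def stub_value(col: ColumnSpec, row_index: int) -> str:
    """Placeholder generator (assumption) used instead of a real API."""
    return f"{col.name}_{row_index}"


columns = [
    ColumnSpec("review", "string", "A short product review"),
    ColumnSpec("rating", "integer", "A star rating from 1 to 5"),
]
rows = generate_rows(columns, 3, stub_value)
print(build_prompt(columns, 3))
print(rows[0])
```

In a real run, the stub would be swapped for a function that sends the prompt to whichever AI API is configured and parses the response into values of the declared type.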
