Dataset?
Would it be possible to get a copy of the dataset used to train this model?
I'm actually curious on this as well, as I'd like also to know a little bit better how the process of synthetic data generation was conducted and the dataset curated, please
Thank you for your interest!
Currently, our datasets aren’t publicly available (they’re provided to customers), but we will soon publish a preprint describing our synthetic data–generation pipeline for the Swahili sentiment analysis task. This paper will present the complete methodology behind our approach.
How much does the dataset cost, and what's the license on it if I purchase it?
@trentmkelly
, the price depends on your use case/company size, but the license is fully yours upon purchase. If you interested please write me vadim@tabularis.ai