Dataset?

#3
by trentmkelly - opened

Would it be possible to get a copy of the dataset used to train this model?

I'm curious about this as well. I'd also like to better understand how the synthetic data generation was conducted and how the dataset was curated, please.

tabularisai org

Thank you for your interest!

Currently, our datasets aren’t publicly available (they’re provided to customers), but we will soon publish a preprint describing our synthetic data–generation pipeline for the Swahili sentiment analysis task. This paper will present the complete methodology behind our approach.

How much does the dataset cost, and what's the license on it if I purchase it?

tabularisai org

@trentmkelly, the price depends on your use case and company size, but the license is fully yours upon purchase. If you are interested, please write to me at vadim@tabularis.ai.
