Clustered search technology that uses a data sample and returns matches from the TAUS Data repository, according to domain relevance. With this methodology you get clean, high-quality, high-fidelity datasets for MT training, tuned to your specific content.
Tell us what kind of data you are looking for by providing a sample that represents the domain and language pair of interest.
We identify the best matching data in the TAUS Data repository, on a segment-level, and create three separate data selections.
We share with you the volumes (number of words and segments), samples and price. You purchase only if you like the results.