

Matching Data Service


Our clustered search technology takes a data sample and returns matching segments from the TAUS Data repository, selected by domain relevance. With this methodology you get clean, high-quality, high-fidelity datasets for MT training, tuned to your specific content.

Yes, I’d like to try the service for FREE

How It Works


You Provide Us with a Data Sample

Tell us what kind of data you are looking for by providing a sample that represents the domain and language pair of interest.


We Search for Matching Sentences

We identify the best-matching data in the TAUS Data repository at the segment level and create three separate data selections.


You Review Matching Data Selections

We share the volumes (number of words and segments), samples, and price with you. You purchase only if you like the results.

Explore the Ready-Made Datasets