About TS Corpus ML Tools


TS Corpus is a Free&Independent Project that aims building Turkish corpora, NLP tools and linguistic datasets.

Since 2011, we released various corpora for Turkish.

For some of these corpora, some specific tools were required. In order to fulfill our needs, we developed various tools.

For TS TimeLine Corpus , we had to built to two machine learning models

  • a language guesser, to classify Turkish and English news, we harvested from the sources.
  • a machine learning model to classify news category.
  • Under this subdomain, we aim to share these machine learning models as

  • online available tools and
  • via API's
  • These tools are free to use for academic studies and researches, but restricted for commercial usage.

    Please note that, any text uploaded by users are saved for later studies and might be used to enhance the accuracy of the served models.

    For feedback, please use this form