• P6-0411 - Language resources and technologies for Slovene language
The Client: Javna agencija za raziskovalno dejavnost RS (P6-0411)
Project type: ARRS research programmes
Project duration: 2019 - 2024
  • Description

The main research topic of the programme is modern Slovene, considered especially from the point of view of the rapid digitization of languages and new developments in ICT. The objective of the programme is to conduct research into the specifics of Slovene and to enable the development of resources and technologies according to international standards. The programme is interdisciplinary: in addition to linguistics, it includes computer and information sciences (language technologies) and education (literacy). The programme is conducted by experienced researchers of the Centre for Language Resources and Technologies at the University of Ljubljana (CJVT UL). Research covers five areas: language description, standardization, language technologies, terminology, and multilinguality. These areas cover all levels of description (text linguistics, semantics, syntax, morphology, phonology), focusing on holistic exploration of language phenomena. The research is empirical, based on real language data found in contemporary corpora and similar resources. In the fields of terminology and multilinguality, the programme also covers research into the contact between Slovene and other languages, in order to facilitate the development of multilingual resources and technologies (e.g. for machine translation). The research methodology is rooted in state-of-the-art methods of machine learning and data mining, as used for other languages, within the theoretical framework of computational and corpus linguistics.