Display K-centres

[This page is no longer updated; please visit our new K-Centre catalogue for up-to-date information about our K-Centres]

Full nameCLARIN Knowledge Centre for South Slavic languages
Short nameCLASSLA
URLhttp://www.clarin.si/info/k-centre/
Hosted by(1) Jožef Stefan Institute (CLARIN.SI), Ljubljana, Slovenia
(2) Institute of Croatian Language (IHJ), Zagreb, Croatia
(3) Institute of Information and Communication Technologies (CLADA-BG), Sofia, Bulgaria
City of main hubLjubljana
Country of main hubSI
Active since2019-03-19
Area of competenceCLASSLA offers expertise on language resources and technologies for South Slavic languages. It provides information on freely available lexicons and corpora, which can be used in research in the social sciences and humanities. The CLASSLA-Stanza pipeline allows researchers to perform language processing of their texts to produce their own corpora, while the CLASSLA web corpora as the largest general corpora for all South Slavic languages enable direct language research. The centre provides guidance in how to use the available resources and technologies in research.
Audiences- Computational linguists
- Computer scientists
- Citizen scientists
- Historians
- Language teachers
- Linguists
- Sociolinguists
- Sociologists
Types of services- FAQ
- Helpdesk
- Technical support
- Training
Language portal for- Slovenian
- Slovene
- Croatian
- Bosnian
- Serbian
- Montenegrin
- Macedonian
- Bulgarian
Other languages covered
Modalities covered- Audio: speech
- Text
Linguistic topics- Applied linguistics
- Dialect studies
- Sociolinguistics
Language processing topics- Basic language processing
- Information extraction
- Language understanding
- Named entity recognition
- Processing of morphologically rich languages
- Speech recognition
Data types- Manually annotated datasets
- Corpora
- Language models
- Treebanks
Resource families- Computer-mediated communication corpora (social media)
- Historical corpora
- Literary corpora
- Newspaper corpora
- Parliamentary corpora
- Corpora of academic texts
- Manually annotated corpora
- Multimodal corpora
- Parallel corpora
- Reference corpora
- Spoken corpora
- Language models
- Lexica
- Normalization
- Named entity recognition
- Part-of-speech tagging and lemmatization
- Tools for sentiment analysis
Generic topics- Evaluation of tools and models
- Machine learning
- Deep learning
Other keywords- Processing of closely related languages
- Language variation
- Spatial language variation
Tour de CLARIN introduction https://www.clarin.eu/blog/tour-de-clarin-clarin-knowledge-centre-south-slavic-languages-classla
Tour de CLARIN interviewhttps://www.clarin.eu/blog/tour-de-clarin-interview-zrinka-kolakovic
Last update2024-07-22 10:43:42