The Corpus of Serbian Language CSL
was compiled from a sample of 11 million words and spans the Serbian language
from the 12th century to the contemporary language.
Each word in the CSL
is manually tagged for its grammatical status, number of graphemes
and syllables and phonological structure. The text is also tagged for the
beginning and the end of sentences and paragraphs.
The CSL project
was initiated and conducted in the late fifties by Prof. Đorđe Kostić at
the Institute for Experimental Phonetics and Speech Pathology in Belgrade. In
1996 through joint efforts of the Institute for Experimental Phonetics
and Speech Pathology and the Laboratory for Experimental Psychology, University
of Belgrade the project was reactivated and the material transferred into
an electronic format.
CSL is part of a
broader project entitled A Quantitative Description of the Serbian Language
Structure - QDSLS, whose goal is to provide a quantitative diachronic
description of the Serbian language.
was and still is financed entirely by the Institute for Experimental Phonetics
and Speech Pathology, Belgrade. The Director of the project is Prof. Aleksandar
Kostić, Director of the Laboratory for Experimental Psychology, Department
of Psychology, University of Belgrade. The chief editor of all QDSLS publications
is Dr. Mirjana Sovilj, who is the director of the Institute for Experimental
Phonetics and Speech Pathology.