TECHNICAL DETAILS                                          CSL
The whole material (the original annotated text and frequency dictionaries) has been transferred into electronic format (relational database - MS Visual FoxPro). Currently, the material is structured as a table, each word and its codes being in a single row. The whole material is also available in ASCII format and could be transferred into any standard spreadsheet or statistical package. 
In its final version the CSL will be in the XML format. Transferring into XML is underway and it is 
expected to be completed by the end of 2001.  
In addition to the main corpus, the final version of the CSL will contain a series of probability matrices that will capture all aspects of Serbian language, spanning from the level of phonology and syllabic structure to the level of inflected morphology.