Please use this identifier to cite or link to this item: 192.168.6.56/handle/123456789/58397
Title: Formalizing Natural Languages with NooJ and Its Natural Language Processing Applications
Authors: Samir Mbarki, Mohammed Mourchid, Max Silberztein (Kenitra and Rabat, Morocco)
Keywords: Formalizing Natural
Issue Date: 2018
Publisher: Springer
Description: This paper presents how to devise a relevant classification of morphosyntactic tags for the NooJ dictionary of the Rromani language, through three topics: dialectal issues, the treatment of postpositions, and the countability of substantives. This module encompasses all four dialects of Rromani, the isoglosses of which are essentially no longer geographical. We have therefore defined each of the four dialects through a combination of two tags corresponding to specific isoglosses. For instance, the so-called O-bi dialect (i.e. the O-superdialect with no mutation of alveolar affricates) is labelled “rro + rrbi” in NooJ. Then, on typological grounds, we decided to treat Rromani postpositions as agglutinative, non-inflectional morphemes. Rromani postpositions are appended to substantives in the oblique case and are in some cases cumulative (as in Modern Indic). In addition, the postposition of possession may be inflected for gender, number and case like an adjective (basic forms -qo, -qi, -qe, with variants). Accordingly, some 250 potential postposition forms may be encountered, covering all basic dialectal variants. However, they can all be rendered by a much more economical system, suited to both Rromani grammar and computational analysis. Finally, we investigated the system of countability in Rromani nouns where relevant.
URI: http://10.6.20.12:80/handle/123456789/58397
ISBN: 978-3-319-73420-0
Appears in Collections: Rural Development Studies

Files in This Item:
File: 108.pdf; Size: 41.85 MB; Format: Adobe PDF