THEORETICAL AND METHODOLOGICAL FOUNDATIONS FOR CREATING A LINGUISTIC BASE OF THE FERGANA VALLEY DIALECTS AND INTEGRATING THEM INTO THE NATIONAL CORPUS OF THE UZBEK LANGUAGE

Vosiljonov Azizbek Boxodirjon ugli

Abstract

This article examines the systematic study of the linguistic features of the dialects spoken in the Fergana Valley, their digitization, and their integration into the national corpus of the Uzbek language. Phonetic, morphological, and lexical units were identified from field materials, and a multi-layered linguistic annotation model was developed. The article describes how dialectal texts are transcribed, normalized, enriched with metadata, and adapted to the corpus architecture. It also discusses the importance of determining areal differences, of enabling linguostatistical analysis, and of a dialectal module in applied language technologies. The results propose a methodological model for preserving and systematizing the Fergana Valley dialects in a digital environment.

How to Cite

THEORETICAL AND METHODOLOGICAL FOUNDATIONS FOR CREATING A LINGUISTIC BASE OF THE FERGANA VALLEY DIALECTS AND INTEGRATING THEM INTO THE NATIONAL CORPUS OF THE UZBEK LANGUAGE. (2026). Journal of Multidisciplinary Sciences and Innovations, 5(02), 1292-1294. https://doi.org/10.55640/
