The Natural Language Processing Axioms in Classical Tamil for Zonal Dialects using Machine Learning
Main Article Content
Abstract
Studying Natural Language Processing (NLP) for Classical Tamil and its Zonal Dialects using Machine Learning (ML) involves unique challenges and opportunities. Classical Tamil, being one of the oldest languages with a rich literary heritage, differs significantly in syntax, semantics, and phonetics from its modern dialects. Addressing these differences requires incorporating linguistic axioms and cultural nuances into NLP systems. This deals with Tamil letters, Challenges, Future directions, and Lexical Differences. It also includes parsers, tokenisation. Lists the differences between Morphemes, Bounded Morphemes in terms of Tamil as a Natural Language processing—dictionary form of the words used in Lemmatizations. Stemming is used to reduce the words and Tamil represented as a short sentence. Comparison of differences in the dialect of Tamil taken and represented. The methodology used is Clustering algorithms can group zonal dialects based on phonetic and semantic similarities using a Naïve Bayes classifier. We are using Speech to Text for identifying the Tamil dialect. This zonal dialect plays an important role in Entertainment, education, Information, and Business purposes. More Exploration can be done using Zonal dialects in Classical Tamil. Machine learning plays a role in classification, Grouping, and Segmenting Natural Language processing. For a single word in Natural Language processing, we have different dialects in the Single Language Tamil. Encourages local people to communicate fluently in terms of transactions. Preserving local traditions and customs is the advantage of Zonal Dialects. It can be used in interviews, recordings, written and spoken texts, and debates. Linguistic Diversity, preservation of History, and Cultural Identity are the major concerns in the field of Zonal dialects using classical Tamil.
Downloads
Article Details
Section

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
How to Cite
References
Mounikamarreddy, IIITH, India, Am I a Resource-Poor Language? Data Sets, Embeddings, Models and Analysis for four different NLP tasks in Telugu Language-ACM Transactions on Asian and Low-Resource Language Information Processing · April 2022. https://doi.org/10.1145/3531535
Omprakash Yadav1, Alcina Judy1, Praveen D’souza1, Calvin Galbaw1, Hinal Rane1-International Journal of Applied Sciences and Smart Technologies-2021 https://doi.org/10.24071/ijasst.v3i2.2826
Effect of Regional Dialects in Learning Tamil Language N. Sulochana- International Research Journal of Tamil-2022. http://doi.org/10.34256/irjt22s91
Dr. S. Suriya1, S. Nivetha2, P. Pavithran2, Ajay Venkat S.2, Sashwath K. G.2, Elakkiya G.2-Effective Tamil Character Recognition Using Supervised Machine Learning Algorithms- EAI Endorsed Transactions on e-Learning-2023. http://doi.org/10.4108/eetel.v8i2.3025
J. Angelin Jeba1, S. Rubin Bose2, R. Regin3, S. Suman Rajest4, Md Mahdi Hasan5-Intelligent Tamil Video Summarization: AI-Powered NLP, Translation, and Speech Integration for Enhanced Accessibility-FMDB Transactions on Sustainable Computer Letters-2024. https://doi.org/10.69888/FTSCL.2024.000179
Udhayakumar, Dr. A., & Rajasekar, M. (2019). Advanced Tamil POS Tagger for Language
Learners. In International
Journal of Innovative Technology and Exploring Engineering (Vol. 8, Issue 10, pp. 741–745). DOI: https://doi.org/10.35940/ijitee.j8886.0881019
Debbarma, A., Bhattacharya, Dr. P., & Purkayastha, Prof. B. S. (2019). Named Entity Recognition for a Low Resource Language. In International Journal of Recent Technology and Engineering (IJRTE) (Vol. 8, Issue 3, pp. 587–590). DOI: https://doi.org/10.35940/ijrte.b2085.098319
Chetia, G., & Hazarika, G. C. (2019). Single Document Text Summarization of a Resource-Poor Language using an Unsupervised Technique. In International Journal of Engineering and Advanced Technology (Vol. 9, Issue 1, pp. 6278–6281). DOI: https://doi.org/10.35940/ijeat.a2250.109119