The Natural Language Processing Axioms in Classical Tamil for Zonal Dialects using Machine Learning

Perumal Sivaraman; Dr. Prabaharan G; Dr. Senthil Kumar R

doi:10.35940/ijitee.F1096.14060525

PDF

Published: 30-05-2025

DOI: https://doi.org/10.35940/ijitee.F1096.14060525

Keywords:

(must be 3-5), Zonal Dialects, Machine Learning, Naïve Bayes

Perumal Sivaraman

Department of Information Technology, UTAS - Nizwa, Oman (Nizwa), Oman.

https://orcid.org/0009-0003-5301-8968

Dr. Prabaharan G

Department of Computer Science and Engineering, Vel Tech Rangarajan, Dr. Sagunthala R and D Institute of Science and Technology, Morai (Tamil Nadu), India.

https://orcid.org/0000-0002-8365-5322

Dr. Senthil Kumar R

Department of Computer Science and Engineering, Jain Deemed University, Bengaluru (Karnataka), India.

https://orcid.org/0000-0002-1393-756X

Abstract

Studying Natural Language Processing (NLP) for Classical Tamil and its Zonal Dialects using Machine Learning (ML) involves unique challenges and opportunities. Classical Tamil, being one of the oldest languages with a rich literary heritage, differs significantly in syntax, semantics, and phonetics from its modern dialects. Addressing these differences requires incorporating linguistic axioms and cultural nuances into NLP systems. This deals with Tamil letters, Challenges, Future directions, and Lexical Differences. It also includes parsers, tokenisation. Lists the differences between Morphemes, Bounded Morphemes in terms of Tamil as a Natural Language processing—dictionary form of the words used in Lemmatizations. Stemming is used to reduce the words and Tamil represented as a short sentence. Comparison of differences in the dialect of Tamil taken and represented. The methodology used is Clustering algorithms can group zonal dialects based on phonetic and semantic similarities using a Naïve Bayes classifier. We are using Speech to Text for identifying the Tamil dialect. This zonal dialect plays an important role in Entertainment, education, Information, and Business purposes. More Exploration can be done using Zonal dialects in Classical Tamil. Machine learning plays a role in classification, Grouping, and Segmenting Natural Language processing. For a single word in Natural Language processing, we have different dialects in the Single Language Tamil. Encourages local people to communicate fluently in terms of transactions. Preserving local traditions and customs is the advantage of Zonal Dialects. It can be used in interviews, recordings, written and spoken texts, and debates. Linguistic Diversity, preservation of History, and Cultural Identity are the major concerns in the field of Zonal dialects using classical Tamil.

Downloads

Download data is not yet available.

Issue

Vol. 14 No. 6 (2025): Volume-14 Issue-6, May 2025

Section

Articles

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

CC-BY-NC-ND 4.0

How to Cite

[1]

Perumal Sivaraman, Dr. Prabaharan G, and Dr. Senthil Kumar R , Trans., “The Natural Language Processing Axioms in Classical Tamil for Zonal Dialects using Machine Learning”, IJITEE, vol. 14, no. 6, pp. 36–44, May 2025, doi: 10.35940/ijitee.F1096.14060525.

References

Mounikamarreddy, IIITH, India, Am I a Resource-Poor Language? Data Sets, Embeddings, Models and Analysis for four different NLP tasks in Telugu Language-ACM Transactions on Asian and Low-Resource Language Information Processing · April 2022. https://doi.org/10.1145/3531535

Omprakash Yadav1, Alcina Judy1, Praveen D’souza1, Calvin Galbaw1, Hinal Rane1-International Journal of Applied Sciences and Smart Technologies-2021 https://doi.org/10.24071/ijasst.v3i2.2826

Effect of Regional Dialects in Learning Tamil Language N. Sulochana- International Research Journal of Tamil-2022. http://doi.org/10.34256/irjt22s91

Dr. S. Suriya1, S. Nivetha2, P. Pavithran2, Ajay Venkat S.2, Sashwath K. G.2, Elakkiya G.2-Effective Tamil Character Recognition Using Supervised Machine Learning Algorithms- EAI Endorsed Transactions on e-Learning-2023. http://doi.org/10.4108/eetel.v8i2.3025

J. Angelin Jeba1, S. Rubin Bose2, R. Regin3, S. Suman Rajest4, Md Mahdi Hasan5-Intelligent Tamil Video Summarization: AI-Powered NLP, Translation, and Speech Integration for Enhanced Accessibility-FMDB Transactions on Sustainable Computer Letters-2024. https://doi.org/10.69888/FTSCL.2024.000179

Udhayakumar, Dr. A., & Rajasekar, M. (2019). Advanced Tamil POS Tagger for Language

Learners. In International

Journal of Innovative Technology and Exploring Engineering (Vol. 8, Issue 10, pp. 741–745). DOI: https://doi.org/10.35940/ijitee.j8886.0881019

Debbarma, A., Bhattacharya, Dr. P., & Purkayastha, Prof. B. S. (2019). Named Entity Recognition for a Low Resource Language. In International Journal of Recent Technology and Engineering (IJRTE) (Vol. 8, Issue 3, pp. 587–590). DOI: https://doi.org/10.35940/ijrte.b2085.098319

Chetia, G., & Hazarika, G. C. (2019). Single Document Text Summarization of a Resource-Poor Language using an Unsupervised Technique. In International Journal of Engineering and Advanced Technology (Vol. 9, Issue 1, pp. 6278–6281). DOI: https://doi.org/10.35940/ijeat.a2250.109119

Article Sidebar

Main Article Content