Gurupriya Adurthy, trans. 2025. “Optimizing Large Language Model Deployment With Scalable Inference and Ensemble Techniques”. International Journal of Engineering and Advanced Technology (IJEAT) 15 (2): 9-14. https://doi.org/10.35940/ijeat.A4692.15021225.