Gurupriya Adurthy, translator. “Optimizing Large Language Model Deployment With Scalable Inference and Ensemble Techniques”. International Journal of Engineering and Advanced Technology (IJEAT), vol. 15, no. 2, Dec. 2025, pp. 9-14, https://doi.org/10.35940/ijeat.A4692.15021225.