Gurupriya Adurthy (tran.) (2025) “Optimizing Large Language Model Deployment with Scalable Inference and Ensemble Techniques”, International Journal of Engineering and Advanced Technology (IJEAT), 15(2), pp. 9–14. doi:10.35940/ijeat.A4692.15021225.