Optimizing Large Language Model Deployment with Scalable Inference and Ensemble Techniques. International Journal of Engineering and Advanced Technology (IJEAT), [S. l.], v. 15, n. 2, p. 9–14, 2025. DOI: 10.35940/ijeat.A4692.15021225. Disponível em: https://journals.blueeyesintelligence.org/index.php/ijeat/article/view/954.. Acesso em: 31 dec. 2025.