The increasing usage of large language models for code generation raises concerns regarding their computational costs and ecological impact. This study evaluates the environmental efficiency of several cutting-edge large language models, including ChatGPT, Claude, Copilot, DeepSeek, Gemini, Mistral, and Qwen, across algorithm and data structure tasks in Python, C++, and Java, selected from HackerRank to ensure practical relevance. A multi-metric, sustainability-focused evaluation framework is proposed, measuring execution time, peak memory usage, energy consumption, and carbon footprint. The Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) is applied to combine algorithm and data structure metrics into scores for each programming language, which are then normalized across models and averaged across languages to compute the GreenAI Efficiency Score. This unified score enables fair, comprehensive ranking of models, promoting environmentally responsible AI selection in software development.
Building similarity graph...
Analyzing shared references across papers
Loading...
Tayrin Tunzina
Mysun Mashira
Md. Motaharul Islam
IEEE Access
SHILAP Revista de lepidopterología
United International University
Building similarity graph...
Analyzing shared references across papers
Loading...
Tunzina et al. (Thu,) studied this question.
synapsesocial.com/papers/69a75ce8c6e9836116a262dc — DOI: https://doi.org/10.1109/access.2026.3658813