A testing environment for analyzing Large Language Model (LLM) behavior along two axes: energy consumption and comparative performance.
Analyze the energy consumption and carbon footprint of LLM modifications. Track energy use in watt-hours per token (Wh/token) across different benchmarks.
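As a rough illustration of the Wh/token metric, the sketch below derives it from an average power draw, a run duration, and a token count. The function names, the sample numbers, and the grid carbon intensity value are assumptions made for illustration; they are not taken from app_energy.py, which may measure energy differently.

```python
def wh_per_token(avg_power_watts, duration_seconds, tokens):
    """Return energy per generated token in watt-hours (hypothetical helper)."""
    if tokens <= 0:
        raise ValueError("tokens must be positive")
    # Watts * seconds = joules; 3600 joules = 1 watt-hour.
    energy_wh = avg_power_watts * duration_seconds / 3600.0
    return energy_wh / tokens


def grams_co2(energy_wh, grid_intensity_g_per_kwh):
    """Convert energy in Wh to grams of CO2 for an assumed grid intensity."""
    return energy_wh / 1000.0 * grid_intensity_g_per_kwh


# Illustrative numbers only: 250 W average draw for 12 s, 400 tokens generated.
per_token = wh_per_token(250.0, 12.0, 400)
# 400 g CO2/kWh is an assumed placeholder, not a measured value.
footprint = grams_co2(per_token * 400, 400.0)
print(f"{per_token:.6f} Wh/token, {footprint:.4f} g CO2 for the run")
```

Keeping the energy and carbon conversions separate makes it possible to reuse the same Wh/token figure with different grid intensities.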
Simple model testing and comparison. Compare base vs instruction-tuned models with basic metrics.
For specialized use cases, standalone versions of each environment are available:
python3 app_energy.py
Port 8002 - Energy only
python3 app_comparison.py
Port 8004 - Comparison only
ollama serve
ollama pull llama3.1:8b
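Once the server is running and the model is pulled, a quick check that the model responds might look like the sketch below. It assumes Ollama's default local address (http://localhost:11434) and its /api/generate endpoint, whose non-streaming response reports eval_count (generated tokens) and eval_duration (nanoseconds). The helper names are hypothetical and are not part of this project.

```python
import json
import urllib.request

# Assumed default Ollama address; adjust if the server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"


def query_ollama(prompt, model="llama3.1:8b", url=OLLAMA_URL):
    """Send one non-streaming generation request and return the parsed JSON."""
    payload = json.dumps({"model": model, "prompt": prompt, "stream": False})
    request = urllib.request.Request(
        url,
        data=payload.encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def tokens_per_second(result):
    """Compute generation throughput from an Ollama response dict."""
    # eval_duration is reported in nanoseconds.
    seconds = result["eval_duration"] / 1e9
    return result["eval_count"] / seconds
```

With the server running, calling `tokens_per_second(query_ollama("Say hello."))` gives a rough throughput figure for the pulled model.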