Comparative evaluation of multiple LLMs on accuracy and efficiency.
AI & Machine Learning
July 2025



A benchmarking harness to compare multiple LLMs on code generation tasks, scoring accuracy, latency, and cost to guide model selection.
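As a rough illustration of what such a harness might look like, here is a minimal sketch that scores each model on the three axes named above. Everything in it is an assumption made for illustration rather than the project's actual code: the names `Task`, `ModelSpec`, `Score`, `benchmark`, and `passes_add` are hypothetical, the models are local stub functions instead of real API clients, and cost is modeled as a flat price per call.

```python
import time
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Task:
    prompt: str
    check: Callable[[str], bool]  # True if the generated code is accepted


@dataclass
class ModelSpec:
    name: str
    generate: Callable[[str], str]  # hypothetical wrapper around a model call
    cost_per_call: float            # assumed flat price per request, in USD


@dataclass
class Score:
    name: str
    accuracy: float        # fraction of tasks whose check passed
    mean_latency_s: float  # average wall-clock seconds per generation
    total_cost: float      # cost_per_call multiplied by the number of tasks


def benchmark(model: ModelSpec, tasks: List[Task]) -> Score:
    """Run every task through one model and aggregate the three metrics."""
    passed = 0
    latencies = []
    for task in tasks:
        start = time.perf_counter()
        output = model.generate(task.prompt)
        latencies.append(time.perf_counter() - start)
        if task.check(output):
            passed += 1
    return Score(
        name=model.name,
        accuracy=passed / len(tasks),
        mean_latency_s=sum(latencies) / len(latencies),
        total_cost=model.cost_per_call * len(tasks),
    )


def passes_add(code: str) -> bool:
    """Accept the code if it defines add(a, b) and add(2, 3) returns 5."""
    namespace = {}
    try:
        # Illustration only: never exec untrusted model output outside a sandbox.
        exec(code, namespace)
        return namespace["add"](2, 3) == 5
    except Exception:
        return False


# Stub "models" standing in for real LLM clients (assumption).
good_model = ModelSpec("stub-good", lambda p: "def add(a, b):\n    return a + b", 0.002)
bad_model = ModelSpec("stub-bad", lambda p: "def add(a, b):\n    return a - b", 0.001)

tasks = [Task("Write a Python function add(a, b) that returns the sum.", passes_add)]

results = [benchmark(m, tasks) for m in (good_model, bad_model)]
for r in sorted(results, key=lambda s: s.accuracy, reverse=True):
    print(f"{r.name}: accuracy={r.accuracy:.2f} cost=${r.total_cost:.3f}")
```

Keeping the three metrics in one record per model makes it easy to rank on accuracy first and then break ties on latency or cost. A real harness would also need sandboxed execution of generated code and token-based pricing.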