List of Flash News about OpenRouter
| Time | Details |
|---|---|
|
2025-11-22 23:54 |
Andrej Karpathy unveils llm-council open-source multi-LLM ensemble via OpenRouter; GPT-5.1 ranked highest by peers, Claude lowest
According to @karpathy, he released an open-source llm-council web app that dispatches each user query to multiple models via OpenRouter, lets models review and rank anonymized responses, and then a Chairman LLM produces the final answer, detailing a concrete multi-LLM ensemble workflow. Source: @karpathy on X. According to @karpathy, the current council includes openai/gpt-5.1, google/gemini-3-pro-preview, anthropic/claude-sonnet-4.5, and x-ai/grok-4, providing side-by-side outputs and rankings across OpenAI, Google, Anthropic, and xAI model families. Source: @karpathy on X. According to @karpathy, cross-model evaluation frequently selects another model’s response as superior, highlighting a practical peer-review method for model selection and ranking. Source: @karpathy on X. According to @karpathy, in his reading tests the models consistently praised GPT-5.1 as the best and most insightful and consistently selected Claude as the worst, with Gemini 3 Pro and Grok-4 in between, while his qualitative take found GPT-5.1 wordy, Gemini 3 more condensed, and Claude too terse. Source: @karpathy on X. According to @karpathy, the code is publicly available for others to try on GitHub under the llm-council repository. Source: @karpathy on X and @karpathy on GitHub. According to @karpathy, the post does not mention cryptocurrencies, tokens, or blockchains, and provides no direct crypto market claims. Source: @karpathy on X. |