List of Flash News about routing
| Time | Details |
|---|---|
|
2026-06-05 12:22 |
Gensyn: IR3DE Router Hits 98.4% LLM Accuracy
Gensyn researchers unveil IR3DE router that picks optimal expert LLMs via linear algebra, delivering 98.4% reasoning performance without neural training. |
|
2026-02-10 15:52 |
Google Cloud Vertex AI Achieves 35% Latency Reduction with GKE Inference Gateway
According to Richard Seroter, the introduction of load-aware and context-aware routing in the GKE Inference Gateway has enabled Google Cloud's Vertex AI, which operates on GKE, to achieve a 35% reduction in latency. This improvement significantly enhances performance compared to standard load balancing, offering users faster and more efficient AI inference capabilities. |