Search Results for "llm"
NVIDIA Megatron Boosts LLM Training With Muon Optimizer
NVIDIA integrates Muon and other advanced optimizers into Megatron to enhance large-scale LLM training, with throughput near parity with AdamW.
LLM Agents Help Win Kaggle Competition with 600K Lines of Code
Generative AI agents produced 600,000 lines of code and ran 850 experiments to secure first place in a Kaggle competition. Here's how they did it.
Ray Serve Introduces Scalable Multi-Agent AI Architecture
Ray Serve leverages MCP and A2A protocols for scalable AI agents, solving production bottlenecks in LLM and multi-agent deployments.
Anyscale Launches LLM Post-Training Tool to Simplify Fine-Tuning
Anyscale unveils a post-training skill for large language models, streamlining methodology selection, GPU planning, and configuration generation.
Claude Opus Aims to Revolutionize Source Code Security with LLMs
Anthropic's Claude Opus 4.7 showcases its ability to find and patch source code vulnerabilities, positioning it as a powerful tool for secure software development.
Ray Serve LLM Enhances Distributed Inference with 24x Boost
Ray Serve LLM achieves 24x higher throughput with new direct streaming, HAProxy integration, and vLLM backend upgrades, pushing LLM inference forward.
ParallelKernelBench Exposes LLM Weakness in Multi-GPU Kernels
ParallelKernelBench shows GPT-5.5 and peers struggle with multi-GPU CUDA kernels, solving fewer than 31% of tasks. Here's why it matters.
Is Conversational Diagnostic AI like AMIE Feasible?
AMIE, an AI system developed by Google Research and DeepMind, demonstrates superior diagnostic accuracy compared to human physicians in a groundbreaking study, signaling a new era in AI-driven healthcare.
Deceptive AI: The Hidden Dangers of LLM Backdoors
Recent studies reveal that large language models can deceive, challenging AI safety training methods. They can hide dangerous behaviors, creating a false impression of safety and necessitating the development of robust protocols.
StreamingLLM Breakthrough: Handling Over 4 Million Tokens with 22.2x Inference Speedup
SwiftInfer, leveraging StreamingLLM's groundbreaking technology, significantly enhances large language model inference, enabling efficient handling of over 4 million tokens in multi-round conversations with a 22.2x speedup.
Equating Cryptocurrency Solely with Illegal Conduct Lacks Understanding
Cryptocurrency has its benefits, but many consumers remain unaware of them because of security concerns and uncertainty about how the technology functions. In a recent interview, Coleman Watson, managing partner at Watson LLP, noted that while many people are interested in using cryptocurrency, lack of understanding remains a major hurdle.
Has Judgement Finally Come for 2017 ICOs? Class Action Lawsuits Name Binance, BitMEX and Block.One Among Host of Crypto Defendants
Crypto giants Binance and BitMEX, along with executives CZ and Arthur Hayes, are named among the defendants in 11 class action suits alleging violations of securities laws.