As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is...
LLM Model Pruning and Knowledge Distillation with NVIDIA NeMo Framework
Model pruning and knowledge distillation are powerful, cost-effective strategies for obtaining smaller language models from an initial larger sibling. ...
Post Title
Singleton is a design pattern that is used as a container which holds values that can be globally accessible across the whole project. Singletons are...
Spotlight: BRLi and Toulouse INP Develop AI-Based Flood Models Using NVIDIA Modulus
Floods pose major threats to 1.5 billion people, making them the most common cause of major natural disasters. They cause up to $25 billion in...
Featured Energy Sessions at NVIDIA GTC 2025
Learn from energy leaders using HPC and AI to boost exploration, production, and fuel delivery, while enhancing power grid reliability and resiliency.
