Phi-3-Medium accelerates research with logic-rich features in both short (4K) and long (128K) context. Phi-3-Medium accelerates research with logic-rich features in both short (4K) and...
StarCoder2-15B: A Powerful LLM for Code Generation, Summarization, and Documentation
Trained on 600+ programming languages, StarCoder2-15B is now packaged as a NIM inference microservice available for free from the NVIDIA API catalog. Trained on 600+...
How Cutting-Edge Computer Chips are Speeding Up the AI Revolution
Featured in Nature, this post delves into how GPUs and other advanced technologies are meeting the computational challenges posed by AI. Featured in Nature, this...
Google’s New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog
Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice. Gemma 2, the next...
Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog
Brev.dev is making it easier to develop AI solutions by leveraging software libraries, frameworks, and Jupyter Notebooks on the NVIDIA NGC catalog. You can use......