Google’s New Gemma 2 Model Now Optimized and Available on NVIDIA API Catalog

Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.

Gemma 2, the next generation of Google Gemma models, is now optimized with TensorRT-LLM and packaged as NVIDIA NIM inference microservice.

Source

Leave a Reply

Your email address will not be published.

Previous post Deploy GPU-Optimized AI Software with One Click Using Brev.dev and NVIDIA NGC Catalog
Next post GTA Online adds a qualify-of-life feature players have wanted for years then upsets everyone by paywalling it: ‘One of the slimiest things they’ve done in a while’