Devices – Page 145 – Prefer systems

Posted on December 18, 2024

Devices

A Guide to Retrieval-Augmented Generation for AEC

Large language models (LLMs) are rapidly changing the business landscape, offering new capabilities in natural language processing (NLP), content generation,... Large language models (LLMs) are...

0 Comments

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Posted on December 18, 2024

Devices

NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference

Recurrent drafting (referred as ReDrafter) is a novel speculative decoding technique developed and open-sourced by Apple for large language model (LLM)... Recurrent drafting (referred as...

0 Comments

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Posted on December 18, 2024

Devices

Greyhawkery Comics: Graz’zt Show #1

Season's Greetings Greyhawkers! Today's comic is a surprise present for my readers. I used to do annual Needfest comics around this time of year (those...

0 Comments

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Posted on December 18, 2024

Devices

Data-Efficient Knowledge Distillation for Supervised Fine-Tuning with NVIDIA NeMo-Aligner

Knowledge distillation is an approach for transferring the knowledge of a much larger teacher model to a smaller student model, ideally yielding a compact,... Knowledge...

0 Comments

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Posted on December 17, 2024

Devices

Efficient Ray Tracing with NVIDIA OptiX Shader Binding Table Optimization

NVIDIA OptiX is the API for GPU-accelerated ray tracing with CUDA, and is often used to render scenes containing a wide variety of objects and...

0 Comments

Category: Devices

A Guide to Retrieval-Augmented Generation for AEC

More like this

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

NVIDIA TensorRT-LLM Now Supports Recurrent Drafting for Optimizing LLM Inference

More like this

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Greyhawkery Comics: Graz’zt Show #1

More like this

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Data-Efficient Knowledge Distillation for Supervised Fine-Tuning with NVIDIA NeMo-Aligner

More like this

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile

Efficient Ray Tracing with NVIDIA OptiX Shader Binding Table Optimization

More like this

Streamlining CUB with a Single-Call API

Greyhawkery Comics: Under #24

How to Train an AI Agent for Command-Line Tasks with Synthetic Data and Reinforcement Learning

How to Write High-Performance Matrix Multiply in NVIDIA CUDA Tile