GPU Memory - Search News

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

Nvidia BlueField-4 STX adds a context memory layer to storage to close the agentic AI throughput gap

Nvidia's BlueField-4 STX reference architecture inserts a dedicated context memory layer between GPUs and traditional storage, claiming 5x token throughput and 4x energy efficiency for agentic AI ...

Tom's Hardware on MSN

Nvidia demonstrates Rubin Ultra tray, the world's first AI GPU with 1TB of HBM4E memory

Nvidia shows off its next-generation Kyber rack-scale solution to be powered by Rubin Ultra GPUs with four compute chiplets and 1 TB of HBM4E memory per package.

New memory architecture targets AI inference bottlenecks

Lightbits Labs Ltd. today is introducing a new architecture aimed at addressing one of the most stubborn bottlenecks in large-scale artificial intelligence inference: the growing mismatch between the ...

TweakTown

NVIDIA launches single-slot RTX PRO 4500 Blackwell Server Edition GPU

NVIDIA has launched the new compact single-slot RTX PRO 4500 Blackwell Server Edition with 32GB of GDDR7 memory for servers ...

Phison Rescales Local AI Inferencing with Flash Memory Expansion

Phison Electronics (8299TT), a global leader in NAND flash controllers and storage solutions, today announced its GTC ...

KIOXIA Announces New SSD Model Optimized for AI GPU-Initiated Workloads

"Kioxia fully supports the NVIDIA Storage-Next initiative and will deliver purpose-built SSDs to effectively address the need for GPU-accessible memory," said Makoto Hamada, Senior Director of the SSD ...

How-To Geek on MSN

5 hidden PC settings you need to change before giving up on your aging GPU

Here are a few brilliant ways to squeeze more frame rates out of your GPU instead of upgrading ...

Ars Technica

Dedicated GPU memory for ARM Linux/Meson/AMLogic

I'm hoping there are a few kernel hackers around here who might have some insights into this... I have a long standing habit of using "gutless wonder" ARM boards for desktop. Some work well, some work ...

TweakTown

Innosilicon Type-B graphics card: Chinese GPU has 32GB GDDR6X memory

Innosilicon has officially launched its new graphics cards based on its in-house Fantasy One GPU, with 4 new graphics cards based on the Fantasy One GPU launched -- including a multi-GPU design -- in ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results