Models Compression - Search News

How CLIKA’s automated hardware-aware AI compression toolkit efficiently enables scalable deployment of AI on any target hardware

With the democratisation of AI and greater access to open-source AI models, enterprises today have made AI adoption a mission-critical imperative. According to Menlo Ventures’ report, “2024: The State of ...

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

8h on MSN

Multiverse Computing pushes its compressed AI models into the mainstream

After compressing models from major AI labs including OpenAI, Meta, DeepSeek and Mistral AI, Multiverse Computing has ...

Ars Technica

AI language models can exceed PNG and FLAC in lossless compression, says study

Effective compression is about finding patterns to make data smaller without losing information. When an algorithm or model can accurately guess the next piece of data in a sequence, it shows it’s ...

The Next Web

Researchers claim this AI model achieves better compression rates than PNGs

Image compression has been one of the constantly evolving challenges in computer science. Programmers and researchers are always trying to improve current standards or create new ones to get better ...

insideHPC

Multiverse Computing Raises $215M for LLM Compression

San Sebastian, Spain – June 12, 2025: Multiverse Computing has developed CompactifAI, a compression technology capable of reducing the size of LLMs (Large Language Models) by up to 95 percent while ...
