Quantization Model Compression

Here are 3 critical LLM compression strategies to supercharge AI performance

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More In today’s fast-paced digital landscape, businesses relying on AI face ...

InfoWorld

What is model quantization? Smaller, faster LLMs

Reducing the precision of model weights can make deep neural networks run faster in less GPU memory, while preserving model accuracy. If ever there were a salient example of a counter-intuitive ...

TMCnet

Nota AI Signs EXAONE Commercialization Partnership with LG AI Research to Accelerate LLM Adoption Through AI Model Compression Technology

SEOUL, South Korea, Dec. 10, 2025 /PRNewswire/ -- Nota AI, a company specializing in AI model compression and optimization ...

InfoQ

Show inaccessible results

Here are 3 critical LLM compression strategies to supercharge AI performance

What is model quantization? Smaller, faster LLMs

Nota AI Signs EXAONE Commercialization Partnership with LG AI Research to Accelerate LLM Adoption Through AI Model Compression Technology

Google Releases Quantization Aware Training for TensorFlow Model Optimization

Neural Network Model Quantization On Mobile

Scaling Small Language Models (SLMs) For Edge Devices: A New Frontier In AI

Medical Image Compression and Vector Quantization