Media Summary: In this video, we discuss the fundamentals of model In this video I will introduce and explain This video explores DeepSeek R1, how distilled versions and

Quantization In Deep Learning Deep - Detailed Analysis & Overview

In this video, we discuss the fundamentals of model In this video I will introduce and explain This video explores DeepSeek R1, how distilled versions and Try Voice Writer - speak your thoughts and let AI handle the grammar: Four techniques to optimize the speed ... The power consumption of data-center is doubling every year and edge devices like Internet of Things (IoT) are growing rapidly. Run massive AI models on your laptop! Learn the secrets of LLM

Zhaowei Cai; Xiaodong He; Jian Sun; Nuno Vasconcelos The problem of Dr. Daniel Soudry, Technion "Hardware for AI Track" AI Week Yuval Ne'eman Workshop for Science, Technology and Security Tel ... This talk was delivered at the April 2023 Generative AI meetup by Amod Malviya, co-founder at Udaan. Details of the talk and link ...

Photo Gallery

Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)
What is LLM quantization?
How LLMs survive in low precision | Quantization Fundamentals
Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training
DeepSeek R1: Distilled & Quantized Models Explained
Quantization vs Pruning vs Distillation: Optimizing NNs for Inference
Downsizing Neural Networks by Quantization - Introduction to Deep Learning
Quantization of Deep Learning Solution for Efficient Inference | Kim Hee, UMM [PyData Südwest]
Optimize Your AI - Quantization Explained
Quantization in Deep Learning (LLMs)
EfficientML.ai Lecture 5 - Quantization (Part I) (MIT 6.5940, Fall 2023)
Quantizing LLMs - How & Why (8-Bit, 4-Bit, GGUF & More)
Sponsored
Sponsored
View Detailed Profile
Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)

Quantization in deep learning | Deep Learning Tutorial 49 (Tensorflow, Keras & Python)

Are you planning to deploy a

What is LLM quantization?

What is LLM quantization?

In this video we define the basics of

Sponsored
How LLMs survive in low precision | Quantization Fundamentals

How LLMs survive in low precision | Quantization Fundamentals

In this video, we discuss the fundamentals of model

Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training

Quantization explained with PyTorch - Post-Training Quantization, Quantization-Aware Training

In this video I will introduce and explain

DeepSeek R1: Distilled & Quantized Models Explained

DeepSeek R1: Distilled & Quantized Models Explained

This video explores DeepSeek R1, how distilled versions and

Sponsored
Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Try Voice Writer - speak your thoughts and let AI handle the grammar: https://voicewriter.io Four techniques to optimize the speed ...

Downsizing Neural Networks by Quantization - Introduction to Deep Learning

Downsizing Neural Networks by Quantization - Introduction to Deep Learning

This video explains the

Quantization of Deep Learning Solution for Efficient Inference | Kim Hee, UMM [PyData Südwest]

Quantization of Deep Learning Solution for Efficient Inference | Kim Hee, UMM [PyData Südwest]

The power consumption of data-center is doubling every year and edge devices like Internet of Things (IoT) are growing rapidly.

Optimize Your AI - Quantization Explained

Optimize Your AI - Quantization Explained

Run massive AI models on your laptop! Learn the secrets of LLM

Quantization in Deep Learning (LLMs)

Quantization in Deep Learning (LLMs)

This video is about

EfficientML.ai Lecture 5 - Quantization (Part I) (MIT 6.5940, Fall 2023)

EfficientML.ai Lecture 5 - Quantization (Part I) (MIT 6.5940, Fall 2023)

EfficientML.ai Lecture 5 -

Quantizing LLMs - How & Why (8-Bit, 4-Bit, GGUF & More)

Quantizing LLMs - How & Why (8-Bit, 4-Bit, GGUF & More)

Quantizing

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

[Full Workshop] Reinforcement Learning, Kernels, Reasoning, Quantization & Agents — Daniel Han

Why is Reinforcement

tinyML Talks: A Practical Guide to Neural Network Quantization

tinyML Talks: A Practical Guide to Neural Network Quantization

"A Practical Guide to Neural Network

Quantization Aware Training (QAT) With a Custom DataLoader: Beginner's Tutorial to Training Loops

Quantization Aware Training (QAT) With a Custom DataLoader: Beginner's Tutorial to Training Loops

If you need help with anything

Quantizing a Deep Learning Network in MATLAB

Quantizing a Deep Learning Network in MATLAB

In this video, we demonstrate the

New course with Hugging Face: Quantization in Depth 🤗

New course with Hugging Face: Quantization in Depth 🤗

Enroll now: https://bit.ly/44nXDNa We're excited to introduce

Deep Learning With Low Precision by Half-Wave Gaussian Quantization | Spotlight 4-1A

Deep Learning With Low Precision by Half-Wave Gaussian Quantization | Spotlight 4-1A

Zhaowei Cai; Xiaodong He; Jian Sun; Nuno Vasconcelos The problem of

Resource-Efficient Quantized Deep Learning

Resource-Efficient Quantized Deep Learning

Dr. Daniel Soudry, Technion "Hardware for AI Track" AI Week Yuval Ne'eman Workshop for Science, Technology and Security Tel ...

Introduction to Quantization in Deep Neural Networks

Introduction to Quantization in Deep Neural Networks

This talk was delivered at the April 2023 Generative AI meetup by Amod Malviya, co-founder at Udaan. Details of the talk and link ...

Related Video Content

Quantization (signal processing) - Wikipedia information

In mathematics and digital signal processing, quantization is the process of mapping input values from a large set...

What is Quantization - GeeksforGeeks information

Nov 6, 2025 · Quantization is a model optimization technique that reduces the precision of numerical values such as...

Model Quantization: Concepts, Methods, and Why It Matters information

Nov 24, 2025 · Quantization reduces the precision of model parameters and activations (for example, from FP32/FP16 to...

What Is Quantization? | How It Works & Applications information

Quantization is the process of mapping continuous infinite values to a smaller set of discrete finite values. In the...

What is quantization? - IBM information

Quantization is the process of reducing the precision of a digital signal, typically from a higher-precision format...