Optimizing Model Selection for Compound AI Systems - Detailed Analysis & Overview

Optimizing Model Selection for Compound AI Systems - how should one decide which LLM to use?

Microsoft Research
arXiv: https://arxiv.org/pdf/2502.14815
Code: https://github.com/LLMSelector/llmselector
Authors: Lingjiao Chen, ...

Optimize Your AI Models

Dive deep into the world of Large Language

Large Language Model Selection Masterclass - Nov 2025

Check out StormMCP for your next project! https://stormmcp.ai/ This video was inspired by this great blog: ...

Using Foundry's Model Router to Simplify Optimal AI Model Selection

A walkthrough of Microsoft Foundry's

Optimize Your AI - Quantization Explained

Run massive AI

Deep Dive into Inference Optimization for LLMs with Philip Kiely

Today we have Philip Kiely from Baseten on the show. Baseten is a Series B startup focused on providing infrastructure for AI ...

Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou

LLM inference is not your normal deep learning

Deep Dive: Optimizing LLM inference

Open-source LLMs are great for conversational applications, but they can be difficult to scale in production and deliver latency ...

Prompt Optimization Changes Which LLM is Best

I tested Gemma 3 4B vs Ministral 8B on an intent classification task with the same prompt. Gemma 3 4B won. Then I

Quantization vs Pruning vs Distillation: Optimizing NNs for Inference

Try Voice Writer - speak your thoughts and let AI handle the grammar: https://voicewriter.io Four techniques to

LLM Cost Optimization Guide — Choose the Right AI Model for Every Task

Learn how to choose the right LLM

Embedding model evaluation & selection guide

Selecting

Apply LLMSelector to your AI Agents (Tutorial & Code)

LLM Selector for Multi AI Agents (Berkeley). All rights w/ authors: "

Model Selection and Capacities - AI@ND Quick Tip - Episode 21

Learn about the different types of

Headroom: A Context Optimization Layer for LLM Applications - Tejas Chopra, Netflix, Inc.

Join us at the premier vendor-neutral open source conference, where developers and technologists come together to collaborate, ...

Context Optimization vs LLM Optimization: Choosing the Right Approach

Download the AI

Model Optimization for Quants

Introduction to likelihood functions, maximum likelihood estimation, and some interesting

10 Tips for Improving the Accuracy of your Machine Learning Models

This video provides viewers with 10 practical tips for improving the accuracy of their machine learning

From Prompt Engineering to Prompt Optimization in Production LLM Systems

In this AI Tech Experts Webinar, Julia May (ML Engineer), explains why prompts should be treated as hyperparameters and how ...

LLM Compression Explained: Build Faster, Efficient AI Models

Ready to become a certified watsonx AI Assistant Engineer? Register now and use code IBMTechYT20 for 20% off of your exam ...
