Andrej Karpathy

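Note: The channel and video information shown on this site is retrieved and displayed using the official YouTube API. Videos play in the official YouTube player, so all views and revenue go to the original videos.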

Timetable

Video timetable

Number of videos: 17

Intro into the growing LLM ecosystem - How I use LLMs

Intro into the growing LLM ecosystem

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:00:00 - 00:02:54
The audio modes demos at  pm were simply amazing! - How I use LLMs

The audio modes demos at pm were simply amazing!

How I use LLMs
2025年02月28日  @hellovaibhav 様 
00:01:30 - 02:11:12
ChatGPT interaction under the hood - How I use LLMs

ChatGPT interaction under the hood

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:02:54 - 00:13:12
Basic LLM interactions examples - How I use LLMs

Basic LLM interactions examples

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:13:12 - 00:18:03
Exactly, I have observed this so many times!! Why do these chat platforms not have an option to branch out to a new chat (for exploring multiple ideas or something) from a particular answer point? Are there any technical challenges? - How I use LLMs

Exactly, I have observed this so many times!! Why do these chat platforms not have an option to branch out to a new chat (for exploring multiple ideas or something) from a particular answer point? Are there any technical challenges?

How I use LLMs
2025年02月28日  @devalmodi141 様 
00:16:55 - 02:11:12
new chat --->for new topic - How I use LLMs

new chat --->for new topic

How I use LLMs
2025年02月28日  @yog_g5001 様 
00:17:31 - 01:12:25
Be aware of the model you're using, pricing tiers - How I use LLMs

Be aware of the model you're using, pricing tiers

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:18:03 - 00:22:54
Personal note: - How I use LLMs

Personal note:

How I use LLMs
2025年02月28日  @seriyanto 様 
00:21:00 - 00:37:00
Thinking models and when to use them - How I use LLMs

Thinking models and when to use them

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:22:54 - 00:31:00
Tool use: internet search - How I use LLMs

Tool use: internet search

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:31:00 - 00:42:04
Is one Search one token in the context window? - How I use LLMs

Is one Search one token in the context window?

How I use LLMs
2025年02月28日  @superfreiheit1 様 
00:34:28 - 02:11:12
- Casual sneeze making the video even more fun - How I use LLMs

- Casual sneeze making the video even more fun

How I use LLMs
2025年02月28日  @siddhantsahu92 様 
00:36:34 - 02:11:12
Bless you, Andrej-! - How I use LLMs

Bless you, Andrej-!

How I use LLMs
2025年02月28日  @kyung-hoonkim5963 様 
00:36:35 - 02:11:12
search - How I use LLMs

search

How I use LLMs
2025年02月28日  @seriyanto 様 
00:37:00 - 01:22:00
We Vietnamese always cherish exceptional talents like you, Andrej. - How I use LLMs

We Vietnamese always cherish exceptional talents like you, Andrej.

How I use LLMs
2025年02月28日  @AllAboutFacts935 様 
00:41:26 - 02:11:12
Tool use: deep research - How I use LLMs

Tool use: deep research

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:42:04 - 00:50:57
ChatGPT was not the first to offer Deep Research. Gemini made Deep Research available on December 11, 2024. ChatGPT added theirs February 2, 2025. - How I use LLMs

ChatGPT was not the first to offer Deep Research. Gemini made Deep Research available on December 11, 2024. ChatGPT added theirs February 2, 2025.

How I use LLMs
2025年02月28日  @ToddBeaupre 様 
00:42:51 - 02:11:12
You missed Gemini Deep Research. That’s the original one. - How I use LLMs

You missed Gemini Deep Research. That’s the original one.

How I use LLMs
2025年02月28日  @ParthKohli 様 
00:45:15 - 02:11:12
What we would really need is the ability to pass the response, with all the provided references, to another thinking + internet-access AI system with the task "Does this article content match the provided references?". I'm pretty sure that different AI models do not accidentally hallucinate badly enough to fail this kind of verification task most of the time. - How I use LLMs

What we would really need is the ability to pass the response, with all the provided references, to another thinking + internet-access AI system with the task "Does this article content match the provided references?". I'm pretty sure that different AI models do not accidentally hallucinate badly enough to fail this kind of verification task most of the time.

How I use LLMs
2025年02月28日  @MikkoRantalainen 様 
00:48:20 - 02:11:12
File uploads, adding documents to context - How I use LLMs

File uploads, adding documents to context

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:50:57 - 00:59:00
Re , accessing .epub in context would be a win. Imagine clicking a Table of Contents chapter inside of Cursor or the ChatGPT platform and having it ready for the selected LLM. 📖 🙂 - How I use LLMs

Re , accessing .epub in context would be a win. Imagine clicking a Table of Contents chapter inside of Cursor or the ChatGPT platform and having it ready for the selected LLM. 📖 🙂

How I use LLMs
2025年02月28日  @Emm-mq5eg 様 
00:58:27 - 02:11:12
I think Copilot in Edge allows you to ask questions in a task pane and also supports marking, as I remember. Thanks for your insights! - How I use LLMs

I think Copilot in Edge allows you to ask questions in a task pane and also supports marking, as I remember. Thanks for your insights!

How I use LLMs
2025年02月28日  @mrlucasx282 様 
00:58:30 - 02:11:12
You need the Highlight app. It literally takes into context whatever document you have open on your system, so no copying is needed. Very smooth. - How I use LLMs

You need the Highlight app. It literally takes into context whatever document you have open on your system, so no copying is needed. Very smooth.

How I use LLMs
2025年02月28日  @bwknylfcfjjwij2 様 
00:58:40 - 02:11:12
I suggest using kortex for large numbers of PDFs or books that can be used with an LLM. I am not sure about each LLM's limit in terms of document upload size (MB) and how it is connected with token input limits; I would like to know more about this. - How I use LLMs

I suggest using kortex for large numbers of PDFs or books that can be used with an LLM. I am not sure about each LLM's limit in terms of document upload size (MB) and how it is connected with token input limits; I would like to know more about this.

How I use LLMs
2025年02月28日  @chinapulsee 様 
00:58:50 - 02:11:12
You could just have the ChatGPT floating window open while you read a book in full-screen. That way, you don’t have to keep switching between windows. 👍🏻 - How I use LLMs

You could just have the ChatGPT floating window open while you read a book in full-screen. That way, you don’t have to keep switching between windows. 👍🏻

How I use LLMs
2025年02月28日  @DmitriTakeda 様 
00:58:50 - 02:11:12
"don't read books alone" - How I use LLMs

"don't read books alone"

How I use LLMs
2025年02月28日  @kenwarner 様 
00:58:58 - 02:11:12
Tool use: python interpreter, messiness of the ecosystem - How I use LLMs

Tool use: python interpreter, messiness of the ecosystem

How I use LLMs
2025年02月28日  @thecandlemanind 様 
00:59:00 - 01:04:35
Gemini's prediction is not actually close. It is lower by an order of 3. But another amazing video by Andrej ! Thank you :) - How I use LLMs

Gemini's prediction is not actually close. It is lower by an order of 3. But another amazing video by Andrej ! Thank you :)

How I use LLMs
2025年02月28日  @adityavipradas3252 様 
01:04:08 - 02:11:12
ChatGPT Advanced Data Analysis, figures, plots - How I use LLMs

ChatGPT Advanced Data Analysis, figures, plots

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:04:35 - 01:09:00
Keep in mind, if you are reading this: just because it uses an internet source doesn't mean it won't hallucinate content it thinks it found in the source. - How I use LLMs

Keep in mind, if you are reading this: just because it uses an internet source doesn't mean it won't hallucinate content it thinks it found in the source.

How I use LLMs
2025年02月28日  @Rkcuddles 様 
01:05:23 - 02:11:12
0.1 is a heuristic to avoid 0, which may behave badly? - How I use LLMs

0.1 is a heuristic to avoid 0, which may behave badly?

How I use LLMs
2025年02月28日  @joebowbeer 様 
01:05:53 - 02:11:12
Claude Artifacts, apps, diagrams - How I use LLMs

Claude Artifacts, apps, diagrams

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:09:00 - 01:14:02
This is pure gold. Andrej is the best teacher on all things AI. He teaches with such clarity and simplicity that the knowledge just sticks. I just wish that the part about coding between 01:09:18 and 01:22:00 (1) showed a disclaimer when there are high vulnerabilities in node dependencies (01:19:23) and (2) discussed the legal aspects of using code generated by LLMs or LLM-powered tools like Cursor, Windsurf, GitHub Copilot, etc. I really wish such videos talked about this crucial aspect; otherwise most viewers will get the sense that software development is as simple as prompting LLMs for code and that they can use the generated code as is. There are many cases where such LLMs spit out copyrighted code or code under licenses, and using it without attribution is risky. - How I use LLMs

This is pure gold. Andrej is the best teacher on all things AI. He teaches with such clarity and simplicity that the knowledge just sticks. I just wish that the part about coding between 01:09:18 and 01:22:00 (1) showed a disclaimer when there are high vulnerabilities in node dependencies (01:19:23) and (2) discussed the legal aspects of using code generated by LLMs or LLM-powered tools like Cursor, Windsurf, GitHub Copilot, etc. I really wish such videos talked about this crucial aspect; otherwise most viewers will get the sense that software development is as simple as prompting LLMs for code and that they can use the generated code as is. There are many cases where such LLMs spit out copyrighted code or code under licenses, and using it without attribution is risky.

How I use LLMs
2025年02月28日  @AIUser1431 様 
01:09:18 - 01:22:00
---> conceptual diagram - How I use LLMs

---> conceptual diagram

How I use LLMs
2025年02月28日  @yog_g5001 様 
01:12:25 - 02:11:12
Love the conceptual diagram idea. Very very useful - How I use LLMs

Love the conceptual diagram idea. Very very useful

How I use LLMs
2025年02月28日  @sumitsp01 様 
01:14:00 - 02:11:12
Cursor: Composer, writing code - How I use LLMs

Cursor: Composer, writing code

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:14:02 - 01:22:28
) - How I use LLMs

)

How I use LLMs
2025年02月28日  @AIUser1431 様 
01:19:23 - 02:11:12
The confetti moment got me excited too. Amazing video, Andrej, thank you! - How I use LLMs

The confetti moment got me excited too. Amazing video, Andrej, thank you!

How I use LLMs
2025年02月28日  @damirmusicverse 様 
01:20:35 - 02:11:12
showed - How I use LLMs

showed

How I use LLMs
2025年02月28日  @AIUser1431 様 
01:22:00 - 01:19:23
talk to llms - How I use LLMs

talk to llms

How I use LLMs
2025年02月28日  @seriyanto 様 
01:22:00 - 02:11:12
Audio (Speech) Input/Output - How I use LLMs

Audio (Speech) Input/Output

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:22:28 - 01:27:37
What a gigachad. And yet for some reason he doesn't seem to be aware that his Mac comes with a Dictation feature. Maybe he has an older version of macOS. Maybe I'm missing something, but this section of the video makes no sense to me. But again, what an amazing video by a generous genius! - How I use LLMs

What a gigachad. And yet for some reason he doesn't seem to be aware that his Mac comes with a Dictation feature. Maybe he has an older version of macOS. Maybe I'm missing something, but this section of the video makes no sense to me. But again, what an amazing video by a generous genius!

How I use LLMs
2025年02月28日  @hayechan7927 様 
01:25:10 - 02:11:12
The native ChatGPT app for macOS does have the mic icon. - How I use LLMs

The native ChatGPT app for macOS does have the mic icon.

How I use LLMs
2025年02月28日  @DmitriTakeda 様 
01:25:20 - 02:11:12
Why don't you use the Mac dictation feature? - How I use LLMs

Why don't you use the Mac dictation feature?

How I use LLMs
2025年02月28日  @xnivaxhzne 様 
01:25:26 - 02:11:12
Advanced Voice Mode aka true audio inside the model - How I use LLMs

Advanced Voice Mode aka true audio inside the model

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:27:37 - 01:37:09
Kind of how Shazam works under the hood: a graph is made from the audio spectrogram, the peak points in the graph are identified with background noise minimized, those peak points are converted to audio fingerprints, and finally it searches its database of millions of songs based on the fingerprint. - How I use LLMs

Kind of how Shazam works under the hood: a graph is made from the audio spectrogram, the peak points in the graph are identified with background noise minimized, those peak points are converted to audio fingerprints, and finally it searches its database of millions of songs based on the fingerprint.

How I use LLMs
2025年02月28日  @satyams-yt 様 
01:28:20 - 02:11:12
Your reaction at  killed me lmao - How I use LLMs

Your reaction at killed me lmao

How I use LLMs
2025年02月28日  @21tired88 様 
01:35:14 - 02:11:12
NotebookLM, podcast generation - How I use LLMs

NotebookLM, podcast generation

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:37:09 - 01:40:20
Image input, OCR - How I use LLMs

Image input, OCR

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:40:20 - 01:47:02
woke up in the middle of the night to find that I had been listening to this all night. If I magically know a bunch of shit about LLMs….im going to be shook - How I use LLMs

woke up in the middle of the night to find that I had been listening to this all night. If I magically know a bunch of shit about LLMs….im going to be shook

How I use LLMs
2025年02月28日  @brianaglenn3706 様 
01:44:44 - 02:11:12
For those interested, the math problem at  is not that tricky 🙃. - How I use LLMs

For those interested, the math problem at is not that tricky 🙃.

How I use LLMs
2025年02月28日  @AlessandroTrasatti 様 
01:45:00 - 02:11:12
No Andrej, you failed to trick me 😎😅 - How I use LLMs

No Andrej, you failed to trick me 😎😅

How I use LLMs
2025年02月28日  @VishwajeetVKale 様 
01:45:28 - 02:11:12
Image output, DALL-E, Ideogram, etc. - How I use LLMs

Image output, DALL-E, Ideogram, etc.

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:47:02 - 01:49:14
Video input, point and talk on app - How I use LLMs

Video input, point and talk on app

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:49:14 - 01:52:23
Video output, Sora, Veo 2, etc etc. - How I use LLMs

Video output, Sora, Veo 2, etc etc.

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:52:23 - 01:53:29
ChatGPT memory, custom instructions - How I use LLMs

ChatGPT memory, custom instructions

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:53:29 - 01:58:38
whenever you make a typo while typing, that should be a reminder to type with superwhisper instead - How I use LLMs

whenever you make a typo while typing, that should be a reminder to type with superwhisper instead

How I use LLMs
2025年02月28日  @kenwarner 様 
01:54:50 - 02:11:12
"I am Andrej Karpathy; Yes - the AI researcher" What an insane flex. Imagine confirming to an LLM that it's indeed talking to that guy you actually have training memory on. - How I use LLMs

"I am Andrej Karpathy; Yes - the AI researcher" What an insane flex. Imagine confirming to an LLM that it's indeed talking to that guy you actually have training memory on.

How I use LLMs
2025年02月28日  @Caliban314 様 
01:57:55 - 02:11:12
Custom GPTs - How I use LLMs

Custom GPTs

How I use LLMs
2025年02月28日  @thecandlemanind 様 
01:58:38 - 02:06:30
Can you add a reverse (round-trip) button to your translator? It's a great way to test the "stability" of a translation. - How I use LLMs

Can you add a reverse (round-trip) button to your translator? It's a great way to test the "stability" of a translation.

How I use LLMs
2025年02月28日  @joebowbeer 様 
02:02:19 - 02:11:12
agree 👍 going to use it - How I use LLMs

agree 👍 going to use it

How I use LLMs
2025年02月28日  @saisrikaranpulluri1472 様 
02:03:26 - 02:11:12
Summary - How I use LLMs

Summary

How I use LLMs
2025年02月28日  @thecandlemanind 様 
02:06:30 - 02:11:12
introduction - Deep Dive into LLMs like ChatGPT

introduction

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:00:00 - 00:01:00
- Introduction - Deep Dive into LLMs like ChatGPT

- Introduction

Deep Dive into LLMs like ChatGPT
2025年02月06日  @TimeStampBuddy 様 
00:00:01 - 00:01:04
pretraining data (internet) - Deep Dive into LLMs like ChatGPT

pretraining data (internet)

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:01:00 - 00:07:47
- LLM Pre-training - Deep Dive into LLMs like ChatGPT

- LLM Pre-training

Deep Dive into LLMs like ChatGPT
2025年02月06日  @TimeStampBuddy 様 
00:01:04 - 00:15:13
Around , you explain a really interesting notion: that models need to "think" before producing a complex response, because each layer in a neural network has finite computation. I feel like it's somewhat related to the notion of computational irreducibility Stephen Wolfram talks about. This is also why we humans need to spend some time thinking about complex issues before coming up with a good response. - Deep Dive into LLMs like ChatGPT

Around , you explain a really interesting notion: that models need to "think" before producing a complex response, because each layer in a neural network has finite computation. I feel like it's somewhat related to the notion of computational irreducibility Stephen Wolfram talks about. This is also why we humans need to spend some time thinking about complex issues before coming up with a good response.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @hashiromer7668 様 
00:01:49 - 03:31:24
But what if the ultimate joke about pelicans is actually 'the the the the the the,' but we simply don't have enough intelligence to understand it—just like an unusual move in the game of Go? XD - Deep Dive into LLMs like ChatGPT

But what if the ultimate joke about pelicans is actually 'the the the the the the,' but we simply don't have enough intelligence to understand it—just like an unusual move in the game of Go? XD

Deep Dive into LLMs like ChatGPT
2025年02月06日  @JanKowalski-dm5vr 様 
00:03:02 - 03:31:24
Wow, amazing hours, so much in a few hours. Saved me hours of research and inspired me for more. Great work, looking forward to more such interesting videos. - Deep Dive into LLMs like ChatGPT

Wow, amazing hours, so much in a few hours. Saved me hours of research and inspired me for more. Great work, looking forward to more such interesting videos.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @adarshkumar-jv4hz 様 
00:03:30 - 03:31:24
at   , talks about eliminating racist sites during corpus preprocessing.  This can introduce bias by eliminating candid discussion of, for example, average IQ test scores of racial subgroups. Claude refuses to answer this altogether, calling race a constructed concept. ChatGPT and Gemini, at the time I queried them, both produced valid, honest outputs, which aligned with the research.  Those of you so enamored with Claude are still trapped in Dario's echo-chamber. But society has moved on, now (2025). Will you? - Deep Dive into LLMs like ChatGPT

at , talks about eliminating racist sites during corpus preprocessing. This can introduce bias by eliminating candid discussion of, for example, average IQ test scores of racial subgroups. Claude refuses to answer this altogether, calling race a constructed concept. ChatGPT and Gemini, at the time I queried them, both produced valid, honest outputs, which aligned with the research. Those of you so enamored with Claude are still trapped in Dario's echo-chamber. But society has moved on, now (2025). Will you?

Deep Dive into LLMs like ChatGPT
2025年02月06日  @thomasgilson6206 様 
00:03:50 - 03:31:24
tokenization - Deep Dive into LLMs like ChatGPT

tokenization

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:07:47 - 00:14:27
neural network I/O - Deep Dive into LLMs like ChatGPT

neural network I/O

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:14:27 - 00:20:11
- Neural Net & Training - Deep Dive into LLMs like ChatGPT

- Neural Net & Training

Deep Dive into LLMs like ChatGPT
2025年02月06日  @TimeStampBuddy 様 
00:15:13 - 00:40:14
neural network internals - Deep Dive into LLMs like ChatGPT

neural network internals

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:20:11 - 00:26:01
inference - Deep Dive into LLMs like ChatGPT

inference

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:26:01 - 00:31:09
GPT-2: training and inference - Deep Dive into LLMs like ChatGPT

GPT-2: training and inference

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:31:09 - 00:42:52
Somewhere around , you said something about training 1 million tokens. Do you mean you train chunks of 1 million tokens to generate output or you train different tokens that add up to a million to generate output? - Deep Dive into LLMs like ChatGPT

Somewhere around , you said something about training 1 million tokens. Do you mean you train chunks of 1 million tokens to generate output or you train different tokens that add up to a million to generate output?

Deep Dive into LLMs like ChatGPT
2025年02月06日  @oteikwufrancis1108 様 
00:36:52 - 03:31:24
- GPUs & Model Costs - Deep Dive into LLMs like ChatGPT

- GPUs & Model Costs

Deep Dive into LLMs like ChatGPT
2025年02月06日  @TimeStampBuddy 様 
00:40:14 - 01:01:06
Llama 3.1 base model inference - Deep Dive into LLMs like ChatGPT

Llama 3.1 base model inference

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:42:52 - 00:59:23
Parallel universes!!! Just loving these analogies - awesome! - Deep Dive into LLMs like ChatGPT

Parallel universes!!! Just loving these analogies - awesome!

Deep Dive into LLMs like ChatGPT
2025年02月06日  @madhurkgpian 様 
00:55:22 - 03:31:24
pretraining to post-training - Deep Dive into LLMs like ChatGPT

pretraining to post-training

Deep Dive into LLMs like ChatGPT
2025年02月06日 
00:59:23 - 01:01:06
post-training data (conversations) - Deep Dive into LLMs like ChatGPT

post-training data (conversations)

Deep Dive into LLMs like ChatGPT
2025年02月06日 
01:01:06 - 01:20:32
- Build LLM Assistant - Deep Dive into LLMs like ChatGPT

- Build LLM Assistant

Deep Dive into LLMs like ChatGPT
2025年02月06日  @TimeStampBuddy 様 
01:01:06 - 02:07:30
"something went wrong" 😂 lol I love that he left this in there! - Deep Dive into LLMs like ChatGPT

"something went wrong" 😂 lol I love that he left this in there!

Deep Dive into LLMs like ChatGPT
2025年02月06日  @stephen-torrence 様 
01:18:46 - 03:31:24
his genuine laugh at ChatGPT error is so pure and spontaneous. How can someone not love Karpathy!!?? Sir you are pure Gold for humanity. - Deep Dive into LLMs like ChatGPT

his genuine laugh at ChatGPT error is so pure and spontaneous. How can someone not love Karpathy!!?? Sir you are pure Gold for humanity.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @MarcoDonadelli 様 
01:18:47 - 03:31:24
hallucinations, tool use, knowledge/working memory - Deep Dive into LLMs like ChatGPT

hallucinations, tool use, knowledge/working memory

Deep Dive into LLMs like ChatGPT
2025年02月06日 
01:20:32 - 01:41:46
The chapter about hallucinations was so insightful. Never heard about it as an issue of the dataset, i.e., it wasn't trained to say "I don't know" and how one can test the knowledge of the model. Thanks! - Deep Dive into LLMs like ChatGPT

The chapter about hallucinations was so insightful. Never heard about it as an issue of the dataset, i.e., it wasn't trained to say "I don't know" and how one can test the knowledge of the model. Thanks!

Deep Dive into LLMs like ChatGPT
2025年02月06日  @linusnox 様 
01:20:32 - 03:31:24
Observation: Approx. at , Andrej tests the question "Who is Orson Kovacs" using falcon-7b-instruct in the HF playground. The temperature is still 1.0, which makes the model respond with a balance between randomness and determinism. Although it makes up stuff that reads like hallucinations, it is worth testing with temperatures below or above 1.0 to understand how the factuality of the output varies. - Deep Dive into LLMs like ChatGPT

Observation: Approx. at , Andrej tests the question "Who is Orson Kovacs" using falcon-7b-instruct in the HF playground. The temperature is still 1.0, which makes the model respond with a balance between randomness and determinism. Although it makes up stuff that reads like hallucinations, it is worth testing with temperatures below or above 1.0 to understand how the factuality of the output varies.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @avinashrs6303 様 
01:23:50 - 03:31:24
you mentioned around  mark - the reason why you allow the model to say i don't know, instead of augmenting it with the new knowledge, is it because there's infinite amount of knowledge to learn so that it's virtually impossible to learn knowledge, and thus it's better to train it to know when to refuse? In other words, say if somehow the model CAN learn ALL the knowledge of the world, we won't need to train it to stop hallucinating? Thanks. - Deep Dive into LLMs like ChatGPT

you mentioned around mark - the reason why you allow the model to say i don't know, instead of augmenting it with the new knowledge, is it because there's infinite amount of knowledge to learn so that it's virtually impossible to learn knowledge, and thus it's better to train it to know when to refuse? In other words, say if somehow the model CAN learn ALL the knowledge of the world, we won't need to train it to stop hallucinating? Thanks.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @charlielaw48 様 
01:30:00 - 03:31:24
Thanks for the informative video! I have a question about training language models for tool use, specifically regarding the process you described around - Deep Dive into LLMs like ChatGPT

Thanks for the informative video! I have a question about training language models for tool use, specifically regarding the process you described around

Deep Dive into LLMs like ChatGPT
2025年02月06日  @marathonour 様 
01:33:38 - 03:31:24
knowledge of self - Deep Dive into LLMs like ChatGPT

knowledge of self

Deep Dive into LLMs like ChatGPT
2025年02月06日 
01:41:46 - 01:46:56
models need tokens to think - Deep Dive into LLMs like ChatGPT

models need tokens to think

Deep Dive into LLMs like ChatGPT
2025年02月06日 
01:46:56 - 02:01:11
@.  Question. I was just reading a paper recently (I believe it was from Anthropic, but sadly I can't find it now) that when they have looked at "thinking models", it appears the final answer is generally already determined well before the reasoning process begins. Then the model just fills in the chain of thought to get from the question to where it wants to go. Isn't this exactly what you said is not the correct way to handle this? Can you comment on why, if this is the "wrong" approach, it seems to be what modern models are doing? - Deep Dive into LLMs like ChatGPT

@. Question. I was just reading a paper recently (I believe it was from Anthropic, but sadly I can't find it now) that when they have looked at "thinking models", it appears the final answer is generally already determined well before the reasoning process begins. Then the model just fills in the chain of thought to get from the question to where it wants to go. Isn't this exactly what you said is not the correct way to handle this? Can you comment on why, if this is the "wrong" approach, it seems to be what modern models are doing?

Deep Dive into LLMs like ChatGPT
2025年02月06日  @BangkokBubonaglia 様 
01:52:00 - 03:31:24
@ that is elucidating! This is the first time I’ve heard of this concept. Thank you Andrej. - Deep Dive into LLMs like ChatGPT

@ that is elucidating! This is the first time I’ve heard of this concept. Thank you Andrej.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @seadude 様 
01:55:49 - 03:31:24
This teacher is very good at giving cute examples. Appreciate it, and I agree with it. - Deep Dive into LLMs like ChatGPT

This teacher is very good at giving cute examples. Appreciate it, and I agree with it.

Deep Dive into LLMs like ChatGPT
2025年02月06日  @saisrikaranpulluri1472 様 
01:55:50 - 03:31:24
tokenization revisited: models struggle with spelling - Deep Dive into LLMs like ChatGPT

tokenization revisited: models struggle with spelling

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:01:11 - 02:04:53
Wow, love this explanation of why these models fail at character-related and counting-related tasks - Deep Dive into LLMs like ChatGPT

Wow, love this explanation of why these models fail at character-related and counting-related tasks

Deep Dive into LLMs like ChatGPT
2025年02月06日  @sumitsp01 様 
02:04:04 - 03:31:24
jagged intelligence - Deep Dive into LLMs like ChatGPT

jagged intelligence

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:04:53 - 02:07:28
supervised finetuning to reinforcement learning - Deep Dive into LLMs like ChatGPT

supervised finetuning to reinforcement learning

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:07:28 - 02:14:42
- Model Training in Practice - Deep Dive into LLMs like ChatGPT

- Model Training in Practice

Deep Dive into LLMs like ChatGPT
2025年02月06日  @TimeStampBuddy 様 
02:07:30 - 03:31:24
reinforcement learning - Deep Dive into LLMs like ChatGPT

reinforcement learning

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:14:42 - 02:27:47
DeepSeek-R1 - Deep Dive into LLMs like ChatGPT

DeepSeek-R1

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:27:47 - 02:42:07
Deepseek says “$3 is a bit expensive for an apple, but maybe they’re organic or something” 😂 - Deep Dive into LLMs like ChatGPT

Deepseek says “$3 is a bit expensive for an apple, but maybe they’re organic or something” 😂

Deep Dive into LLMs like ChatGPT
2025年02月06日  @austinw.1530 様 
02:34:21 - 03:31:24
What a treat!!! At  , haha when you say this is very busy very ugly because of google not being able to nail that was epic hahah - Deep Dive into LLMs like ChatGPT

What a treat!!! At , haha when you say this is very busy very ugly because of google not being able to nail that was epic hahah

Deep Dive into LLMs like ChatGPT
2025年02月06日  @KS-df1cp 様 
02:41:08 - 03:31:24
AlphaGo - Deep Dive into LLMs like ChatGPT

AlphaGo

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:42:07 - 02:48:26
Thank you for the video Andrej! One small note: at , the dashed line in the AlphaGo Zero plot is the Elo of the version of AlphaGo that *defeated* Lee in 2016 (not the Elo of Lee himself). - Deep Dive into LLMs like ChatGPT

Thank you for the video Andrej! One small note: at , the dashed line in the AlphaGo Zero plot is the Elo of the version of AlphaGo that *defeated* Lee in 2016 (not the Elo of Lee himself).

Deep Dive into LLMs like ChatGPT
2025年02月06日  @nkhr2 様 
02:43:05 - 03:31:24
reinforcement learning from human feedback (RLHF) - Deep Dive into LLMs like ChatGPT

reinforcement learning from human feedback (RLHF)

Deep Dive into LLMs like ChatGPT
2025年02月06日 
02:48:26 - 03:09:39
Tiny typo "let's add it to the dataset and give it an ordering that's extremely like a score of 5" -> SHOULD BE "let's add it to the dataset and give it an ordering that's extremely like a score of 1" - Deep Dive into LLMs like ChatGPT

Tiny typo "let's add it to the dataset and give it an ordering that's extremely like a score of 5" -> SHOULD BE "let's add it to the dataset and give it an ordering that's extremely like a score of 1"

Deep Dive into LLMs like ChatGPT
2025年02月06日  @giofou711 様 
03:03:44 - 03:31:24
preview of things to come - Deep Dive into LLMs like ChatGPT

preview of things to come

Deep Dive into LLMs like ChatGPT
2025年02月06日 
03:09:39 - 03:15:15
keeping track of LLMs - Deep Dive into LLMs like ChatGPT

keeping track of LLMs

Deep Dive into LLMs like ChatGPT
2025年02月06日 
03:15:15 - 03:18:34
If you have made it to this timestamp, then finish the video and go build something with LLMs. 😊 - Deep Dive into LLMs like ChatGPT

If you have made it to this timestamp, then finish the video and go build something with LLMs. 😊

Deep Dive into LLMs like ChatGPT
2025年02月06日  @Ronak.Purohit 様 
03:16:59 - 03:31:24
where to find LLMs - Deep Dive into LLMs like ChatGPT

where to find LLMs

Deep Dive into LLMs like ChatGPT
2025年02月06日 
03:18:34 - 03:21:46
grand summary - Deep Dive into LLMs like ChatGPT

grand summary

Deep Dive into LLMs like ChatGPT
2025年02月06日 
03:21:46 - 03:31:24
In principle these models are capable of analogies no human has had. Wow😮 - Deep Dive into LLMs like ChatGPT

In principle these models are capable of analogies no human has had. Wow😮

Deep Dive into LLMs like ChatGPT
2025年02月06日  @xnivaxhzne 様 
03:29:54 - 03:31:24
Thank you Andrej for this! Please continue putting out content like this; you are one of the best teachers in this space who can explain at this level of detail. The entire  is pure gold, and I am very grateful that you are putting in this level of time and effort ❤ - Deep Dive into LLMs like ChatGPT

Thank you Andrej for this! Please continue putting out content like this; you are one of the best teachers in this space who can explain at this level of detail. The entire is pure gold, and I am very grateful that you are putting in this level of time and effort ❤

Deep Dive into LLMs like ChatGPT
2025年02月06日  @ericmathews3619 様 
03:31:23 - 03:31:24