
Intro into the growing LLM ecosystem

The audio mode demos at pm were simply amazing!

ChatGPT interaction under the hood

Basic LLM interactions examples

Exactly, I have observed this so many times!! Why do these chat platforms not have an option to branch out to a new chat (for exploring multiple ideas or something) from a particular answer point? Are there any technical challenges?

new chat ---> for new topic

Be aware of the model you're using, pricing tiers

Personal note:

Thinking models and when to use them

Tool use: internet search

Is one Search one token in the context window?

Casual sneeze making the video even more fun

Bless you, Andrej!

We Vietnamese always cherish exceptional talents like you, Andrej.

Tool use: deep research

ChatGPT was not the first to offer Deep Research. Gemini made Deep Research available on December 11, 2024. ChatGPT added theirs February 2, 2025.

You missed Gemini Deep Research. That’s the original one.

What we would really need is the ability to pass the response, with all the provided references, to another AI system that has thinking and internet access, with the task "Does this article content match the provided references?". I'm pretty sure that different AI models do not accidentally hallucinate badly enough to fail this kind of verification task most of the time.
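A minimal sketch of how such a verification request could be assembled, assuming the first model's answer and its citations are already available as plain text. The names `Reference` and `build_verification_prompt` are hypothetical, and no real model API is called here; the resulting string would simply be handed to whichever second model you trust.

```python
from dataclasses import dataclass


@dataclass
class Reference:
    """One cited source: a label as used in the article, plus its URL."""
    label: str
    url: str


def build_verification_prompt(article: str, references: list[Reference]) -> str:
    """Assemble the text for a second, independent model (hypothetical step)."""
    ref_lines = [
        f"[{i}] {ref.label}: {ref.url}"
        for i, ref in enumerate(references, start=1)
    ]
    return "\n".join([
        "Does this article content match the provided references?",
        "Open each reference, then list every claim the sources do not support.",
        "",
        "ARTICLE:",
        article.strip(),
        "",
        "REFERENCES:",
        *ref_lines,
    ])


# Placeholder article text and URL, for illustration only.
prompt = build_verification_prompt(
    "Gemini made Deep Research available on December 11, 2024.",
    [Reference("Example source", "https://example.com/announcement")],
)
print(prompt)
```

Sending the check to a different model than the one that wrote the article is the point of the idea: the two would have to make the same mistake for a bad citation to slip through.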

File uploads, adding documents to context

Re , accessing .epub in context would be a win. Imagine clicking a Table of Contents chapter inside Cursor or the ChatGPT platform and having it ready for the selected LLM. 📖 🙂

I think Copilot in Edge lets you ask questions in a task pane and, as I remember, also supports marking. Thanks for your insights!

You need the Highlight app. It literally takes into context whatever document you have open on your system, so no copying is needed. Very smooth.

I suggest using kortex for large numbers of PDFs or books that you want to use with an LLM. I am not sure about each LLM's limit on document uploads (in MB) or how that is connected to token input limits. I would like to know more about this.

You could just have the ChatGPT floating window open while you read a book in full-screen. That way, you don’t have to keep switching between windows. 👍🏻

"don't read books alone"

Tool use: python interpreter, messiness of the ecosystem

Gemini's prediction is not actually close. It is lower by an order of 3. But another amazing video by Andrej! Thank you :)

ChatGPT Advanced Data Analysis, figures, plots

Keep in mind, if you're reading this: just because it uses an internet source doesn't mean it won't hallucinate content it thinks it found in the source.

0.1 is a heuristic to avoid 0, which may behave badly?

Claude Artifacts, apps, diagrams

This is pure gold. Andrej is the best teacher on all things AI. He teaches with such clarity and simplicity that the knowledge just sticks. I just wish that the part about coding between - (1) had a disclaimer when there are high vulnerabilities in node dependencies and (2) discussed the legal aspects of using code generated by LLMs or LLM-powered tools like Cursor, Windsurf, GitHub Copilot, etc. I really wish such videos talked about this crucial aspect, or else most viewers will get the sense that software development is as simple as prompting LLMs for code and using the generated code as it is. There are many cases where such LLMs spit out copyrighted code or code under licenses, and using it without attribution is risky.

---> conceptual diagram

Love the conceptual diagram idea. Very very useful

Cursor: Composer, writing code

The confetti moment got me excited too. Amazing video, Andrej, thank you!

talk to llms

Audio (Speech) Input/Output

What a gigachad. And yet for some reason he doesn't seem to be aware that his Mac comes with a Dictation feature. Maybe he has an older version of macOS. Maybe I'm missing something, but this section of the video makes no sense to me. But again, what an amazing video by a generous genius!

The native ChatGPT app for macOS does have the mic icon.

Why don't you use the Mac dictation feature?

Advanced Voice Mode aka true audio inside the model

Kind of how Shazam works under the hood: it builds a spectrogram of the audio, identifies the peak points in it with background noise minimized, converts those peak points into audio fingerprints, and finally searches its database of millions of songs for a matching fingerprint.
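The pipeline described in this comment (spectrogram, peaks, fingerprints, database lookup) can be sketched in a few lines. This is a toy illustration and not Shazam's actual implementation: the "songs" are synthetic sine tones, only the single strongest frequency bin per frame is kept as a peak, and all names (`synth`, `spectrogram`, `peaks`, `fingerprints`, `build_index`, `identify`) are made up for the example.

```python
import cmath
import math
import random
from collections import Counter, defaultdict

FRAME = 64  # samples per analysis frame (toy value)


def synth(notes, noise=0.0, seed=0):
    """Render one FRAME-long sine tone per note; each note is a DFT bin index."""
    rng = random.Random(seed)
    samples = []
    for bin_index in notes:
        for n in range(FRAME):
            tone = math.sin(2 * math.pi * bin_index * n / FRAME)
            samples.append(tone + rng.uniform(-noise, noise))
    return samples


def spectrogram(samples):
    """Naive DFT magnitudes per non-overlapping frame (bins 0..FRAME/2-1)."""
    frames = []
    for start in range(0, len(samples) - FRAME + 1, FRAME):
        chunk = samples[start:start + FRAME]
        frames.append([
            abs(sum(x * cmath.exp(-2j * math.pi * k * n / FRAME)
                    for n, x in enumerate(chunk)))
            for k in range(FRAME // 2)
        ])
    return frames


def peaks(frames):
    """Keep only the strongest bin per frame, which discards low-level noise."""
    return [max(range(len(mags)), key=mags.__getitem__) for mags in frames]


def fingerprints(peak_bins, fan_out=3):
    """Hash pairs of nearby peaks: ((bin_a, bin_b, frame_gap), anchor_frame)."""
    out = []
    for t, a in enumerate(peak_bins):
        for gap in range(1, fan_out + 1):
            if t + gap < len(peak_bins):
                out.append(((a, peak_bins[t + gap], gap), t))
    return out


def build_index(songs):
    """Map every fingerprint hash to the songs and frames where it occurs."""
    index = defaultdict(list)
    for title, notes in songs.items():
        for key, t in fingerprints(peaks(spectrogram(synth(notes)))):
            index[key].append((title, t))
    return index


def identify(index, clip_samples):
    """Vote for (song, time offset) pairs; a true match lines up at one offset."""
    votes = Counter()
    for key, t_clip in fingerprints(peaks(spectrogram(clip_samples))):
        for title, t_song in index.get(key, []):
            votes[(title, t_song - t_clip)] += 1
    if not votes:
        return None
    (title, _offset), _count = votes.most_common(1)[0]
    return title


songs = {
    "song_a": [3, 7, 12, 5, 9, 14, 4, 11, 6, 13],
    "song_b": [8, 2, 15, 10, 3, 6, 12, 7, 14, 5],
}
index = build_index(songs)
clip = synth(songs["song_b"][3:8], noise=0.2, seed=1)  # noisy excerpt of song_b
print(identify(index, clip))  # prints song_b
```

Hashing pairs of peaks together with their time gap, rather than single peaks, is what makes a lookup selective: a stray matching pair in the wrong song gets one vote, while the real song collects many votes at the same time offset.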

Your reaction at killed me lmao

NotebookLM, podcast generation

Image input, OCR

Woke up in the middle of the night to find that I had been listening to this all night. If I magically know a bunch of shit about LLMs... I'm going to be shook

For those interested, the math problem at is not that tricky 🙃.

No Andrej, you failed to trick me 😎😅

Image output, DALL-E, Ideogram, etc.

Video input, point and talk on app

Video output, Sora, Veo 2, etc etc.

ChatGPT memory, custom instructions

whenever you make a typo while typing, that should be a reminder to type with superwhisper instead

"I am Andrej Karpathy; Yes - the AI researcher" What an insane flex. Imagine confirming to an LLM that it's indeed talking to the guy it actually has training memory on.

Custom GPTs

Can you add a reverse (round-trip) button to your translator? It's a great way to test the "stability" of a translation.
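A rough sketch of what such a round-trip check could look like. The two word-for-word dictionaries are toy stand-ins for a real translation model (an assumption, as are the names `translate` and `round_trip`); in practice both calls would go to the translator itself. The "stability" score here is just the word-level similarity between the original sentence and its back-translation.

```python
from difflib import SequenceMatcher

# Toy dictionaries standing in for a real translator (assumption).
# "big" and "large" both map to "grande", so the forward step is lossy.
EN_TO_ES = {"the": "la", "big": "grande", "large": "grande", "house": "casa"}
ES_TO_EN = {"la": "the", "grande": "big", "casa": "house"}


def translate(text, table):
    """Word-for-word lookup; unknown words pass through unchanged."""
    return " ".join(table.get(word, word) for word in text.lower().split())


def round_trip(text, forward, backward):
    """Translate there and back, then score how much of the original survived."""
    there = translate(text, forward)
    back = translate(there, backward)
    score = SequenceMatcher(None, text.lower().split(), back.split()).ratio()
    return there, back, score


for sentence in ["the big house", "the large house"]:
    there, back, score = round_trip(sentence, EN_TO_ES, ES_TO_EN)
    print(f"{sentence!r} -> {there!r} -> {back!r} (stability {score:.2f})")
```

A score of 1.0 means the sentence came back unchanged; a lower score flags words that the translation could not carry both ways, which is where a human would want to look more closely.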

agree 👍 going to use it
