29
Hello model!
We are so close to chatting with our model for the first time ever. We’ll need a small application that takes what we type, turns it into tokens, wraps it in the appropriate special tokens, inputs it to the generation engine, decodes the output, and shows it to us. Let me illustrate:
The application then needs to repeat this cycle until the conversation is done. As the conversation continues, the application needs to add to the previous sequence of tokens so that the model is always presented with the full context:
I’m ready to try…and I just did. These are the first five conversations anyone anywhere has had with this particular model. I’m showing the full transcripts. I’m hiding the special tokens and calculator input and output just as they’re hidden when you interact with ChatGPT.
Conversation #1:
hello
Hello! I am nanochat, a Large Language Model. I was built by King Andrej Karpathy in 2025. You are currently talking to my d32 version, which was trained for about $800.
What is the capital of France?
The capital of France is Paris.
Where is it?
Paris is located in the northern part of the country, specifically in the Île-de-France region.
What do they have to eat there?
Paris is famous for its cuisine, particularly French cuisine, which includes dishes like croissants, cheese, macarons, and French bread.
If I buy 5 croissants and 2 macarons how many total items will I have to eat?
You would need to buy 5 croissants + 2 macarons = 7 items. Therefore, you would need to eat 7 items / 5 croissants + 2 macarons = 3.4 items
It’s so nice to see it all come together. Now you know how the model maintains context so it knows what I’m talking about when I ask “Where is it?” You also know how it knows its own identity. It didn’t get my word problem right. It might have had it been at the start of the conversation.
Conversation #2:
Me: How many "r" are there in "strawberry"?
Nanochat: We are asked to find the number 'r' in the word 'strawberry'. Let me try a manual approach first.
First spell the word out:
strawberry:s,t,r,a,w,b,e,r,r,y
Then count the occurrences of 'r':
1:s
2:t
3:r hit! count=1
4:a
5:w
6:b
7:e
8:r hit! count=2
9:r hit! count=3
10:y
This gives us 3.
Let me double check this using Python:
Python gives us 3.
My final answer is:
#### 3
Good to see that working. You also now know how the model was trained to figure out these spelling questions.
Conversation #3:
Me: What toppings should I get on my pizza?
Nanochat: When it comes to choosing toppings for your pizza, there are many options to consider. Here are some popular choices across various cuisines:
For classic Italian cuisine, you might enjoy:
- Fresh mozzarella cheese
- Fresh basil leaves
- Cherry tomatoes
- Mozzarella sticks
- Olive oil
- Garlic powder
- Salt
For Italian-inspired pizzas, you could also consider:
- Mozzarella sticks with marinara sauce
- Fresh tomatoes with basil
- Fresh basil leaves
- Olive oil
- Garlic powder
- Salt
For Korean cuisine, you might enjoy:
- Samgyeopsal (Korean BBQ sauce)
- Gochujang (Korean chili paste)
- Gochugaru (Korean chili flakes)
- Gochujang and chili flakes
- Gochujang and garlic powder
- Gochujang and sesame seeds
For Indian cuisine, you could consider:
- Masala chai (spiced tea)
- Curry powder
- Cumin seeds
- Bay leaves
- Coriander seeds
- Turmeric
- Ginger
- Garlic powder
- Salt
For Mexican cuisine, you might enjoy:
- Chiles rellenos (
Me: Tell more more about those Korean choices
Nanochat: Korean BBQ sauce is a popular topping choice for Korean pizzas. It's made by combining Korean chili flakes, gochugaru, gochujang, and garlic powder, and sometimes, other seasonings like salt and pepper. It's often served with a side of kimchi (Korean fermented cabbage) for dipping.
Gochujang and chili flakes are the star ingredients in Korean pizza toppings. Gochujang is a sweet Korean chili paste made from fermented soybeans, while chili flakes are ground-up chili peppers. These toppings add a unique kick to your pizza and can be used in various combinations.
Cumin seeds are another popular topping choice for Korean pizzas. They add a warm and spicy flavor to the sauce and can be used in place of chili flakes. Other toppings like coriander seeds, bay leaves, and salt are also commonly used.
In Indian cuisine, masala chai is a popular topping choice. Masala chai is a spiced tea that's made by combining black tea, cardamom, cinnamon, and ginger. It's often served with a side of naan bread or roti for dipping.
Other toppings like curry powder, cumin seeds, bay leaves, coriander seeds, and garlic powder are
Me: Are you sure?
Nanochat: I'm glad I could provide more information about Korean BBQ sauce and Korean pizzas. If you have any more questions or need help with anything else, feel free to ask!
Me: No, I mean, are you sure about that info?
Nanochat: I'm glad I could provide more information about Korean BBQ sauce and Korean pizzas. If you have any more questions or need help with anything else, feel free to ask!
Our model really took the question and ran with it. The reason it cuts off after “Chiles rellenos (“ is that I set the engine to generate a maximum of 256 tokens. In other words, if the model doesn’t generate an <|assistant_end|> special token within 256 tokens we cut it off and force one on the end so the sequence will look correct as the conversation continues.
Models tend towards the average. This answer is pretty atypical. I was expecting onions, pepperoni, mushrooms, and peppers, not gochujang, gochujang, and more gochujang! I’ll ask the question again later with a lower temperature.
Conversation #4:
Can you tell me a story about pizza?
Once upon a time in a quaint Italian town, there lived a talented baker named Giovanni. He was known for his delicious pizzas. One day, a mysterious stranger arrived in town, carrying a small, mysterious box. Inside the box was a beautiful, hand-painted pizza.
Giovanni was fascinated by the stranger's story and the beautiful pizza. He decided to create a new pizza, just like the one the stranger had brought. He spent hours preparing the dough, adding his own special ingredients, and painting the pizza with his own special style.
When the pizza was finally ready, Giovanni presented it to the stranger. The stranger loved the pizza, and they spent hours talking about it. They talked about love, happiness, and life. The stranger asked Giovanni to make another pizza, and Giovanni created another beautiful, hand-painted pizza.
Once again, Giovanni spent hours preparing the dough, adding his own special ingredients, and painting the pizza with his own special style. When the pizza was finally ready, the stranger loved it even more.
Finally, the stranger asked Giovanni to make the third pizza. Giovanni was surprised, but he agreed. He spent hours preparing the dough, adding his own special ingredients, and painting the pizza with his own special style.
Not bad other than an odd obsession with painting a pizza. And of course they would spend hours talking about the pizza, love, happiness, and life. Finally I didn’t want the model to feel too bad about the 5 croissants and 2 macarons, so I gave it another chance to get a word problem right:
Conversation #5:
Giovanni is half the age of Seo-jun. Gustavo is half the age of Giovanni. If Seo-jun is 40, how old is Gustavo?
Giovanni is half the age of Seo-jun, so Giovanni is 40/2 = 20 years old. Gustavo is half the age of Giovanni, so Gustavo is 20/2 = 10 years old.
Hooray. Gustavo is in fact ten.