
Is it possible to produce text with few-shot learning? #252

Open
Yusuf-YENICERI opened this issue Apr 23, 2023 · 6 comments

Comments

@Yusuf-YENICERI

I trained a GPT model using this repo. I tried to produce text using few-shot learning, with a prompt like the one below:

Message: Support has been terrible for 2 weeks...
Sentiment: Negative
###
Message: I love your API, it is simple and so fast!
Sentiment: Positive
###
Message: GPT-J has been released 2 months ago.
Sentiment: Neutral
###
Message: The reactivity of your team has been amazing, thanks!
Sentiment:

The result I get isn't related to the prompt. Does this repo enable that feature, or is my model bad?
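For reference, a few-shot prompt in the style shown above can be assembled programmatically before handing it to the sampling script. This is only a sketch: the example messages and the `###` separator come from the prompt above, while the helper function itself is illustrative and not part of nanoGPT.

```python
# Build a few-shot sentiment prompt in the style shown above.
# The trained model is expected to continue after the final "Sentiment:".

def build_prompt(examples, query, separator="###"):
    """examples: list of (message, sentiment) pairs; query: message to classify."""
    parts = []
    for message, sentiment in examples:
        parts.append(f"Message: {message}\nSentiment: {sentiment}")
    parts.append(f"Message: {query}\nSentiment:")
    return f"\n{separator}\n".join(parts)

examples = [
    ("Support has been terrible for 2 weeks...", "Negative"),
    ("I love your API, it is simple and so fast!", "Positive"),
    ("GPT-J has been released 2 months ago.", "Neutral"),
]
prompt = build_prompt(
    examples, "The reactivity of your team has been amazing, thanks!"
)
print(prompt)
```

The resulting string would be passed as the starting context to the sampling loop, and the model's continuation read off as the predicted label.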

@karpathy
Owner

At the scale of nanoGPT basically the answer is no. ICL (in context learning) emerges a few B parameters down the road.

@Yusuf-YENICERI
Author

Then may I ask: if I fine-tuned the GPT model I trained on a prompt-answer dataset, could I get a ChatGPT-like model? The reason I want this is to have a model in my language that answers questions in the domains I care about.

Thanks for the reply.

@C080

C080 commented Jun 20, 2023

Hi! Try loading the GPT-2 XL weights and fine-tuning on your prompt-answer dataset; it should be able to produce your desired output.
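One way to shape a prompt-answer dataset for this kind of fine-tuning is to flatten the pairs into a single text blob, which is the input format nanoGPT's `data/*/prepare.py` scripts tokenize. A minimal sketch; the `Prompt:`/`Answer:` labels and the `###` delimiter are assumptions, not a format nanoGPT prescribes:

```python
# Flatten prompt-answer pairs into one training document, in the spirit of
# nanoGPT's data/*/prepare.py scripts (which tokenize a single text file).

def flatten_pairs(pairs, delimiter="\n###\n"):
    """pairs: list of (prompt, answer) strings -> one training document."""
    blocks = [f"Prompt: {p}\nAnswer: {a}" for p, a in pairs]
    return delimiter.join(blocks) + delimiter

pairs = [
    ("What is the capital of Turkey?", "Ankara."),
    ("Translate 'merhaba' to English.", "Hello."),
]
text = flatten_pairs(pairs)
# This text would then be tokenized (e.g. with GPT-2 BPE) and split into
# train.bin / val.bin the same way the existing prepare.py scripts do.
print(text)
```

At sampling time you would prompt the fine-tuned model with `Prompt: ...\nAnswer:` and let it complete.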

@Yusuf-YENICERI
Author

Yusuf-YENICERI commented Jun 20, 2023

@C080
GPT-2 XL is trained on English, but I want it for my language, which is Turkish. Wouldn't that be a problem? Or would it work, just not perform well enough?

@C080

C080 commented Jun 20, 2023

It could pick up Turkish if it was trained on a multilingual dataset that includes Turkish! Otherwise, try adding two layers of Google Translate, before and after, so all the reasoning happens in English!
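The "two layers of translation" idea is just a translate-in/translate-out wrapper around the English-only model. A sketch with stub translation functions; any real MT service would replace the `translate_to_en` / `translate_from_en` placeholders, and nothing here is an actual Google Translate API call:

```python
# Translation-sandwich wrapper: Turkish in -> English model -> Turkish out.
# translate_to_en / translate_from_en are placeholders for a real MT service.

def answer_in_turkish(question_tr, model, translate_to_en, translate_from_en):
    question_en = translate_to_en(question_tr)  # Turkish -> English
    answer_en = model(question_en)              # English-only model reasons here
    return translate_from_en(answer_en)         # English -> Turkish

# Demo with identity "translators" and a toy model standing in for GPT-2:
reply = answer_in_turkish(
    "merhaba",
    model=lambda q: f"echo: {q}",
    translate_to_en=lambda s: s,
    translate_from_en=lambda s: s,
)
print(reply)
```

The obvious caveat is that translation errors compound in both directions, so quality depends heavily on the MT system.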

@VatsaDev

@Yusuf-YENICERI

Message: Support has been terrible for 2 weeks...
Sentiment: Negative
###
Message: I love your API, it is simple and so fast!
Sentiment: Positive
###
Message: GPT-J has been released 2 months ago.
Sentiment: Neutral
###
Message: The reactivity of your team has been amazing, thanks!
Sentiment:

This is totally possible if you scale up a lot, but there are much better models for this, like a BERT fine-tuned for sentiment analysis. My repo uses a similar style, but for chat messages, like so:

<human> ... <endOfText>
<bot> ... <endOfText>
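Serializing a conversation into that tag style could look like the following. The `<human>`/`<bot>`/`<endOfText>` tokens follow the snippet above; the helper function itself is illustrative:

```python
# Serialize chat turns into the <human>/<bot> ... <endOfText> format above.

def format_chat(turns):
    """turns: list of (speaker, text) pairs, speaker in {"human", "bot"}."""
    lines = [f"<{speaker}> {text} <endOfText>" for speaker, text in turns]
    return "\n".join(lines)

sample = format_chat([
    ("human", "Is few-shot learning possible at nanoGPT scale?"),
    ("bot", "Not really; in-context learning emerges at billions of parameters."),
])
print(sample)
```

Training on text in this shape teaches the model the turn structure, so at inference you stop generation when it emits `<endOfText>`.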

gkielian pushed a commit to gkielian/ReaLLMASIC_nanogpt that referenced this issue Sep 5, 2024
Add MLP Expansion factor control and sweep