Large Language Model: world models or surface statistics?

5401 shaares
131 private links

5401 shaares · 131 private links

Filters

Links per page

20 50 100

Large Language Model: world models or surface statistics?

Large Language Models (LLM) are on fire, capturing public attention by their ability to provide seemingly impressive completions to user prompts (NYT coverage). They are a delicate combination of a radically simplistic algorithm with massive amounts of data and computing power. They are trained by playing a guess-the-next-word game with itself over and over again. Each time, the model looks at a partial sentence and guesses the following word. If it makes it correctly, it will update its parameters to reinforce its confidence; otherwise, it will learn from the error and give a better guess next time.

Mon Jan 30 21:37:16 2023 * · permalink

https://thegradient.pub/othello/

Filters

Links per page

20 50 100