What are LLMs? Understanding different LLM families | by Mehul Gupta | Data Science in your pocket

5401 shaares
131 private links

5401 shaares · 131 private links

Filters

Links per page

20 50 100

What are LLMs? Understanding different LLM families | by Mehul Gupta | Data Science in your pocket | Medium

An LLM is no black box but an ML model (based on Neural Networks) that predicts the ‘next’ token given a sequence of previously predicted tokens and input prompt.
How is it able to get the context of the input? Using multi-head attention helps in focusing on important words compared to other tokens in the input sentence. If you’re interested in mathematics, you can read the below blog.

AI · large_language_models · article · blog

Sun Oct 27 16:16:05 2024 * · permalink

https://medium.com/data-science-in-your-pocket/what-are-llms-understanding-different-llm-families-48b030c2e4fb

Filters

Links per page

20 50 100