Search: [large_language_models]

Topic Modeling with Llama 2 - Maarten Grootendorst

With the advent of Llama 2, running strong LLMs locally has become more and more a reality. Its accuracy approaches OpenAI's GPT-3.5, which serves well for many use cases.

In this article, we will explore how we can use Llama2 for Topic Modeling without the need to pass every single document to the model. Instead, we are going to leverage BERTopic, a modular topic modeling technique that can use any LLM for fine-tuning topic representations.

large_language_models · topic_modeling · AI · article · tutorial · blog

Sun Oct 27 16:19:43 2024 * · permalink

·

https://www.maartengrootendorst.com/blog/llama2-topic/

What are LLMs? Understanding different LLM families | by Mehul Gupta | Data Science in your pocket | Medium

An LLM is no black box but an ML model (based on Neural Networks) that predicts the ‘next’ token given a sequence of previously predicted tokens and input prompt.
How is it able to get the context of the input? Using multi-head attention helps in focusing on important words compared to other tokens in the input sentence. If you’re interested in mathematics, you can read the below blog.

AI · large_language_models · article · blog

Sun Oct 27 16:16:05 2024 * · permalink

·

https://medium.com/data-science-in-your-pocket/what-are-llms-understanding-different-llm-families-48b030c2e4fb

butterfish.nvim: Neovim plugin for fluent coding using language models

vim · plugin · software · AI · large_language_models

Wed Jan 10 17:26:18 2024 * · permalink

·

https://github.com/bakks/butterfish.nvim

llamafile is the new best way to run a LLM on your own computer

large_language_models · post

Fri Dec 1 21:07:00 2023 * · permalink

·

https://simonwillison.net/2023/Nov/29/llamafile/

Large Language Model: world models or surface statistics?

Large Language Models (LLM) are on fire, capturing public attention by their ability to provide seemingly impressive completions to user prompts (NYT coverage). They are a delicate combination of a radically simplistic algorithm with massive amounts of data and computing power. They are trained by playing a guess-the-next-word game with itself over and over again. Each time, the model looks at a partial sentence and guesses the following word. If it makes it correctly, it will update its parameters to reinforce its confidence; otherwise, it will learn from the error and give a better guess next time.

article · science · research · NLP · large_language_models

Mon Jan 30 21:37:16 2023 * · permalink

·

https://thegradient.pub/othello/

Yandex Publishes YaLM 100B. It’s the Largest GPT-Like Neural Network in Open Source

In recent years, large-scale transformer-based language models have become the pinnacle of neural networks used in NLP tasks. They grow in scale and complexity every month, but training such models requires millions of dollars, the best experts, and years of development. That’s why only major IT companies have access to this state-of-the-art technology. However, researchers and developers all over the world need access to these solutions. Without new research, their growth could wane. The only way to avoid this is by sharing best practices with the developer community.

We’ve been using YaLM family of language models in our Alice voice assistant and Yandex Search for more than a year now.

article · AI · NLP · large_language_models · neural_networks

Sun Dec 4 17:27:11 2022 * · permalink

·

https://medium.com/yandex/yandex-publishes-yalm-100b-its-the-largest-gpt-like-neural-network-in-open-source-d1df53d0e9a6