In recent years, large-scale transformer-based language models have become the pinnacle of neural networks used in NLP tasks. They grow in scale and complexity every month, but training such models requires millions of dollars, the best experts, and years of development. That’s why only major IT companies have access to this state-of-the-art technology. However, researchers and developers all over the world need access to these solutions; without new research, progress in the field could wane. The only way to avoid this is by sharing best practices with the developer community.
We’ve been using the YaLM family of language models in our Alice voice assistant and in Yandex Search for more than a year now.
Reinforcement Learning (RL) is better seen as a “fine-tuning” paradigm that can add capabilities to general-purpose pretrained models, rather than a paradigm that can bootstrap intelligence from scratch.
A transcompiler, also known as a source-to-source translator, is a system that converts source code from one high-level programming language (such as C++ or Python) to another. Transcompilers are primarily used for interoperability, and to port codebases written in an obsolete or deprecated language (e.g. COBOL, Python 2) to a modern one. They typically rely on handcrafted rewrite rules, applied to the source code abstract syntax tree. Unfortunately, the resulting translations often lack readability, fail to respect the target language conventions, and require manual modifications in order to work properly. The overall translation process is time-consuming and requires expertise in both the source and target languages, making code-translation projects expensive.
Although neural models significantly outperform their rule-based counterparts in the context of natural language translation, their applications to transcompilation have been limited due to the scarcity of parallel data in this domain. In this paper, we propose to leverage recent approaches in unsupervised machine translation to train a fully unsupervised neural transcompiler. We train our model on source code from open source GitHub projects, and show that it can translate functions between C++, Java, and Python with high accuracy.
Our method relies exclusively on monolingual source code, requires no expertise in the source or target languages, and can easily be generalized to other programming languages. We also build and release a test set composed of 852 parallel functions, along with unit tests to check the correctness of translations. We show that our model outperforms rule-based commercial baselines by a significant margin.
Since the 1940s, electric guitarists, keyboardists, and other instrumentalists have been using effects pedals, devices that modify the sound of the original audio source. Typical effects include distortion, compression, chorus, reverb, and delay. Early effects pedals consisted of basic analog circuits, often along with vacuum tubes, which were later replaced with transistors. Although many pedals today apply effects digitally with modern signal processing techniques, many purists argue that the sound of analog pedals cannot be replaced by their digital counterparts. We’ll follow a deep learning approach to see if we can use machine learning to replicate the sound of an iconic analog effect pedal, the Ibanez Tube Screamer. This post will be mostly a reproduction of the work done by Alec Wright et al. in Real-Time Guitar Amplifier Emulation with Deep Learning [1].

[1] Alec Wright et al., “Real-Time Guitar Amplifier Emulation with Deep Learning.”
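For a rough idea of what this looks like in practice, here is a minimal sketch (my own illustration, not the post’s or the paper’s code) of the recurrent approach: a small LSTM maps the dry guitar signal to the pedal’s wet output and is trained on paired recordings with an error-to-signal-ratio loss. All shapes, sizes, and the toy data below are assumptions.

```python
# Minimal sketch of LSTM-based pedal emulation (illustrative, not the original code).
import torch
import torch.nn as nn

class PedalEmulator(nn.Module):
    def __init__(self, hidden_size=32):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.out = nn.Linear(hidden_size, 1)

    def forward(self, x):
        # x: (batch, samples, 1) dry signal
        h, _ = self.lstm(x)
        # predict the wet signal as the dry signal plus a learned correction
        return self.out(h) + x

def esr_loss(pred, target, eps=1e-8):
    # error-to-signal ratio, a common loss in amp/pedal modelling
    return torch.sum((target - pred) ** 2) / (torch.sum(target ** 2) + eps)

# toy training step on random "dry"/"wet" pairs standing in for real recordings
model = PedalEmulator()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
dry = torch.randn(8, 2048, 1)
wet = torch.tanh(3 * dry)          # crude stand-in for a clipping pedal
opt.zero_grad()
loss = esr_loss(model(dry), wet)
loss.backward()
opt.step()
```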
Computer simulations are invaluable tools for scientific discovery. However, accurate simulations are often slow to execute, which limits their applicability to extensive parameter exploration, large-scale data analysis, and uncertainty quantification. A promising route to accelerate simulations by building fast emulators with machine learning requires large training datasets, which can be prohibitively expensive to obtain with slow simulations. Here we present a method based on neural architecture search to build accurate emulators even with limited training data. The method successfully accelerates simulations by up to 2 billion times in 10 scientific cases including astrophysics, climate science, biogeochemistry, high energy density physics, fusion energy, and seismology, using the same super-architecture, algorithm, and hyperparameters. Our approach also inherently provides emulator uncertainty estimation, adding further confidence in their use. We anticipate this work will accelerate research involving expensive simulations, allow more extensive parameter exploration, and enable new, previously unfeasible computational discovery.
Analyses of single-cell recordings from mouse ventral tegmental area are consistent with a model of reinforcement learning in which the brain represents possible future rewards not as a single mean of stochastic outcomes, as in the canonical model, but instead as a probability distribution.
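A toy sketch of the underlying idea (my own illustration, not the study’s model): a population of value estimators, each updating with a different asymmetry between positive and negative prediction errors, converges to different expectiles of the reward distribution rather than to a single mean. The reward distribution and learning rates below are made up for illustration.

```python
# Toy expectile-style sketch of distributional value learning (illustrative only).
import numpy as np

rng = np.random.default_rng(0)

# stochastic reward: 1.0 with probability 0.3, else 0.0
def sample_reward():
    return 1.0 if rng.random() < 0.3 else 0.0

asymmetries = np.linspace(0.1, 0.9, 9)   # tau values, one per "unit"
values = np.zeros_like(asymmetries)      # each unit's reward prediction
lr = 0.02

for _ in range(20000):
    r = sample_reward()
    delta = r - values                   # prediction errors, one per unit
    # positive errors scaled by tau, negative errors by (1 - tau)
    step = np.where(delta > 0, asymmetries, 1 - asymmetries) * delta
    values += lr * step

# units with tau = 0.5 approximate the mean reward (0.3); low/high tau units
# settle at lower/higher expectiles, jointly encoding the distribution
print(values)
```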
Facebook AI has developed the first neural network that uses symbolic reasoning to solve advanced mathematics problems.
This article discusses GPT-2 and BERT models, as well as using knowledge distillation to create highly accurate models with fewer parameters than their teachers.
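As a rough illustration of the distillation objective (an assumed generic recipe, not necessarily the article’s exact setup): the student is trained on a weighted mix of the usual hard-label loss and a soft-label loss that matches the teacher’s temperature-scaled output distribution.

```python
# Generic knowledge-distillation loss sketch (illustrative, not the article's code).
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=2.0, alpha=0.5):
    # soft targets: KL divergence between softened student and teacher distributions
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    # hard targets: ordinary cross-entropy on the true labels
    hard = F.cross_entropy(student_logits, labels)
    return alpha * soft + (1 - alpha) * hard

# toy usage with random logits standing in for real model outputs
student_logits = torch.randn(4, 10, requires_grad=True)
teacher_logits = torch.randn(4, 10)
labels = torch.randint(0, 10, (4,))
loss = distillation_loss(student_logits, teacher_logits, labels)
loss.backward()
```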
A neuroevolution game experiment.
A simple walkthrough of what RNNs are, how they work, and how to build one from scratch in Python.
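In the same spirit, a from-scratch vanilla RNN forward pass fits in a few lines of NumPy (a generic sketch with made-up sizes, not the article’s code):

```python
# Minimal from-scratch RNN forward pass in NumPy (illustrative sizes).
import numpy as np

rng = np.random.default_rng(0)

input_size, hidden_size, output_size = 3, 16, 2
Wxh = rng.normal(scale=0.1, size=(hidden_size, input_size))
Whh = rng.normal(scale=0.1, size=(hidden_size, hidden_size))
Why = rng.normal(scale=0.1, size=(output_size, hidden_size))
bh = np.zeros(hidden_size)
by = np.zeros(output_size)

def rnn_forward(inputs):
    # inputs: list of vectors, one per time step
    h = np.zeros(hidden_size)
    for x in inputs:
        h = np.tanh(Wxh @ x + Whh @ h + bh)   # recurrent state update
    return Why @ h + by                        # readout from the last state

sequence = [rng.normal(size=input_size) for _ in range(5)]
print(rnn_forward(sequence))
```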
Your new best friend built with an artificial neural network - olivia-ai/olivia
This article focuses on using a deep LSTM neural network architecture to provide multidimensional time series forecasting using Keras and TensorFlow - specifically on stock market datasets, to provide momentum indicators of stock price.
The following article sections will briefly touch on LSTM neuron cells, give a toy example of predicting a sine wave, and then walk through the application to a stochastic time series. The article assumes a basic working knowledge of simple deep neural networks.
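A bare-bones version of that kind of setup might look like the following (illustrative window size, layer sizes, and random stand-in data, not the article’s actual model): slide a fixed-length window over a multidimensional series and train a stacked LSTM to predict the next value of one target feature.

```python
# Sketch of windowed multidimensional forecasting with a stacked LSTM in Keras.
import numpy as np
from tensorflow import keras

window, n_features = 50, 4
# random walk standing in for e.g. price/volume columns of a stock series
series = np.random.randn(1000, n_features).cumsum(axis=0)

X = np.stack([series[i:i + window] for i in range(len(series) - window)])
y = series[window:, 0]                      # next-step value of feature 0

model = keras.Sequential([
    keras.Input(shape=(window, n_features)),
    keras.layers.LSTM(64, return_sequences=True),
    keras.layers.LSTM(32),
    keras.layers.Dense(1),
])
model.compile(optimizer="adam", loss="mse")
model.fit(X, y, epochs=2, batch_size=32, verbose=0)

prediction = model.predict(series[-window:][np.newaxis])  # one-step forecast
```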
License plate detection is a common use case that has been solved (somewhat) several times, but we felt that we could provide something better than the current options.
Time Series Forecasting with the Long Short-Term Memory Network in Python - Machine Learning Mastery
The Long Short-Term Memory recurrent neural network has the promise of learning long sequences of observations. It seems a perfect match for time series forecasting, and in fact, it may be. In this tutorial, you will discover how to develop an LSTM forecast model for a one-step univariate time series forecasting problem. After completing this …
Course materials and notes for Stanford class CS231n: Convolutional Neural Networks for Visual Recognition.