Can Quantum Computing help improve our ability to train the Large Neural Networks that encode language models (LLMs)?
What is “training”?
In the lingo of Artificial Intelligence (AI) studies, “training” means optimizing a statistical model, often implemented as a neural network, to make predictions from some input data, guided by a measure of how good those predictions are (the “cost” or “loss” function). There are three main paradigms in which such a procedure can take place: supervised, unsupervised (often autoregressive), and reinforcement learning. In supervised learning, each data point is labelled, so the model’s predictions can be compared directly with the true values (e.g. is this the image of a cat or of a dog?). In unsupervised learning, there are no explicit labels; instead, the comparison is carried out against features extracted from the data itself (e.g. predicting the next word in a sentence). Finally, reinforcement learning optimizes the long-term return of a sequence of decisions (predictions) made as the statistical model interacts with its environment (should the car slow down or speed up at a yellow traffic light?).
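To make the idea of “optimizing a model against a loss function” concrete, here is a minimal sketch in plain Python/NumPy of a single supervised training loop: a tiny linear model, a mean-squared-error loss, and a gradient-descent parameter update. The data, parameter names, and learning rate are purely illustrative assumptions, not taken from any particular framework or from the text above.

```python
import numpy as np

# Toy supervised data: inputs x and labelled targets y (illustrative only).
rng = np.random.default_rng(0)
x = rng.normal(size=(100, 3))                  # 100 data points, 3 features each
true_w = np.array([1.5, -2.0, 0.5])            # "ground truth" used to generate labels
y = x @ true_w + 0.1 * rng.normal(size=100)    # labels with a little noise

# Model parameters to be "trained" (optimized).
w = np.zeros(3)
learning_rate = 0.1

for step in range(200):
    predictions = x @ w                          # model output
    loss = np.mean((predictions - y) ** 2)       # loss: how good are the predictions?
    grad = 2 * x.T @ (predictions - y) / len(y)  # gradient of the loss w.r.t. the parameters
    w -= learning_rate * grad                    # gradient-descent update

print("learned parameters:", w)  # should approach true_w as training proceeds
```

Real neural networks repeat this same predict / score / update cycle, only with millions or billions of parameters instead of three, which is precisely why the process becomes so expensive.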
In all these cases, the optimization of the parameters of the model is a lengthy process which requires a…