Optimizing Transformer Models for Variable-Length Input Sequences
How PyTorch NestedTensors, FlashAttention2, and xFormers can Boost Performance and Reduce AI Costs

Chaim Rand · Published in Towards Data Science · 14 min read

Photo by Tanja Zöllner on Unsplash

As generative AI (genAI) models grow in both popularity and scale, so do the computational demands and costs associated with their training and deployment. Optimizing these models is crucial for enhancing their runtime performance and reducing their operational