Unsupervised LLM Evaluations
Practitioner's guide to judging outputs of large language models

Daniel Kharitonov

<TLDR> Evaluating AI-generated outputs is critical for building robust applications of large language models because it allows complex AI applications to be split into simple stages with built-in error control. It is relatively straightforward to evaluate generative outputs in a supervised mode, where the “right answers” can be