Orca 2: Enhancing Reasoning in Smaller Language Models – BigBench-Hard Subtask Metrics | HackerNoon

Writings, Papers and Blogs on Text Models
May 29, 2024
8:00 pm

Authors:

(1) Arindam Mitra;

(2) Luciano Del Corro, work done while at Microsoft;

(3) Shweti Mahajan, work done while at Microsoft;

(4) Andres Codas, denote equal contributions;

(5) Clarisse Simoes, denote equal contributions;

(6) Sahaj Agarwal;

(7) Xuxi Chen, work done while at Microsoft;;

(8) Anastasia Razdaibiedina, work done while at Microsoft;

(9) Erik Jones, work done while at Microsoft;

(10) Kriti Aggarwal, work done while at Microsoft;

(11) Hamid Palangi;

(12) Guoqing Zheng;

(13) Corby Rosset;

(14) Hamed Khanpour;

(15) Ahmed Awadall.

Table 7, 8, 9, and 10 showcase the zero-shot performance of Orca 2 and the baseline models on each BBH MCQ reasoning task, with accuracy being the metric used to evaluate performance.

THUNDERX3 unveils its FLEX Pro Mesh Chair, new LAB-X Motor Desk at Computex 2024

THUNDERX3 is one of the top 5 gaming chair brands by sales in the world, and at Computex 2024, the company has announced the launch

June 4, 2024

How to Grow Your Career Without Feeling Stuck

The Promotion Playbook Whether you are just starting up or aspiring to make another leap ensure that you are ready and your boss knows that

April 17, 2024

This Foldable Keyboard Is Actually A Mini PC Rocking A Zen 4 Ryzen CPU

Ever since William Gibson’s Neuromancer, thousands or even millions of nerds worldwide have fantasized about the idea of the ‘deck’, or ‘cyberdeck’ as it has

October 7, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.