Orca 2: Enhancing Reasoning in Smaller Language Models – BigBench-Hard Subtask Metrics | HackerNoon

Writings, Papers and Blogs on Text Models
May 29, 2024
8:00 pm

Authors:

(1) Arindam Mitra;

(2) Luciano Del Corro, work done while at Microsoft;

(3) Shweti Mahajan, work done while at Microsoft;

(4) Andres Codas, denote equal contributions;

(5) Clarisse Simoes, denote equal contributions;

(6) Sahaj Agarwal;

(7) Xuxi Chen, work done while at Microsoft;;

(8) Anastasia Razdaibiedina, work done while at Microsoft;

(9) Erik Jones, work done while at Microsoft;

(10) Kriti Aggarwal, work done while at Microsoft;

(11) Hamid Palangi;

(12) Guoqing Zheng;

(13) Corby Rosset;

(14) Hamed Khanpour;

(15) Ahmed Awadall.

Table 7, 8, 9, and 10 showcase the zero-shot performance of Orca 2 and the baseline models on each BBH MCQ reasoning task, with accuracy being the metric used to evaluate performance.

AMD Ryzen 9 9950X overclocked to 6.5GHz on LN2 cooling, breaks CInebench R23 world record

AMD’s new Zen 5-based Ryzen 9 9950X processor is nearly here, with Bilibili tech influencer “Ordinary Uncle Tony” using LN2 cooling and overclocking the 9950X

July 30, 2024

Guild of Guardians Beginner’s Guide: Everything You Need to Know to Start Your Adventure

Guild of Guardians (GOG) is an exciting new mobile game developed by Mineloader and brought to life by Immutable. GOG combines fantasy adventure with blockchain

May 14, 2024

Notable Capital’s Hans Tung on why founders need to play the long game | TechCrunch

Hans Tung, a managing partner at Notable Capital, formerly GGV Capital, has a lot of thoughts on the state of venture capital today. With $4.2 billion

April 30, 2024

Supercharge Your Portfolio with Future Tech Stocks!

Join us for Profitable Insights & Expert Tips!

With expert analysis, comprehensive market coverage, and actionable insights, our newsletter equips you with the knowledge & tools necessary to make informed decisions & maximize your potential returns in the dynamic world of future tech stocks.