Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

GEM benchmark

https://gem-benchmark.com
Activity Feed Request to join this org

AI & ML interests

We develop infrastructure for the evaluation of generated text.

Recent Activity

lewtun  submitted a paper about 1 month ago
Single-minus gluon tree amplitudes are nonzero
lewtun  submitted a paper about 1 month ago
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
gentaiscool  authored a paper 2 months ago
PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues
View all activity

Sebastian Gehrmann's profile pictureAbhik Bhattacharjee's profile pictureLewis Tunstall's profile pictureAlex Wang's profile pictureJenny Chim's profile pictureRatish Puduppully's profile pictureYacine Jernite's profile pictureTosin Adewumi's profile pictureSK Mainul Islam's profile pictureAshish Upadhyay's profile pictureAshish Shrivastava's profile pictureMathias Creutz's profile pictureRabin's profile pictureAshwin Devaraj's profile pictureJurik Juraska's profile pictureQi Zhu's profile pictureMoussa Kamal Eddine's profile pictureNico Daheim's profile pictureTianhao Shen's profile pictureronald cardenas acosta's profile pictureSebastien Montella's profile pictureYufang Hou's profile pictureHiroaki Hayashi's profile pictureVipul Raheja's profile pictureAnna Shvets's profile pictureJenna Kanerva's profile pictureChandra B's profile pictureGenta Indra Winata's profile pictureTahmid Hasan's profile pictureBernd's profile pictureWang's profile pictureBryan Wilie's profile pictureCristina Garbacea's profile pictureLi Zhang's profile pictureMille's profile pictureVikas Raunak's profile pictureNouamane Tazi's profile pictureLeonardo Ribeiro's profile pictureJordan Clive's profile pictureFaisal Ladhak's profile pictureSamarth Agarwal's profile pictureVivian Tsai's profile pictureBingsheng Yao's profile pictureDoan Anh Tien's profile pictureOndrej Dusek's profile pictureAlbert Villanova del Moral's profile pictureDaniel Hershcovich's profile pictureGyan Prakash's profile pictureOwais Ahmad's profile pictureSimon van de Fliert's profile pictureJiaan Wang's profile pictureAbinaya Mahendiran's profile picturePawan Sasanka Ammanamanchi's profile pictureJohn Jackson's profile pictureMastane Achab's profile pictureAmeer Azam's profile pictureGrantley Cullar's profile pictureParaskevi Kivroglou's profile pictureMuhammad Imran Zaman's profile pictureDuong Trong Chi's profile pictureMuhammad Noman Gul's profile pictureRuslan Magana Vsevolodovna's profile pictureVan os's profile pictureEnder's profile pictureMrSomething's profile pictureNymbo's profile pictureToan M. DINH's profile pictureSalman Rasheed's profile pictureRam Kadiyala's profile pictureKaustubh Dholé's profile pictureVinit Tavde's profile pictureHusnain's profile pictureSanshruthR's profile pictureAbdul Samad Siddiqui's profile picturedada's profile pictureDJ Sri Vigneshwar's profile pictureFrank Soboczenski's profile pictureSitam Meur's profile pictureAmmar's profile pictureNguyễn Vũ Dương's profile pictureMinhazul Hasan Sohan's profile pictureHilda Cran May's profile pictureAayan Mishra's profile pictureRadu Butucelea's profile pictureAIMaster7's profile picturewave's profile pictureDan Clipca's profile pictureSubhansh Malviya's profile pictureVincenzo Gallo's profile pictureVORTEX's profile pictureParvesh Rawal's profile pictureAniket Kumar's profile pictureRAY's profile pictureAgblevor's profile pictureMohammad Othman's profile pictureTanmoy Shome's profile pictureNiansuh's profile pictureagsdgdae's profile picture
Organization Card
Community About org cards

Edit this README.md markdown file to author your organization card.

spaces 4

Runtime error
9

DatasetCardForm

👁

Jun 29, 2022
Runtime error
3

Gem Submissions

💎

Jun 23, 2022
Running
3

Gem Results

📊

Display benchmark results in a resizable iframe

Feb 28, 2022

models 0

None public yet

datasets 44

GEM/xlsum

Updated Oct 3, 2024 • 2.39k • 5

GEM/wiki_auto_asset_turk

Viewer • Updated May 29, 2024 • 510k • 348 • 8

GEM/gem

Updated Jan 18, 2024 • 3.19k • 35

GEM/opusparcus

Updated Jan 9, 2024 • 482 • 2

GEM/Augmented_CACAPO_for_E2E

Viewer • Updated Feb 26, 2023 • 47.3k • 20

GEM/CACAPO_E2E

Viewer • Updated Feb 26, 2023 • 20.1k • 160

GEM/Elongated_CACAPO_for_E2E

Updated Feb 26, 2023 • 129

GEM/xwikis

Updated Feb 22, 2023 • 8.82k • 4

GEM/wiki_lingua

Updated Feb 16, 2023 • 456 • 50

GEM/xmediasum

Viewer • Updated Feb 15, 2023 • 40k • 27 • 4
View 44 datasets
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs