Harshit Gupta

hrgupta

AI & ML interests

None yet

Recent Activity

upvoted an article about 23 hours ago
How to generate text: using different decoding methods for language generation with Transformers
liked a model 6 days ago
ibm-granite/granite-4.0-h-350m
liked a model 13 days ago
MiniMaxAI/MiniMax-M2

Organizations

Hugging Face Discord Community

hrgupta's collections: 1

IMP Papers
This collection is a list of papers I find to be very interesting.
  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 625
  • MiniMax-01: Scaling Foundation Models with Lightning Attention

    Paper • 2501.08313 • Published Jan 14 • 301
  • Group Sequence Policy Optimization

    Paper • 2507.18071 • Published Jul 24 • 309
  • Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth

    Paper • 2509.03867 • Published Sep 4 • 209