Abhishek Verma
abskvrm
2 followers · 46 following
AI & ML interests
None yet
Recent Activity
reacted to Javedalam's post with 🚀 about 8 hours ago
GLM-OCR: A Tiny 0.9B-Parameter Model That Punches Far Above Its Weight

Released today by Z.ai, GLM-OCR is a compact vision-language model designed specifically for document understanding. At just 0.9 billion parameters, it belongs to a new generation of lightweight AI systems showing that raw model size is no longer the only path to high performance.

Despite its small footprint, GLM-OCR posts strong results across major document benchmarks. It scores 94.6 on OmniDocBench, 94.0 on OCRBench, and 96.5 on UniMERNet for formula recognition. These numbers place it alongside, and in some cases ahead of, significantly larger specialized OCR models. The takeaway is clear: efficiency is rapidly becoming a defining feature of modern AI design.

Developed by Z.ai, a research group focused on advancing multimodal foundation models, GLM-OCR reflects a broader shift toward highly optimized architectures that deliver serious capability without requiring massive compute resources.

In practical testing, the model ran successfully in Google Colab on an NVIDIA L4 GPU, demonstrating that advanced document AI is no longer restricted to large research clusters. Engineers, researchers, and developers can now deploy high-quality OCR workflows on relatively accessible hardware.

GLM-OCR signals an important trend in artificial intelligence: smaller, purpose-built models are beginning to rival heavyweight systems while being dramatically easier to run. For anyone working with scanned documents, PDFs, or structured text extraction, this release is a strong indicator of where efficient multimodal AI is heading next.

The Google Colab notebook for the model: https://colab.research.google.com/drive/1SiXjxPdb-7UJWhtAjPrMZYqPJhyLk9Rc?usp=sharing

The Hugging Face model page: https://huggingface.co/zai-org/GLM-OCR
reacted to efecelik's post with 👍 23 days ago
Why isn't the ACE-Step model more popular? IMO it makes really good music. https://huggingface.co/ACE-Step/ACE-Step-v1-3.5B
liked a model 23 days ago
baichuan-inc/Baichuan-M3-235B
Organizations
None yet
abskvrm's models
None public yet