Frank Denis PRO
jedisct1
AI & ML interests
None yet
Recent Activity
reacted to ajibawa-2023's post with ๐ about 5 hours ago
C-Code-Large
Dataset: https://huggingface.co/datasets/ajibawa-2023/C-Code-Large
C-Code-Large is a large-scale corpus of C programming language source code comprising more than 4 million code samples stored in .jsonl format. The dataset is designed to support research and development in large language model (LLM) pretraining, static analysis, and software engineering automation for the C ecosystem.
By offering a high-volume, language-focused dataset, C-Code-Large enables targeted experimentation in low-level programming, memory-constrained environments, and performance-critical systems, where C continues to be a dominant language.
C-Code-Large addresses the lack of large, curated, C-specific datasets, making it possible to conduct focused research on procedural programming paradigms, manual memory management, and system-level abstractions.
new activity 1 day ago
z-lab/Qwen3.5-27B-PARO:Bogus template? upvoted an article 8 days ago
Introducing Storage Buckets on the Hugging Face Hub