AI & ML interests

Health x AI

Recent Activity

Articles

MaziyarPanahi 
posted an update 1 day ago
view post
Post
152
Training mRNA Language Models Across 25 Species for $165

We built an end-to-end protein AI pipeline covering structure prediction, sequence design, and codon optimization. After comparing multiple transformer architectures for codon-level language modeling, CodonRoBERTa-large-v2 emerged as the clear winner with a perplexity of 4.10 and a Spearman CAI correlation of 0.40, significantly outperforming ModernBERT. We then scaled to 25 species, trained 4 production models in 55 GPU-hours, and built a species-conditioned system that no other open-source project offers. Complete results, architectural decisions, and runnable code below.

https://huggingface.co/blog/OpenMed/training-mrna-models-25-species
MaziyarPanahi 
published an article 1 day ago
view article
Article

Training mRNA Language Models Across 25 Species for $165

•
6
MaziyarPanahi 
posted an update 8 days ago
view post
Post
2104
We annotated 119K medical images with two frontier VLMs (Qwen 3.5, Kimi K2.5), cross-validated at 93% agreement, and produced 110K training records, all for under $500. Fine-tuning 3 small models (2-3B params) improved all benchmarks: best model reaches +15.0% average exact match.

Everything is open-sourced: datasets, adapters, and code.

https://huggingface.co/blog/OpenMed/synthvision
  • 2 replies
·
MaziyarPanahi 
published an article 9 days ago
view article
Article

SynthVision: Building a 110K Synthetic Medical VQA Dataset with Cross-Model Validation

•
15