Collections
Collections including paper arxiv:2403.09029
-
MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
Paper • 2403.09611 • Published • 129 -
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 56 -
GiT: Towards Generalist Vision Transformer through Universal Language Interface
Paper • 2403.09394 • Published • 28
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 56 -
Cleaner Pretraining Corpus Curation with Neural Web Scraping
Paper • 2402.14652 • Published -
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
Paper • 2403.11703 • Published • 17
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 56 -
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 5.06k • 360 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • 8B • Updated • 304 • 186 -
laion/filtered-wit
Viewer • Updated • 2.8M • 5.17k • 10
-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 81 -
bigcode/starcoder2-15b
Text Generation • 16B • Updated • 8.67k • 627 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 122 -
mixedbread-ai/mxbai-rerank-large-v1
Text Ranking • 0.4B • Updated • 27.8k • 130
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 56 -
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
Paper • 2403.12968 • Published • 26 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 73 -
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
Paper • 2403.09629 • Published • 79
-
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 5.06k • 360 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • 8B • Updated • 304 • 186 -
Screenshot to HTML
⚡ Convert screenshots to HTML code • 896
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 56