Papers Pertinent or Protuberant - an AlekseyCalvin Collection

updated Sep 15

The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in Text-to-Image Models

Paper • 2507.23313 • Published Jul 31 • 1
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering

Paper • 2508.03448 • Published Aug 5 • 4
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with Learnable Advisor

Paper • 2508.01311 • Published Aug 2 • 2
Normalized Attention Guidance: Universal Negative Guidance for Diffusion Model

Paper • 2505.21179 • Published May 27 • 13
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt

Paper • 2505.09264 • Published May 14 • 5
How to Reduce Change Detection to Semantic Segmentation

Paper • 2206.07557 • Published Jun 15, 2022
Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly Detection

Paper • 2504.14221 • Published Apr 19
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

Paper • 2505.09926 • Published May 15 • 6
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning

Paper • 2505.09265 • Published May 14 • 4
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with Verifiable Rewards

Paper • 2508.04632 • Published Aug 6 • 2
Reasoning Language Models for Root Cause Analysis in 5G Wireless Networks

Paper • 2507.21974 • Published Jul 29 • 4
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding

Paper • 2508.01197 • Published Aug 2 • 4
Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Paper • 2508.04664 • Published Aug 6 • 13
IAUNet: Instance-Aware U-Net

Paper • 2508.01928 • Published Aug 3 • 8
Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference

Paper • 2508.04586 • Published Aug 6 • 12
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding

Paper • 2508.02215 • Published Aug 4 • 12
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Paper • 2507.23785 • Published Jul 31 • 18
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought

Paper • 2508.03560 • Published Aug 5 • 24
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents

Paper • 2508.01858 • Published Aug 3 • 20
Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 70
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens

Paper • 2508.01191 • Published Aug 2 • 236
Attention Basin: Why Contextual Position Matters in Large Language Models

Paper • 2508.05128 • Published Aug 7 • 4
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode

Paper • 2508.04107 • Published Aug 6 • 4
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during Multi-Hop Analysis

Paper • 2508.04699 • Published Aug 6 • 2
RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation

Paper • 2508.04190 • Published Aug 6 • 1
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating Linguistic Shibboleth Detection in LLM Hiring Evaluations

Paper • 2508.04939 • Published Aug 6 • 2
REINA: Regularized Entropy Information-Based Loss for Efficient Simultaneous Speech Translation

Paper • 2508.04946 • Published Aug 7 • 1
I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal Entity Linking

Paper • 2508.02243 • Published Aug 4 • 2
Learning to Reason for Factuality

Paper • 2508.05618 • Published Aug 7 • 6
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression

Paper • 2508.04979 • Published Aug 7 • 5
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance

Paper • 2508.01650 • Published Aug 3 • 6
MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

Paper • 2508.05630 • Published Aug 7 • 9
Can Large Multimodal Models Actively Recognize Faulty Inputs? A Systematic Evaluation Framework of Their Input Scrutiny Ability

Paper • 2508.04017 • Published Aug 6 • 11
Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?

Paper • 2508.03644 • Published Aug 5 • 25
A Practical Guide to Fine-tuning Language Models with Limited Data

Paper • 2411.09539 • Published Nov 14, 2024
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks

Paper • 2412.12499 • Published Dec 17, 2024 • 1
Development of Pre-Trained Transformer-based Models for the Nepali Language

Paper • 2411.15734 • Published Nov 24, 2024
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation

Paper • 2412.13375 • Published Dec 17, 2024
Facilitating large language model Russian adaptation with Learned Embedding Propagation

Paper • 2412.21140 • Published Dec 30, 2024 • 18
BayLing 2: A Multilingual Large Language Model with Efficient Language Alignment

Paper • 2411.16300 • Published Nov 25, 2024