AlekseyCalvin's Collections
Papers Pertinent or Protuberant
The Cow of Rembrandt - Analyzing Artistic Prompt Interpretation in
Text-to-Image Models
Paper
•
2507.23313
•
Published
•
1
SonicMaster: Towards Controllable All-in-One Music Restoration and
Mastering
Paper
•
2508.03448
•
Published
•
4
C3D-AD: Toward Continual 3D Anomaly Detection via Kernel Attention with
Learnable Advisor
Paper
•
2508.01311
•
Published
•
2
Normalized Attention Guidance: Universal Negative Guidance for Diffusion
Model
Paper
•
2505.21179
•
Published
•
13
Learning to Detect Multi-class Anomalies with Just One Normal Image
Prompt
Paper
•
2505.09264
•
Published
•
5
How to Reduce Change Detection to Semantic Segmentation
Paper
•
2206.07557
•
Published
Real-IAD D3: A Real-World 2D/Pseudo-3D/3D Dataset for Industrial Anomaly
Detection
Paper
•
2504.14221
•
Published
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
Paper
•
2505.09926
•
Published
•
6
MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning
Paper
•
2505.09265
•
Published
•
4
IFDECORATOR: Wrapping Instruction Following Reinforcement Learning with
Verifiable Rewards
Paper
•
2508.04632
•
Published
•
2
Reasoning Language Models for Root Cause Analysis in 5G Wireless
Networks
Paper
•
2507.21974
•
Published
•
4
A Coarse-to-Fine Approach to Multi-Modality 3D Occupancy Grounding
Paper
•
2508.01197
•
Published
•
4
Sculptor: Empowering LLMs with Cognitive Agency via Active Context
Management
Paper
•
2508.04664
•
Published
•
13
IAUNet: Instance-Aware U-Net
Paper
•
2508.01928
•
Published
•
8
Position: The Current AI Conference Model is Unsustainable! Diagnosing
the Crisis of Centralized AI Conference
Paper
•
2508.04586
•
Published
•
12
LeanK: Learnable K Cache Channel Pruning for Efficient Decoding
Paper
•
2508.02215
•
Published
•
12
Gaussian Variation Field Diffusion for High-fidelity Video-to-4D
Synthesis
Paper
•
2507.23785
•
Published
•
18
LaTCoder: Converting Webpage Design to Code with Layout-as-Thought
Paper
•
2508.03560
•
Published
•
24
Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web
Agents
Paper
•
2508.01858
•
Published
•
20
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper
•
2508.03680
•
Published
•
70
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Paper
•
2508.01191
•
Published
•
236
Attention Basin: Why Contextual Position Matters in Large Language
Models
Paper
•
2508.05128
•
Published
•
4
Unlocking the Potential of MLLMs in Referring Expression Segmentation
via a Light-weight Mask Decoder
Paper
•
2508.04107
•
Published
•
4
Hop, Skip, and Overthink: Diagnosing Why Reasoning Models Fumble during
Multi-Hop Analysis
Paper
•
2508.04699
•
Published
•
2
RPCANet++: Deep Interpretable Robust PCA for Sparse Object Segmentation
Paper
•
2508.04190
•
Published
•
1
I Think, Therefore I Am Under-Qualified? A Benchmark for Evaluating
Linguistic Shibboleth Detection in LLM Hiring Evaluations
Paper
•
2508.04939
•
Published
•
2
REINA: Regularized Entropy Information-Based Loss for Efficient
Simultaneous Speech Translation
Paper
•
2508.04946
•
Published
•
1
I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal
Entity Linking
Paper
•
2508.02243
•
Published
•
2
Learning to Reason for Factuality
Paper
•
2508.05618
•
Published
•
6
Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast
Image Compression
Paper
•
2508.04979
•
Published
•
5
StrandDesigner: Towards Practical Strand Generation with Sketch Guidance
Paper
•
2508.01650
•
Published
•
6
MOSEv2: A More Challenging Dataset for Video Object Segmentation in
Complex Scenes
Paper
•
2508.05630
•
Published
•
9
Can Large Multimodal Models Actively Recognize Faulty Inputs? A
Systematic Evaluation Framework of Their Input Scrutiny Ability
Paper
•
2508.04017
•
Published
•
11
Are We on the Right Way for Assessing Document Retrieval-Augmented
Generation?
Paper
•
2508.03644
•
Published
•
25
A Practical Guide to Fine-tuning Language Models with Limited Data
Paper
•
2411.09539
•
Published
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for
Low-Resource Language Tasks
Paper
•
2412.12499
•
Published
•
1
Development of Pre-Trained Transformer-based Models for the Nepali
Language
Paper
•
2411.15734
•
Published
Extending LLMs to New Languages: A Case Study of Llama and Persian
Adaptation
Paper
•
2412.13375
•
Published
Facilitating large language model Russian adaptation with Learned
Embedding Propagation
Paper
•
2412.21140
•
Published
•
18
BayLing 2: A Multilingual Large Language Model with Efficient Language
Alignment
Paper
•
2411.16300
•
Published