RynnVLA-002: A Unified Vision-Language-Action and World Model • arXiv:2511.17502 • Nov 2025 • 23 upvotes
MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent • arXiv:2502.03207 • Feb 5, 2025
In-Context Learning with Unpaired Clips for Instruction-based Video Editing • arXiv:2510.14648 • Oct 16, 2025
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources • arXiv:2509.21268 • Sep 25, 2025 • 101 upvotes
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation • arXiv:2509.15212 • Sep 18, 2025 • 21 upvotes
VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning? • arXiv:2505.23359 • May 29, 2025 • 39 upvotes
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding • arXiv:2501.13106 • Jan 22, 2025 • 90 upvotes
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare • arXiv:2405.19298 • May 29, 2024
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation • arXiv:2411.13281 • Nov 20, 2024 • 21 upvotes
Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models • arXiv:2411.00492 • Nov 1, 2024 • 6 upvotes
Aria: An Open Multimodal Native Mixture-of-Experts Model • arXiv:2410.05993 • Oct 8, 2024 • 111 upvotes
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source Separation • arXiv:2408.03588 • Aug 7, 2024 • 8 upvotes
Latte: Cross-framework Python Package for Evaluation of Latent-Based Generative Models • arXiv:2112.10638 • Dec 20, 2021
ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes • arXiv:2207.01078 • Jul 3, 2022
LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding • arXiv:2407.15754 • Jul 22, 2024 • 20 upvotes
Automating Urban Soundscape Enhancements with AI: In-situ Assessment of Quality and Restorativeness in Traffic-Exposed Residential Areas • arXiv:2407.05744 • Jul 8, 2024