Collections
Discover the best community collections!
Collections including paper arxiv:2510.02209
-
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 49 -
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading
Paper • 2509.05080 • Published -
TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis
Paper • 2508.17565 • Published -
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
Paper • 2508.20467 • Published
-
A Tale of Tails: Model Collapse as a Change of Scaling Laws
Paper • 2402.07043 • Published • 16 -
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Paper • 2509.19284 • Published • 22 -
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System
Paper • 2509.18091 • Published • 33 -
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM
Paper • 2509.18058 • Published • 12
-
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 49 -
'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization
Paper • 2408.03762 • Published • 1 -
FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models
Paper • 2501.18062 • Published • 3 -
Baichuan4-Finance Technical Report
Paper • 2412.15270 • Published • 3
-
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 49 -
MM-DREX: Multimodal-Driven Dynamic Routing of LLM Experts for Financial Trading
Paper • 2509.05080 • Published -
TradingGroup: A Multi-Agent Trading System with Self-Reflection and Data-Synthesis
Paper • 2508.17565 • Published -
QTMRL: An Agent for Quantitative Trading Decision-Making Based on Multi-Indicator Guided Reinforcement Learning
Paper • 2508.20467 • Published
-
A Tale of Tails: Model Collapse as a Change of Scaling Laws
Paper • 2402.07043 • Published • 16 -
What Characterizes Effective Reasoning? Revisiting Length, Review, and Structure of CoT
Paper • 2509.19284 • Published • 22 -
OnePiece: Bringing Context Engineering and Reasoning to Industrial Cascade Ranking System
Paper • 2509.18091 • Published • 33 -
Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLM
Paper • 2509.18058 • Published • 12
-
StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
Paper • 2510.02209 • Published • 49 -
'Finance Wizard' at the FinLLM Challenge Task: Financial Text Summarization
Paper • 2408.03762 • Published • 1 -
FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models
Paper • 2501.18062 • Published • 3 -
Baichuan4-Finance Technical Report
Paper • 2412.15270 • Published • 3