Zhi Yang

yangzhi1 · tobi0520

AI & ML interests

None yet

Recent Activity

submitted a paper to Daily Papers 5 days ago

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Paper • 2601.07853 • Published 18 days ago • 9

upvoted a paper 11 days ago

Controlled Self-Evolution for Algorithmic Code Optimization

Paper • 2601.07348 • Published 15 days ago • 112

authored 2 papers 11 days ago

BizFinBench.v2: A Unified Dual-Mode Bilingual Benchmark for Expert-Level Financial Capability Alignment

Paper • 2601.06401 • Published 17 days ago • 10

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

Paper • 2308.09975 • Published Aug 19, 2023

authored 2 papers 12 days ago

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Paper • 2601.07853 • Published 18 days ago • 9

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Paper • 2601.09465 • Published 13 days ago • 40

upvoted 2 papers 12 days ago

KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

Paper • 2601.04745 • Published 19 days ago • 56

MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences

Paper • 2601.06789 • Published 16 days ago • 77

upvoted a paper 13 days ago

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Paper • 2601.07853 • Published 18 days ago • 9

upvoted a paper 14 days ago

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published 16 days ago • 207

liked a model over 2 years ago

IDEA-CCNL/Ziya-LLaMA-13B-v1

Text Generation • Updated Sep 13, 2023 • 1.08k • 276