arxiv:2510.04550
Pengfei He
bigboss24
AI & ML interests
Trustworthy
Recent Activity
authored a paper about 2 months ago
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool
Use upvoted a paper about 2 months ago
TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool
Use upvoted a paper about 2 months ago
Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM AgentsOrganizations
None yet