1 17 1

Sukmin Cho

zomss

AI & ML interests

None yet

Recent Activity

upvoted a paper 13 days ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

upvoted a paper 19 days ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

upvoted a paper 26 days ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

View all activity

Organizations

None yet

upvoted a paper 13 days ago

Budget-Aware Tool-Use Enables Effective Agent Scaling

Paper • 2511.17006 • Published 17 days ago • 25

upvoted a paper 19 days ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published 27 days ago • 104

upvoted a paper 26 days ago

Adaptive Multi-Agent Response Refinement in Conversational Systems

Paper • 2511.08319 • Published 27 days ago • 40

upvoted a paper 27 days ago

LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs

Paper • 2511.06174 • Published 30 days ago • 5

upvoted 2 papers about 2 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10 • 81

When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs

Paper • 2510.07499 • Published Oct 8 • 48

upvoted a paper 2 months ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3 • 97

upvoted a paper 3 months ago

EpiCache: Episodic KV Cache Management for Long Conversational Question Answering

Paper • 2509.17396 • Published Sep 22 • 19

upvoted a paper 4 months ago

DINOv3

Paper • 2508.10104 • Published Aug 13 • 285

upvoted a paper 7 months ago

System Prompt Optimization with Meta-Learning

Paper • 2505.09666 • Published May 14 • 71

upvoted a paper 8 months ago

Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning

Paper • 2504.17192 • Published Apr 24 • 120

upvoted 6 papers 10 months ago

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Paper • 2502.13965 • Published Feb 19 • 19

Lossless Acceleration of Large Language Models with Hierarchical Drafting based on Temporal Locality in Speculative Decoding

Paper • 2502.05609 • Published Feb 8 • 19

Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations

Paper • 2404.13948 • Published Apr 22, 2024 • 2

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10 • 75

Revisiting In-Context Learning with Long Context Language Models

Paper • 2412.16926 • Published Dec 22, 2024 • 32

Knowledge-Augmented Large Language Models for Personalized Contextual Query Suggestion

Paper • 2311.06318 • Published Nov 10, 2023 • 3

Sukmin Cho

AI & ML interests

Recent Activity

Organizations

zomss's activity