ChuGyouk's picture

Building on HF

ChuGyouk PRO

ChuGyouk

·

https://gyoukchu.vercel.app/

AI & ML interests

LLM(LMM) RL & Medical AI

Recent Activity

liked a dataset 3 days ago

TeichAI/claude-4.5-opus-high-reasoning-250x

liked a model 6 days ago

shb777/Llama-3.3-8B-Instruct-128K

liked a model 6 days ago

allura-forge/Llama-3.3-8B-Instruct

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 49

upvoted a paper about 2 months ago

Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset

Paper • 2511.15186 • Published Nov 19, 2025 • 25

upvoted 4 papers 3 months ago

When to Ensemble: Identifying Token-Level Points for Stable and Fast LLM Ensembling

Paper • 2510.15346 • Published Oct 17, 2025 • 33

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

Paper • 2510.03259 • Published Sep 26, 2025 • 57

No Prompt Left Behind: Exploiting Zero-Variance Prompts in LLM Reinforcement Learning via Entropy-Guided Advantage Shaping

Paper • 2509.21880 • Published Sep 26, 2025 • 52

ReviewScore: Misinformed Peer Review Detection with Large Language Models

Paper • 2509.21679 • Published Sep 25, 2025 • 63

upvoted a collection 3 months ago

Qwen3Guard

7 items • Updated 5 days ago • 60

upvoted a collection 4 months ago

EAGLE3

The collection of eagle3 series models for Qwen3 and Hunyuan. • 9 items • Updated Nov 3, 2025 • 2

upvoted a paper 4 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

upvoted a collection 5 months ago

Korean Math Dataset

한국어 수학 데이터 • 16 items • Updated 15 days ago • 13

upvoted a collection 6 months ago

EXAONE-4.0

EXAONE unified model series of 1.2B and 32B, integrating non-reasoning and reasoning modes. • 20 items • Updated Jul 29, 2025 • 52

upvoted a paper 6 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9, 2025 • 23

upvoted a collection 7 months ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13, 2025 • 180

upvoted a paper 7 months ago

Reasoning Model is Stubborn: Diagnosing Instruction Overriding in Reasoning Models

Paper • 2505.17225 • Published May 22, 2025 • 64

upvoted 2 collections 8 months ago

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated Jul 11, 2025 • 369

Qwen3

84 items • Updated 5 days ago • 1.54k

upvoted a paper 9 months ago

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21, 2025 • 37

upvoted an article 9 months ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face

+5

Apr 5, 2025

•

146

upvoted a paper 9 months ago

DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

Paper • 2503.07067 • Published Mar 10, 2025 • 31

upvoted an article 10 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

267