In a Training Loop 🔄

Manuel Romero PRO

mrm8488

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

liked a model 2 days ago

manoskary/musicbert-large

Fill-Mask • 0.3B • Updated Oct 13, 2025 • 250 • 6

upvoted 2 papers 4 days ago

Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context

Paper • 2603.15653 • Published 18 days ago • 11

Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation

Paper • 2603.19220 • Published 5 days ago • 56

upvoted an article 7 days ago

LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric

Article • 8 days ago • 14

liked a dataset 8 days ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9, 2025 • 1.2M • 10.4k • 215

upvoted a paper 12 days ago

Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

Paper • 2603.09117 • Published 15 days ago • 9

liked a model 12 days ago

principled-intelligence/Qwen3.5-2B-text-only

Text Generation • 2B • Updated 13 days ago • 263 • 5

upvoted a collection 12 days ago

Qwen3.5-text-only

Qwen3.5-text-only • 4 items • Updated 12 days ago • 11

upvoted an article 14 days ago

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

Article • Feb 19 • 19

liked a model 14 days ago

UKPLab/GritHopper-7B

Sentence Similarity • Updated Feb 2 • 24 • 7

liked 4 datasets 17 days ago

google-research-datasets/paws-x

Viewer • Updated Jan 4, 2024 • 374k • 6.24k • 50

dennlinger/eur-lex-sum

Updated Sep 11, 2024 • 1.16k • 47

HuggingFaceFW/finepdfs-edu

Viewer • Updated Nov 11, 2025 • 49.5M • 7.07k • 84

PleIAs/common_corpus

Viewer • Updated Feb 19 • 69.9k • 183k • 387

liked 2 datasets 25 days ago

nvidia/Nemotron-Terminal-Synthetic-Tasks

Updated 29 days ago • 486 • 15

nvidia/Nemotron-Terminal-Corpus

Viewer • Updated 25 days ago • 366k • 3.12k • 101

upvoted a paper 25 days ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published Feb 11 • 22

liked a dataset 29 days ago

Metacreation/GigaMIDI

Viewer • Updated Feb 6 • 3.44M • 800 • 35

upvoted a collection about 1 month ago

GPT 5 Codex

Distilled models and datasets for GPT 5 Codex • 7 items • Updated Dec 20, 2025 • 5

liked a dataset about 1 month ago

TeichAI/claude-4.5-opus-high-reasoning-250x

Viewer • Updated Nov 28, 2025 • 250 • 3.14k • 343