Building on HF
42.5 TFLOPS
Tyler Williams
PRO
unmodeled-tyler
113 followers · 46 following
https://quantaintellect.com
unmodeledtyler
unmodeled-tyler
unmodeledtyler
AI & ML interests
AI research engineer & solo operator of VANTA Research/Quanta Intellect
Recent Activity
Updated a collection about 21 hours ago: Recommended Papers
Upvoted a paper about 21 hours ago: General Intelligence Requires Rethinking Exploration
Reacted to salma-remyx's post with 🔥 about 22 hours ago:
The space of possible improvements for your AI model is large, while evaluation is costly. So I was excited to discover the ICML 2026 paper from Kobalczyk, Lin, Letham, Zhao, Balandat, and Bakshy titled "LILO: Bayesian Optimization with Natural Language Feedback." The method learns efficiently from expert preferences, balancing exploration and exploitation in a principled way with Bayesian Optimization for expensive-to-evaluate black-box objectives.

Experimenting with the technique, I trained a Gaussian Process proxy model on the implicit preferences in my code repo's commit history at VQASynth. The result: I used the model's preference scores to re-rank candidate papers recommended based on my interests in spatial reasoning and multimodal data synthesis.

Semantic relevance is a high-recall method for finding arXiv papers personalized to your interests. Adding contributor preferences, extracted from the merge history of your code, offers a high-precision filter.

So what's next? I'm using the model to synthesize a larger volume of preference data to finetune an open-weight coding model with DPO and LoRA.

Tuning Coding Agents via Implicit Preference Distillation
arXiv: https://arxiv.org/pdf/2510.17671
Substack: https://remyxai.substack.com/p/lilo-and-myx
VQASynth: https://github.com/remyxai/VQASynth
unmodeled-tyler's datasets (2)
unmodeled-tyler/DoW-UFO-UAP-1 • Viewer • Updated 5 days ago • 5.7k • 1.06k • 4
unmodeled-tyler/vessel-browser-tool-loop • Viewer • Updated Mar 21 • 1 • 29