Hugging Face

Lokendra Bairwa

lokendra-drizz
AI & ML interests

None yet

Organizations

None yet

upvoted a paper 3 months ago

SeeingEye: Agentic Information Flow Unlocks Multimodal Reasoning In Text-only LLMs

Paper • 2510.25092 • Published Oct 29, 2025 • 8
upvoted 5 papers 5 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17, 2025 • 46

Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models

Paper • 2508.15202 • Published Aug 21, 2025 • 5

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22, 2025 • 160

CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning

Paper • 2508.15868 • Published Aug 21, 2025 • 3

StepWiser: Stepwise Generative Judges for Wiser Reasoning

Paper • 2508.19229 • Published Aug 26, 2025 • 20
upvoted 2 papers 6 months ago

Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning

Paper • 2507.22565 • Published Jul 30, 2025 • 9

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning

Paper • 2507.19457 • Published Jul 25, 2025 • 30
upvoted a paper 8 months ago

Orthogonal Finetuning Made Scalable

Paper • 2506.19847 • Published Jun 24, 2025 • 11